Conclusion#
The conclusion is stated upfront.
First, not using microLib is to ensure the controllability of the source code behavior, preventing the source lib from being altered by vendor mischief, while also enabling the source code to have cross-platform capabilities, avoiding behavioral differences due to different C standard libraries.
From this, it can be inferred that if microLib is not used, the original libc will be introduced, which will lead to many interaction functions in the C source library generating Semihosting code, causing the program to run sluggishly under debug and unable to run at all outside of debug.
In summary, if microLib is not used, the use of C standard library functions must be very cautious to avoid system interaction functions generating Semihosting code.
The Ancestor's Law Cannot Be Changed#
Since I first entered the industry, my boss has firmly prohibited the use of printf series functions, or simply banned all libc functions, including malloc, memset, strcpy, etc., or stated that the standard C library should only introduce stdint.h, stdlib.h, stdbool.h. This has led to a massive amount of self-implemented string operations, buffer operations, trace logs, etc. If malloc can be justified as a way to avoid thread stack overflow, static memory can control the scale of bugs, then why are other implementations not allowed?
In summary, the ancestor's law cannot be changed, and it remains so in current projects.
Clue#
While looking at this guy's introduction to bootloader vulnerabilities (which is actually also a stack operation issue), I came across the following article. It might explain the origin of this ancestor's law.
Verification#
Some of the code is as follows. clock(); is a typical piece of code that generates system interaction. In this project, microLib has been removed.
#include <stdio.h>
#include <time.h>
int main(void)
{
// Omitted some initialization code
while (1)
{
clock_t tTime = clock();
HAL_GPIO_TogglePin(GPIOH, GPIO_PIN_5);
HAL_Delay(500);
}
}
Running under debug
First, here is the disassembly related to the clock() function, which jumps to address 0x0800037C to execute the clock() function.
109: clock_t tTime = clock();
0x08002386 F7FDFFF9 BL.W 0x0800037C clock
0x0800238A 9001 STR r0,[sp,#0x04]
0x0800238C F6414000 MOVW r0,#0x1C00
0x08002390 F6C50002 MOVT r0,#0x5802
0x08002394 2120 MOVS r1,#0x20
The clock() function is where the BKPT soft breakpoint appears.
Here, BKPT is the Cortex-M Break Point (software breakpoint) instruction, and the constant 0xAB is a special code for Semihosting. If the debugging tool used happens to support Semihosting, the software may not even stop and run normally. But if power is lost and restarted, it becomes problematic; “The BKPT instruction executed in non-debug mode will directly cause the Cortex-M processor to enter Hardfault.”
0x0800037C 2100 MOVS r1,#0x00
0x0800037E 2010 MOVS r0,#0x10
0x08000380 BEAB BKPT 0xAB ; will stop at this line, this is the soft breakpoint of Semihosting code
0x08000382 4905 LDR r1,[pc,#20] ; @0x08000398
0x08000384 6809 LDR r1,[r1,#0x00]
0x08000386 1A40 SUBS r0,r0,r1
Can Optimization Avoid This?#
In fact, modern compilers are already very intelligent, and enabling even a little optimization can fix many bugs that you may not have noticed.
When compiled with O3 optimization, BKPT still inevitably appears; this libc should be linked in binary form into the burned file.
Which Functions to Avoid?#
The reference article mentions the following, excerpted here.
1. Standard Input/Output (Standard I/O)#
- printf series functions: such as printf, fprintf, sprintf, etc., used for formatted output to standard output devices (usually the host console).
- scanf series functions: such as scanf, fscanf, sscanf, etc., used for formatted input from standard input devices (usually the host keyboard input).
2. File Operations#
- fopen: open a file.
- fclose: close a file.
- fread: read data from a file.
- fwrite: write data to a file.
- fseek: move the file pointer to a specified position.
- ftell: get the current position of the file pointer.
- fflush: flush the file output buffer.
3. Time and Date#
- time: get the current time.
- clock: get processor time.
- difftime: calculate the time difference between two time points.
- strftime: format time and date as a string.
4. Error Handling#
- perror: output error information to the standard error device.
- strerror: return the error message string corresponding to the error code.
5. System Calls#
- exit: terminate the program and return a status code.
- system: execute a system command (rarely used in embedded systems, but may be useful when debugging on the host).
6. Other Auxiliary Functions#
- getenv: get the value of an environment variable.
- putenv: set an environment variable (rarely used).
- remove: delete a file.
- rename: rename a file.
The Original Article Is Very Interesting, Strongly Recommended to Watch#
Clearly, we fit the characteristics outlined in feature 4, and we are even more extreme.
【The Latency and Causes of "Embedded Appendicitis"】
General MacArthur once commented: Seeing a doctor at a certain place, cancer starts. You pseudo-expert, making Semihosting sound so terrifying, “even the compiler is default implanted,” how come I’m still living well? How come I’ve never encountered it?
To put it bluntly, you might fit the following characteristics:
- Most of the time using Arm Compiler 5;
- Most of the time defaulting to use MicroLib;
- Encountering the phenomenon of “everything works fine in debug mode, but the program crashes when run directly” when not selecting MicroLib under Arm Compiler 6—thus silently noting in a notebook that only MicroLib can be used;
- Never using libc functions other than malloc, even including printf;
- Using program templates created by experts;
- Application development based on example projects provided by chip manufacturers;
- Using software platforms like RT-Thread that provide “one-stop services.”
Don’t be fooled by the many points I listed; they actually fall into two categories:
- A blind cat accidentally catching a rat—good luck.
- Someone else carrying the burden for you.
This article is synchronized and updated by Mix Space to xLog. The original link is https://www.yono233.cn/posts/shoot/24_8_13_Semihosting