> On another note: C++ compilers should by default keep statistics about the chain of #include's / parsing during compilation, dump them to a file at the end, and also summarize how badly you're re-parsing the same .h files during the build.
Clang does offer something very close to this, and if you use it you'll find that parsing the same/duplicate header files contributes on the order of microseconds to the overall compile time. Your article is from 1989, which is likely before compilers implemented duplicate header file elimination [1], but nowadays all C++ compilers optimize the common #ifndef/#define header guard as well as #pragma once, so they avoid re-reading the same file over and over again.
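For reference, these are the two idioms compilers recognize (a minimal sketch; the guard macro name is made up):

    // foo.h -- classic include guard; once MYLIB_FOO_H is defined,
    // compilers skip re-reading the file on later #include's
    #ifndef MYLIB_FOO_H
    #define MYLIB_FOO_H

    int foo(void);

    #endif // MYLIB_FOO_H

    // bar.h -- #pragma once achieves the same effect without a macro
    #pragma once

    int bar(void);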
Correct me if I'm wrong, but I believe the parent comment's strategy of #including header files only within .c files merely reduces duplicate header parsing within each compilation unit. So it wouldn't do anything to improve the case you mention (duplicate header compilation across compilation units), while adding considerable overhead in manually tracking header file dependencies.
Also, given your experience with compilers, I'm keen to hear whether you agree: since modern compilers optimize away re-scanning of the same header file within a compilation unit anyway (in the presence of include guards), the strategy of #including header files only within .c files is close to useless.
Not rescanning when the #include guards are there goes back to the mid-1980s. It's not a modern feature :-)
> the strategy of only #including header files within .c files is close to useless
It probably is. It also means the user of the .h file has to manage that file's dependencies, which is not best practice: .h files should be self-contained.
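To illustrate what self-contained means here, a minimal sketch (the names are hypothetical): the header #includes everything its own declarations depend on, so any .c file can include it first, alone, in any order.

    // point_list.h -- a self-contained header: it includes <stddef.h>
    // itself because its declarations use size_t, instead of requiring
    // every .c file that includes it to remember to do so
    #ifndef POINT_LIST_H
    #define POINT_LIST_H

    #include <stddef.h>  // for size_t

    typedef struct { double x, y; } Point;

    size_t point_list_count(const Point *points, size_t capacity);

    #endif // POINT_LIST_H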
GCC already optimized the handling of headers with include guards 30 years ago.
If you think most of your compile time is spent in preprocessing, benchmark a clean build at -O0 against a clean build at your usual optimization level, like -O2 or whatever you are using.
Both builds perform preprocessing, so the preprocessing time is bounded by the total time of the -O0 build: even if semantic analysis and code generation at -O0 took next to no time at all, the whole -O0 time would have to be attributed to tokenizing and preprocessing. And even then, all the additional time observed under -O2 is spent on something other than tokenizing and preprocessing.
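A rough way to run that comparison, assuming a conventional Make-based project (the targets and flags here are illustrative):

    # clean build at -O0: an upper bound on preprocessing + parsing time
    make clean && time make CFLAGS=-O0
    # clean build at full optimization: the extra time is optimization,
    # not preprocessing
    make clean && time make CFLAGS=-O2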
[1] https://gcc.gnu.org/onlinedocs/cpp/Once-Only-Headers.html