https://docs.fileformat.com/compression/tar/
I don't understand how this format was ever thought to be a good idea
* filename size is exactly 100 bytes
* file size is octal at 8^12 (correction 8^11, 1 byte is /0 so 8 GiB). Why octal and not binary?
* there is extra "end of file" markers of 1024KiB of zeros
* contains NUMERIC owner/group IDs... on a serialized data format meant to be sent between computers (who's IDs other than root might not agree right?)
Why the extra EoF? Is this out of concern for data corruption? If so, why not a simple CRC checker for both detection and recovery?
The whole thing seems poorly designed yet is ubiquitous in Linux. I've used it for more than a decade without ever asking about it's protocol.
Some alternatives off the top of my head:
* remove EoF waste
* 2byte filename lengths, allowing file NAMES up to 64KiB and removing wasted bytes of the 100 bytes which are unused.
* add CRC checksum
* NAMED owner/group, permitting some kind of cross-platform usage. Or just remove this feature entirely (preferable IMO)
* Don't use octal for file size and get nearly infinite file sizes.