I have deleted A lot of them just as a result of encoding all data files to UTF-8 with no bom after which you can checking if the filesize is identical. But definitely if a person places an ad in there, the filesize differs...These are definitely terrific sources to put via LLM and translate to English. I have witnessed DeepL pointed out quite a bi