aparser.log забивает вот этим - размер файла растет очень быстро Спойлер: log file Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Dec 20 16:07:44.37019 GC takes 2.56165552139282 Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805, <$__ANONIO__> line 1360. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429, <$__ANONIO__> line 1360. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429. Use of uninitialized value in split at build/core.to_build.pl line 19805. Use of uninitialized value in subroutine entry at build/core.to_build.pl line 19429.
Хорошо было бы приложить настройки задания, правила оформления задач https://a-parser.com/threads/2450/
А с нагрузкой на центральный процессор нет проблем? У меня x64 версия, но последняя, пресет заказной, парсинг идет через парсер - HTML::TextExtractor и постоянно нагрузка на ЦП в среднем 90-95%, даже если снижать потоки. В логе aparser.log, такое же, как у вас выше.
Иногда вместо ссылок попадаются картинки в виде x-raw-image, т.е. встроенные прямо в html, там нет доменов, и в лог попадала ошибка - это исправлено. На сам парсинг это никак не влияет