2025-09-25 20:16:45 [scrapy.utils.log] INFO: Scrapy 2.11.1 started (bot: news_scraper) 2025-09-25 20:16:45 [scrapy.utils.log] INFO: Versions: lxml 6.0.0.0, libxml2 2.14.4, cssselect 1.3.0, parsel 1.10.0, w3lib 2.3.1, Twisted 25.5.0, Python 3.11.13 (main, Jul 15 2025, 19:29:01) [GCC 14.2.0], pyOpenSSL 25.1.0 (OpenSSL 3.5.1 1 Jul 2025), cryptography 45.0.5, Platform Linux-5.15.0-139-generic-x86_64-with 2025-09-25 20:16:45 [scrapy.addons] INFO: Enabled addons: [] 2025-09-25 20:16:45 [asyncio] DEBUG: Using selector: EpollSelector 2025-09-25 20:16:45 [scrapy.utils.log] DEBUG: Using reactor: twisted.internet.asyncioreactor.AsyncioSelectorReactor 2025-09-25 20:16:45 [scrapy.utils.log] DEBUG: Using asyncio event loop: asyncio.unix_events._UnixSelectorEventLoop 2025-09-25 20:16:45 [scrapy.extensions.telnet] INFO: Telnet Password: 344f9cd47053dc3f 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from creating-client-class.iot-data to creating-client-class.iot-data-plane 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-call.apigateway to before-call.api-gateway 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from request-created.machinelearning.Predict to request-created.machine-learning.Predict 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.autoscaling.CreateLaunchConfiguration to before-parameter-build.auto-scaling.CreateLaunchConfiguration 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.route53 to before-parameter-build.route-53 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from request-created.cloudsearchdomain.Search to request-created.cloudsearch-domain.Search 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.autoscaling.CreateLaunchConfiguration.complete-section to docs.*.auto-scaling.CreateLaunchConfiguration.complete-section 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing 
event name from before-parameter-build.logs.CreateExportTask to before-parameter-build.cloudwatch-logs.CreateExportTask 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.logs.CreateExportTask.complete-section to docs.*.cloudwatch-logs.CreateExportTask.complete-section 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.cloudsearchdomain.Search to before-parameter-build.cloudsearch-domain.Search 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.cloudsearchdomain.Search.complete-section to docs.*.cloudsearch-domain.Search.complete-section 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/endpoints.json 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/sdk-default-configuration.json 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Event choose-service-name: calling handler 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/s3/2006-03-01/service-2.json.gz 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/s3/2006-03-01/endpoint-rule-set-1.json.gz 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/partitions.json 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler ._handler at 0x7ff35dfd7c40> 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler 2025-09-25 20:16:45 [botocore.endpoint] DEBUG: Setting s3 timeout as (60, 60) 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/_retry.json 2025-09-25 20:16:45 
[botocore.client] DEBUG: Registering retry handlers for service: s3 2025-09-25 20:16:45 [botocore.utils] DEBUG: Registering S3 region redirector handler 2025-09-25 20:16:45 [botocore.utils] DEBUG: Registering S3Express Identity Resolver 2025-09-25 20:16:45 [scrapy.middleware] INFO: Enabled extensions: ['scrapy.extensions.corestats.CoreStats', 'scrapy.extensions.telnet.TelnetConsole', 'scrapy.extensions.memusage.MemoryUsage', 'scrapy.extensions.closespider.CloseSpider', 'scrapy.extensions.feedexport.FeedExporter', 'scrapy.extensions.logstats.LogStats', 'scrapy.extensions.throttle.AutoThrottle'] 2025-09-25 20:16:45 [scrapy.crawler] INFO: Overridden settings: {'AUTOTHROTTLE_ENABLED': True, 'BOT_NAME': 'news_scraper', 'CLOSESPIDER_TIMEOUT': 1800, 'CONCURRENT_REQUESTS': 4, 'DOWNLOAD_DELAY': 2, 'FEED_EXPORT_ENCODING': 'utf-8', 'LOG_FILE': '/opt/scrapyd/logs/news_scraper/tuoitre_timestamp/7cc84e3e9a4c11f086971e907748958e.log', 'NEWSPIDER_MODULE': 'news_scraper.spiders', 'REQUEST_FINGERPRINTER_IMPLEMENTATION': '2.7', 'ROBOTSTXT_OBEY': True, 'SPIDER_MODULES': ['news_scraper.spiders'], 'TWISTED_REACTOR': 'twisted.internet.asyncioreactor.AsyncioSelectorReactor'} 2025-09-25 20:16:45 [scrapy.middleware] INFO: Enabled downloader middlewares: ['scrapy.downloadermiddlewares.robotstxt.RobotsTxtMiddleware', 'scrapy.downloadermiddlewares.httpauth.HttpAuthMiddleware', 'scrapy.downloadermiddlewares.downloadtimeout.DownloadTimeoutMiddleware', 'scrapy.downloadermiddlewares.defaultheaders.DefaultHeadersMiddleware', 'scrapy.downloadermiddlewares.useragent.UserAgentMiddleware', 'news_scraper.middlewares.NewsScraperDownloaderMiddleware', 'scrapy.downloadermiddlewares.retry.RetryMiddleware', 'scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware', 'scrapy.downloadermiddlewares.httpcompression.HttpCompressionMiddleware', 'scrapy.downloadermiddlewares.redirect.RedirectMiddleware', 'scrapy.downloadermiddlewares.cookies.CookiesMiddleware', 
'scrapy.downloadermiddlewares.httpproxy.HttpProxyMiddleware', 'scrapy.downloadermiddlewares.stats.DownloaderStats'] 2025-09-25 20:16:45 [scrapy.middleware] INFO: Enabled spider middlewares: ['scrapy.spidermiddlewares.httperror.HttpErrorMiddleware', 'scrapy.spidermiddlewares.offsite.OffsiteMiddleware', 'scrapy.spidermiddlewares.referer.RefererMiddleware', 'scrapy.spidermiddlewares.urllength.UrlLengthMiddleware', 'scrapy.spidermiddlewares.depth.DepthMiddleware'] 2025-09-25 20:16:45 [scrapy.middleware] INFO: Enabled item pipelines: [] 2025-09-25 20:16:45 [scrapy.core.engine] INFO: Spider opened 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from creating-client-class.iot-data to creating-client-class.iot-data-plane 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-call.apigateway to before-call.api-gateway 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from request-created.machinelearning.Predict to request-created.machine-learning.Predict 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.autoscaling.CreateLaunchConfiguration to before-parameter-build.auto-scaling.CreateLaunchConfiguration 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.route53 to before-parameter-build.route-53 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from request-created.cloudsearchdomain.Search to request-created.cloudsearch-domain.Search 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.autoscaling.CreateLaunchConfiguration.complete-section to docs.*.auto-scaling.CreateLaunchConfiguration.complete-section 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.logs.CreateExportTask to before-parameter-build.cloudwatch-logs.CreateExportTask 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.logs.CreateExportTask.complete-section to 
docs.*.cloudwatch-logs.CreateExportTask.complete-section 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from before-parameter-build.cloudsearchdomain.Search to before-parameter-build.cloudsearch-domain.Search 2025-09-25 20:16:45 [botocore.hooks] DEBUG: Changing event name from docs.*.cloudsearchdomain.Search.complete-section to docs.*.cloudsearch-domain.Search.complete-section 2025-09-25 20:16:45 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/endpoints.json 2025-09-25 20:16:46 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/sdk-default-configuration.json 2025-09-25 20:16:46 [botocore.hooks] DEBUG: Event choose-service-name: calling handler 2025-09-25 20:16:46 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/s3/2006-03-01/service-2.json.gz 2025-09-25 20:16:46 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/s3/2006-03-01/endpoint-rule-set-1.json.gz 2025-09-25 20:16:46 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/partitions.json 2025-09-25 20:16:46 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler 2025-09-25 20:16:46 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler ._handler at 0x7ff35d230b80> 2025-09-25 20:16:46 [botocore.hooks] DEBUG: Event creating-client-class.s3: calling handler 2025-09-25 20:16:46 [botocore.endpoint] DEBUG: Setting s3 timeout as (60, 60) 2025-09-25 20:16:46 [botocore.loaders] DEBUG: Loading JSON file: /usr/local/lib/python3.11/site-packages/botocore/data/_retry.json 2025-09-25 20:16:46 [botocore.client] DEBUG: Registering retry handlers for service: s3 2025-09-25 20:16:46 [botocore.utils] DEBUG: Registering S3 region redirector handler 2025-09-25 20:16:46 [botocore.utils] DEBUG: Registering S3Express Identity Resolver 2025-09-25 
20:16:46 [scrapy.extensions.logstats] INFO: Crawled 0 pages (at 0 pages/min), scraped 0 items (at 0 items/min) 2025-09-25 20:16:46 [tuoitre_timestamp] INFO: Spider opened: tuoitre_timestamp 2025-09-25 20:16:46 [scrapy.extensions.telnet] INFO: Telnet console listening on 127.0.0.1:6031 2025-09-25 20:16:46 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-09-25 20:16:50 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-09-25 20:16:54 [scrapy.downloadermiddlewares.redirect] DEBUG: Redirecting (301) to from 2025-09-25 20:16:56 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:16:57 [scrapy.dupefilters] DEBUG: Filtered duplicate request: - no more duplicates will be shown (see DUPEFILTER_DEBUG to show all duplicates) 2025-09-25 20:16:59 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:16:59 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nhieu-chien-dau-co-my-xuat-kich-chan-may-bay-nga-o-alaska-20250925215103274.htm 2025-09-25 20:17:00 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:00 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/tp-hcm-thuc-day-hop-tac-voi-my-20250925181924804.htm 2025-09-25 20:17:06 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:06 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/khoanh-khac-rot-nuoc-mat-cua-co-gai-ten-lua-le-thi-hang-20250925165447017.htm 2025-09-25 20:17:08 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:08 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: 
https://tuoitre.vn/thuong-hieu-nhung-bai-hoc-bao-ve-uy-tin-ky-2-doanh-nghiep-kien-ngoi-sao-tiktok-2025092511311789.htm 2025-09-25 20:17:10 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:10 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/ba-con-nong-dan-hao-hung-tham-gia-chuong-trinh-mua-vang-thang-lon-2025-20250925061831553.htm 2025-09-25 20:17:12 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:12 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/lap-quy-nha-o-quoc-gia-lam-nha-xa-hoi-pho-thu-tuong-yeu-cau-xay-xong-phai-co-nguoi-thue-20250925152749586.htm 2025-09-25 20:17:13 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:14 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nguy-co-thung-mang-nhi-o-tre-tu-nhung-thoi-quen-tuong-vo-hai-20250925164455486.htm 2025-09-25 20:17:16 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:16 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/tap-trung-xay-dung-du-thao-nghi-quyet-ve-chan-hung-va-phat-trien-van-hoa-20250925141113241.htm 2025-09-25 20:17:19 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:19 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/dua-du-khach-tham-quan-bang-nhung-cu-click-chuot-20250925125933009.htm 2025-09-25 20:17:21 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:21 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: 
https://tuoitre.vn/phat-hien-85-ho-ngam-moi-duoi-lop-bang-nam-cuc-co-the-lam-muc-nuoc-bien-dang-20250925140737054.htm 2025-09-25 20:17:23 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:23 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/video/xoa-xam-co-xoa-duoc-triet-de-hay-khong-165725.htm 2025-09-25 20:17:25 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:25 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/video/6-dau-hieu-tham-lang-canh-bao-dot-quy-truoc-1-thang-183817.htm 2025-09-25 20:17:27 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:27 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/so-giao-duc-tp-hcm-sap-kiem-tra-thu-chi-dau-nam-hoc-xu-nghiem-truong-vi-pham-20250925175344002.htm 2025-09-25 20:17:29 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:30 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/thieu-kien-truc-su-truong-chuyen-doi-so-viet-nam-doi-mat-lo-hong-an-ninh-20250925180355886.htm 2025-09-25 20:17:32 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:32 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-09-25 20:17:32 [protego._protego] DEBUG: Rule at line 1 without any user agent to enforce it on. 2025-09-25 20:17:32 [protego._protego] DEBUG: Rule at line 2 without any user agent to enforce it on. 
2025-09-25 20:17:32 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/voronkova-irina-ngoai-binh-khung-da-co-mat-tai-binh-chung-thong-tin-binh-doan-19-20250925152534822.htm 2025-09-25 20:17:34 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:34 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/minh-quan-vinh-hien-khien-ben-johns-bat-ngo-o-ppa-asia-20250925162516823.htm 2025-09-25 20:17:36 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:36 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/viet-huong-tiet-lo-4-ky-bien-hinh-cho-vai-ba-ngoai-70-tuoi-20250925181947627.htm 2025-09-25 20:17:36 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-09-25 20:17:36 [protego._protego] DEBUG: Rule at line 1 without any user agent to enforce it on. 2025-09-25 20:17:36 [protego._protego] DEBUG: Rule at line 2 without any user agent to enforce it on. 
2025-09-25 20:17:39 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:39 [tuoitre_timestamp] INFO: 2025-09-23 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://cuoituan.tuoitre.vn/om-viec-thay-vi-nhay-viec-20250918152332459.htm 2025-09-25 20:17:40 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:40 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://cuoi.tuoitre.vn/vua-duoc-mo-nao-vua-choi-guitar-sao-hay-the-20250925140342323.htm 2025-09-25 20:17:40 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:40 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/bao-bualoi-rat-manh-co-the-di-thang-vao-da-nang-hue-hoac-vao-bac-trung-bo-20250925150835913.htm 2025-09-25 20:17:40 [scrapy.core.engine] DEBUG: Crawled (200) (referer: None) 2025-09-25 20:17:40 [protego._protego] DEBUG: Rule at line 1 without any user agent to enforce it on. 2025-09-25 20:17:40 [protego._protego] DEBUG: Rule at line 2 without any user agent to enforce it on. 
2025-09-25 20:17:41 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:41 [tuoitre_timestamp] INFO: 2025-09-24 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://cuoituan.tuoitre.vn/phau-thuat-tham-my-lieu-thuoc-tu-tin-moi-cua-dan-ong-20250917093835022.htm 2025-09-25 20:17:42 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:43 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/video/nguoi-nhat-ve-chai-phat-hien-thi-the-tre-so-sinh-trong-thung-cac-tong-o-trung-my-tay-tphcm-187404.htm 2025-09-25 20:17:43 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:43 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://cuoi.tuoitre.vn/vo-thuc-xuyen-dem-giai-bai-toan-sinh-con-duoc-thuong-20250924234103451.htm 2025-09-25 20:17:44 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:44 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://podcast.tuoitre.vn/khi-gen-z-thanh-nan-nhan-cua-bat-coc-online-105008.htm 2025-09-25 20:17:45 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:45 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/video/tien-luong-day-them-gio-cua-nha-giao-duoc-tinh-nhu-the-nao-187420.htm 2025-09-25 20:17:46 [scrapy.extensions.logstats] INFO: Crawled 31 pages (at 31 pages/min), scraped 0 items (at 0 items/min) 2025-09-25 20:17:48 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:48 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/video/niem-vui-doan-tu-sau-khi-nguoi-chong-duoc-mien-di-tu-187413.htm 2025-09-25 20:17:50 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:50 [tuoitre_timestamp] INFO: Invalid 
article: https://tuoitre.vn/video/se-som-thong-tin-ket-qua-xu-ly-vu-mv-anh-em-truoc-sau-nhu-mot-cua-nhom-ngu-ho-tuong-187422.htm 2025-09-25 20:17:52 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:53 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/lan-chiem-via-he-long-duong-e961.htm 2025-09-25 20:17:53 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:17:53 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nam-ca-si-muon-danh-tham-phan-nhan-2-8-ti-hua-giup-duong-su-thang-kien-20250925181629854.htm 2025-09-25 20:18:39 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:18:39 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/nha-o-xa-hoi-e1197.htm 2025-09-25 20:18:46 [scrapy.extensions.logstats] INFO: Crawled 36 pages (at 5 pages/min), scraped 0 items (at 0 items/min) 2025-09-25 20:19:17 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:19:17 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/chinh-quyen-2-cap-e1963.htm 2025-09-25 20:19:31 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:19:31 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/nam-hoc-moi-e144.htm 2025-09-25 20:19:46 [scrapy.extensions.logstats] INFO: Crawled 38 pages (at 2 pages/min), scraped 0 items (at 0 items/min) 2025-09-25 20:19:50 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:19:50 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/tiep-suc-den-truong-2025-e1982.htm 2025-09-25 20:19:59 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:19:59 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/gia-vang-e592.htm 2025-09-25 20:20:01 [scrapy.core.engine] DEBUG: Crawled (200) (referer: 
https://tuoitre.vn/) 2025-09-25 20:20:01 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/dai-hoi-dang-bo-tphcm-e1981.htm 2025-09-25 20:20:04 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:04 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/trung-tam-tai-chinh-quoc-te-e537.htm 2025-09-25 20:20:07 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:07 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/champions-league-2025-2026-e1996.htm 2025-09-25 20:20:10 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:10 [tuoitre_timestamp] INFO: Invalid article: https://tuoitre.vn/sieu-bao-ragasa-e1997.htm 2025-09-25 20:20:12 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:12 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nhom-thanh-nien-tu-khanh-hoa-len-dak-lak-siet-no-lanh-an-20250925170040191.htm 2025-09-25 20:20:14 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:15 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/uu-tien-lam-lai-quoc-lo-am-anh-nhat-da-nang-qua-lao-20250925165017842.htm 2025-09-25 20:20:16 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:16 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/hang-trieu-nguoi-my-se-bi-bop-khau-phan-duoi-thoi-ong-trump-20250925172842956.htm 2025-09-25 20:20:19 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:19 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: 
https://tuoitre.vn/cuu-tong-giam-doc-sjc-le-thuy-hang-noi-pham-toi-vi-lo-cho-doi-song-anh-em-20250925175308138.htm 2025-09-25 20:20:22 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:22 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/khoi-tai-san-vo-ong-pham-nhat-vuong-da-vuot-moc-ti-usd-20250925181917004.htm 2025-09-25 20:20:24 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:24 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/vua-cua-sieu-bao-ragasa-ngoi-bat-dong-truoc-song-du-20250925175410604.htm 2025-09-25 20:20:26 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:26 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/my-dua-dan-phong-khong-toi-tan-ap-sat-vung-phi-quan-su-han-trieu-20250925182528114.htm 2025-09-25 20:20:29 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:29 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/vo-minh-lam-ngoc-doi-thu-van-xuc-dong-khi-xem-phim-ve-chuong-vang-vong-co-20250925181554534.htm 2025-09-25 20:20:31 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:31 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/trung-tam-logistics-cai-mep-ha-la-du-an-trong-diem-keu-goi-dau-tu-20250925192713862.htm 2025-09-25 20:20:32 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:33 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: 
https://tuoitre.vn/doanh-nghiep-phap-ton-ca-mo-tien-thue-tham-tu-lat-tay-nhan-vien-gia-om-tron-viec-20250925173341429.htm 2025-09-25 20:20:35 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:35 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/lang-lam-thuyen-thung-phu-my-thang-giai-nhiep-anh-heritage-20250925184558646.htm 2025-09-25 20:20:38 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:38 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nha-vo-dich-marathon-shewarge-alene-qua-doi-dot-ngot-o-tuoi-30-20250925194502421.htm 2025-09-25 20:20:40 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:40 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/toyota-muon-mo-rong-xe-xanh-pho-cap-o-to-o-viet-nam-20250925171601731.htm 2025-09-25 20:20:42 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:42 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/vi-pham-iuu-4-thuyen-truong-tau-ca-o-quang-ngai-bi-phat-gan-900-trieu-dong-20250925201614835.htm 2025-09-25 20:20:44 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:45 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/ru-nu-sinh-12-tuoi-quan-he-thieu-nien-lanh-an-9-nam-tu-20250925201559482.htm 2025-09-25 20:20:46 [scrapy.extensions.logstats] INFO: Crawled 59 pages (at 21 pages/min), scraped 0 items (at 0 items/min) 2025-09-25 20:20:47 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:47 [tuoitre_timestamp] INFO: 
2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/khai-mac-dien-dan-ket-noi-san-pham-ocop-vung-dong-bang-song-cuu-long-20250925212914451.htm 2025-09-25 20:20:50 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:50 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/xa-xuan-thoi-son-van-dong-ban-giao-mat-bang-cho-du-an-khu-do-thi-dai-hoc-quoc-te-20250925193958941.htm 2025-09-25 20:20:52 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:52 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nu-giam-doc-quan-he-khach-hang-cua-ngan-hang-chiem-doat-hon-20-ti-cua-khach-20250925204422045.htm 2025-09-25 20:20:55 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:55 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/lanh-dao-ha-noi-yeu-cau-xu-nghiem-nhom-nguoi-xa-trom-chat-thai-be-phot-xuong-song-to-lich-20250925203103553.htm 2025-09-25 20:20:58 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:20:58 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/ba-nguoi-trung-quoc-giet-tai-xe-taxi-o-quang-ngai-lanh-an-11-19-nam-tu-20250925203410999.htm 2025-09-25 20:21:00 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:00 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nghiem-tri-hanh-vi-tu-tap-gay-roi-can-quay-tren-duong-pho-da-nang-20250925202303609.htm 2025-09-25 20:21:02 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:02 
[tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/dac-san-quang-tri-vao-tp-hcm-tim-thi-truong-phan-phoi-20250925180208863.htm 2025-09-25 20:21:05 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:05 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/chup-ha-noi-dem-dep-thi-de-tim-nhung-goc-khac-di-thi-kho-20250925210213268.htm 2025-09-25 20:21:06 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:06 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/israel-ban-pha-nhieu-muc-tieu-quan-su-o-thu-do-cua-yemen-20250925212458339.htm 2025-09-25 20:21:10 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:10 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/chau-a-sach-bong-o-giai-bong-chuyen-the-gioi-20250925203731201.htm 2025-09-25 20:21:12 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:12 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/hoang-sao-2-lan-thoat-hiem-truoc-filler-vao-vong-1-8-pool-10-bi-the-gioi-20250925213142425.htm 2025-09-25 20:21:15 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:15 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/nga-noi-drone-xam-pham-dan-mach-la-dan-dung-de-khieu-khich-20250925221336563.htm 2025-09-25 20:21:17 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:17 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 
2025-09-26, skipping article: https://tuoitre.vn/lotte-rut-lui-lien-danh-sunshine-dia-chua-the-nhay-vao-du-an-thu-thiem-eco-smart-city-2025092522505519.htm 2025-09-25 20:21:20 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:20 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/mua-nhieu-nguoi-duoi-16-tuoi-lam-massage-thieu-nu-16-tuoi-va-dong-bon-bi-de-nghi-tu-9-den-15-nam-tu-20250925231823824.htm 2025-09-25 20:21:21 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:22 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/bo-giao-duc-chua-co-phuong-an-sap-nhap-tinh-gian-dai-hoc-nhu-tin-don-20250925185907706.htm 2025-09-25 20:21:24 [scrapy.core.engine] DEBUG: Crawled (200) (referer: https://tuoitre.vn/) 2025-09-25 20:21:24 [tuoitre_timestamp] INFO: 2025-09-25 is out of date range: from 2025-09-26 to 2025-09-26, skipping article: https://tuoitre.vn/gioi-chuyen-gia-cong-bo-so-lieu-cac-vu-hack-nham-vao-ngan-hang-viet-20250925182021902.htm 2025-09-25 20:21:24 [scrapy.core.engine] INFO: Closing spider (finished) 2025-09-25 20:21:24 [boto3.s3.transfer] DEBUG: Opting out of CRT Transfer Manager. Preferred client: auto, CRT available: False, Instance Optimized: False. 2025-09-25 20:21:24 [boto3.s3.transfer] DEBUG: Using default client. 
pid: 171333, thread: 140683213757240 2025-09-25 20:21:24 [s3transfer.utils] DEBUG: Acquiring 0 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: UploadSubmissionTask(transfer_id=0, {'transfer_future': }) about to wait for the following futures [] 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: UploadSubmissionTask(transfer_id=0, {'transfer_future': }) done waiting for dependent futures 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: Executing task UploadSubmissionTask(transfer_id=0, {'transfer_future': }) with kwargs {'client': , 'config': , 'osutil': , 'request_executor': , 'transfer_future': } 2025-09-25 20:21:24 [s3transfer.futures] DEBUG: Submitting task PutObjectTask(transfer_id=0, {'bucket': 'dagster-output-data', 'key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'extra_args': {}}) to executor for transfer request: 0. 2025-09-25 20:21:24 [s3transfer.utils] DEBUG: Acquiring 0 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: PutObjectTask(transfer_id=0, {'bucket': 'dagster-output-data', 'key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'extra_args': {}}) about to wait for the following futures [] 2025-09-25 20:21:24 [s3transfer.utils] DEBUG: Releasing acquire 0/None 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: PutObjectTask(transfer_id=0, {'bucket': 'dagster-output-data', 'key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'extra_args': {}}) done waiting for dependent futures 2025-09-25 20:21:24 [s3transfer.tasks] DEBUG: Executing task PutObjectTask(transfer_id=0, {'bucket': 'dagster-output-data', 'key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'extra_args': {}}) with kwargs {'client': , 'fileobj': , 'bucket': 'dagster-output-data', 'key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'extra_args': {}} 2025-09-25 
20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-parameter-build.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-endpoint-resolution.s3: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-endpoint-resolution.s3: calling handler > 2025-09-25 20:21:24 [botocore.regions] DEBUG: Calling endpoint provider with parameters: {'Bucket': 'dagster-output-data', 'Region': 'us-east-1', 'UseFIPS': False, 'UseDualStack': False, 'Endpoint': 'https://lake-api.actable.ai/', 'ForcePathStyle': True, 'Accelerate': False, 'UseGlobalEndpoint': True, 'Key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'DisableMultiRegionAccessPoints': False, 'UseArnRegion': True} 2025-09-25 20:21:24 [botocore.regions] DEBUG: Endpoint provider result: https://lake-api.actable.ai/dagster-output-data 2025-09-25 20:21:24 [botocore.regions] DEBUG: Selecting from endpoint provider's list of auth schemes: "sigv4". 
User selected auth scheme is: "None" 2025-09-25 20:21:24 [botocore.regions] DEBUG: Selected auth type "v4" as "v4" with signing context params: {'region': 'us-east-1', 'signing_name': 's3', 'disableDoubleEncoding': True} 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-call.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-call.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.handlers] DEBUG: Adding expect 100 continue header to request. 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-call.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-call.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-call.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.endpoint] DEBUG: Making request for OperationModel(name=PutObject) with params: {'url_path': '/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'query_string': {}, 'method': 'PUT', 'headers': {'User-Agent': 'Boto3/1.34.57 md/Botocore#1.34.162 ua/2.0 os/linux#5.15.0-139-generic md/arch#x86_64 lang/python#3.11.13 md/pyimpl#CPython cfg/retry-mode#legacy Botocore/1.34.162', 'Content-MD5': '1B2M2Y8AsgTpgAmY7PhCfg==', 'Expect': '100-continue'}, 'body': , 'auth_path': '/dagster-output-data/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'url': 'https://lake-api.actable.ai/dagster-output-data/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'context': {'client_region': 'us-east-1', 'client_config': , 'has_streaming_input': True, 'auth_type': 'v4', 's3_redirect': {'redirected': False, 'bucket': 'dagster-output-data', 'params': {'Bucket': 'dagster-output-data', 'Key': 'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl', 'Body': }}, 'input_params': {'Bucket': 'dagster-output-data', 'Key': 
'tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl'}, 'signing': {'region': 'us-east-1', 'signing_name': 's3', 'disableDoubleEncoding': True}, 'endpoint_properties': {'authSchemes': [{'disableDoubleEncoding': True, 'name': 'sigv4', 'signingName': 's3', 'signingRegion': 'us-east-1'}]}}} 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event request-created.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event request-created.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event choose-signer.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event choose-signer.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-sign.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event before-sign.s3.PutObject: calling handler > 2025-09-25 20:21:24 [botocore.auth] DEBUG: Calculating signature using v4 auth. 2025-09-25 20:21:24 [botocore.auth] DEBUG: CanonicalRequest: PUT /dagster-output-data/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl content-md5:1B2M2Y8AsgTpgAmY7PhCfg== host:lake-api.actable.ai x-amz-content-sha256:UNSIGNED-PAYLOAD x-amz-date:20250925T202124Z content-md5;host;x-amz-content-sha256;x-amz-date UNSIGNED-PAYLOAD 2025-09-25 20:21:24 [botocore.auth] DEBUG: StringToSign: AWS4-HMAC-SHA256 20250925T202124Z 20250925/us-east-1/s3/aws4_request 36db56ff004743b23d85f25f833eddae3c36c1eded11fb0d46cde80e4fd346ff 2025-09-25 20:21:24 [botocore.auth] DEBUG: Signature: ea1c14fa7e321d25f818255928e7f0ddae51de6e6da14bba2bef9ef8eb7ae43e 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event request-created.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event request-created.s3.PutObject: calling handler 2025-09-25 20:21:24 [botocore.endpoint] DEBUG: Sending http request: 2025-09-25 20:21:24 [botocore.httpsession] DEBUG: Certificate path: 
/usr/local/lib/python3.11/site-packages/certifi/cacert.pem
2025-09-25 20:21:24 [urllib3.connectionpool] DEBUG: Starting new HTTPS connection (1): lake-api.actable.ai:443
2025-09-25 20:21:24 [botocore.awsrequest] DEBUG: Waiting for 100 Continue response.
2025-09-25 20:21:24 [botocore.awsrequest] DEBUG: 100 Continue response seen, now sending request body.
2025-09-25 20:21:24 [urllib3.connectionpool] DEBUG: https://lake-api.actable.ai:443 "PUT /dagster-output-data/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl HTTP/1.1" 200 0
2025-09-25 20:21:24 [botocore.parsers] DEBUG: Response headers: {'Server': 'nginx/1.24.0 (Ubuntu)', 'Date': 'Thu, 25 Sep 2025 20:21:24 GMT', 'Content-Length': '0', 'Connection': 'keep-alive', 'Accept-Ranges': 'bytes', 'ETag': '"d41d8cd98f00b204e9800998ecf8427e"', 'Strict-Transport-Security': 'max-age=31536000; includeSubDomains', 'Vary': 'Origin, Accept-Encoding', 'X-Amz-Bucket-Region': 'us-east-1', 'X-Amz-Id-2': 'dd9025bab4ad464b049177c95eb6ebf374d3b3fd1af9251148b658df7ac2e3e8', 'X-Amz-Request-Id': '1868A0194900BA76', 'X-Content-Type-Options': 'nosniff', 'X-Ratelimit-Limit': '25637', 'X-Ratelimit-Remaining': '25637', 'X-Xss-Protection': '1; mode=block'}
2025-09-25 20:21:24 [botocore.parsers] DEBUG: Response body: b''
2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event needs-retry.s3.PutObject: calling handler
2025-09-25 20:21:24 [botocore.retryhandler] DEBUG: No retry needed.
2025-09-25 20:21:24 [botocore.hooks] DEBUG: Event needs-retry.s3.PutObject: calling handler >
2025-09-25 20:21:24 [s3transfer.utils] DEBUG: Releasing acquire 0/None
2025-09-25 20:21:24 [scrapy.extensions.feedexport] INFO: Stored jsonlines feed (0 items) in: s3://dagster-output-data/tuoitre_timestamp/tuoitre_timestamp_7cc84e3e9a4c11f086971e907748958e_scheduled_2025-09-26.jl
2025-09-25 20:21:24 [scrapy.statscollectors] INFO: Dumping Scrapy stats:
{'downloader/request_bytes': 23547,
 'downloader/request_count': 76,
 'downloader/request_method_count/GET': 76,
 'downloader/response_bytes': 10090548,
 'downloader/response_count': 76,
 'downloader/response_status_count/200': 75,
 'downloader/response_status_count/301': 1,
 'dupefilter/filtered': 1,
 'elapsed_time_seconds': 279.068489,
 'feedexport/success_count/S3FeedStorage': 1,
 'finish_reason': 'finished',
 'finish_time': datetime.datetime(2025, 9, 25, 20, 21, 24, 932645, tzinfo=datetime.timezone.utc),
 'httpcompression/response_bytes': 26856900,
 'httpcompression/response_count': 74,
 'log_count/DEBUG': 193,
 'log_count/INFO': 85,
 'memusage/max': 186347520,
 'memusage/startup': 124477440,
 'request_depth_max': 2,
 'response_received_count': 75,
 'robotstxt/request_count': 4,
 'robotstxt/response_count': 4,
 'robotstxt/response_status_count/200': 4,
 'scheduler/dequeued': 72,
 'scheduler/dequeued/memory': 72,
 'scheduler/enqueued': 72,
 'scheduler/enqueued/memory': 72,
 'start_time': datetime.datetime(2025, 9, 25, 20, 16, 45, 864156, tzinfo=datetime.timezone.utc)}
2025-09-25 20:21:24 [scrapy.core.engine] INFO: Spider closed (finished)