Releases: apify/crawlee-python
0.4.1 (2024-11-11)
🚀 Features
- Add max_crawl_depth option to BasicCrawler (#637) (77deaa9) by @Prathamesh010
- Add BeautifulSoupParser type alias (#674) (b2cf88f) by @Pijukatel
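
To illustrate what a depth limit such as max_crawl_depth does, here is a minimal sketch in plain Python. It does not use Crawlee's API: the crawl_with_depth_limit function and the in-memory LINKS graph are hypothetical stand-ins for a real crawler and real pages, and treating the start URL as depth 0 is an assumption rather than something stated in these notes.

```python
from collections import deque

# Hypothetical in-memory "website": each URL maps to the links found on that page.
LINKS = {
    "https://example.com/": ["https://example.com/a", "https://example.com/b"],
    "https://example.com/a": ["https://example.com/a/1"],
    "https://example.com/b": [],
    "https://example.com/a/1": ["https://example.com/a/1/x"],
    "https://example.com/a/1/x": [],
}


def crawl_with_depth_limit(start_url, max_depth):
    """Breadth-first crawl that does not follow links found at max_depth."""
    visited = []
    seen = {start_url}
    queue = deque([(start_url, 0)])  # (url, depth); the start URL is assumed to be depth 0
    while queue:
        url, depth = queue.popleft()
        visited.append(url)
        if depth >= max_depth:
            # Pages at the limit are still processed, but their links are not enqueued.
            continue
        for link in LINKS.get(url, []):
            if link not in seen:
                seen.add(link)
                queue.append((link, depth + 1))
    return visited
```

With a limit of 1, only the start page and the pages it links to directly are visited; deeper pages are never enqueued.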
🐛 Bug Fixes
- Fix total_size usage in memory size monitoring (#661) (c2a3239) by @janbuchar
- Add HttpHeaders to module exports (#664) (f0c5ca7) by @vdusek
- Fix unhandled ValueError in request handler result processing (#666) (0a99d7f) by @janbuchar
- Fix BaseDatasetClient.iter_items type hints (#680) (a968b1b) by @Pijukatel
0.4.0 (2024-11-01)
🚀 Features
- Add headers in unique key computation (#609) (6c4746f) by @Prathamesh010
- Add pre_navigation_hooks to PlaywrightCrawler (#631) (5dd5b60) by @Prathamesh010
- Add always_enqueue option to bypass URL deduplication (#621) (4e59fa4) by @Rutam21
- Split and add extra configuration to export_data method (#580) (6751635) by @deshansh
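
The ideas behind two of these features, including headers in the unique key and bypassing deduplication with always_enqueue, can be sketched in plain Python. This is not Crawlee's actual key computation: the compute_unique_key function, its parameters, the SHA-256 hashing scheme and the random-key bypass are all assumptions made for illustration.

```python
import hashlib
import uuid


def compute_unique_key(url, method="GET", headers=None, always_enqueue=False):
    """Build a deduplication key from the method, URL and normalized headers."""
    if always_enqueue:
        # A random key practically never collides, so deduplication is bypassed.
        return uuid.uuid4().hex
    # Normalize headers so that case, surrounding whitespace and ordering do not matter.
    normalized = sorted(
        (name.strip().lower(), value.strip())
        for name, value in (headers or {}).items()
    )
    raw = "|".join([method.upper(), url, repr(normalized)])
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()
```

In this sketch, two requests to the same URL with different header values get different keys, while header names that differ only in case or padding produce the same key.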
🐛 Bug Fixes
- Use strip in headers normalization (#614) (a15b21e) by @vdusek
- Merge payload and data fields of Request (#542) (d06fcef) by @vdusek
- Default ProxyInfo port if httpx.URL port is None (#619) (8107a6f) by @steffansafey
0.3.9 (2024-10-23)
🚀 Features
- Key-value store context helpers (#584) (fc15622) by @janbuchar
- Added get_public_url method to KeyValueStore (#572, closes #514) (3a4ba8f) by @akshay11298
🐛 Bug Fixes
- Workaround for JSON value typing problems (#581, closes #563) (403496a) by @janbuchar
0.3.8
0.3.7
0.3.6
0.3.5
0.3.4
0.3.3 (2024-09-05)
🐛 Bug Fixes
- Deduplicate requests by unique key before submitting them to the queue (#499) (6a3e0e7) by @janbuchar
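
As a rough illustration of deduplicating a batch by unique key before it is submitted, the following plain-Python sketch keeps the first request seen for each key. The deduplicate_by_unique_key function and the dictionary-based request shape are hypothetical and do not reflect Crawlee's internal types.

```python
def deduplicate_by_unique_key(requests):
    """Keep the first request for each unique key, preserving the original order."""
    seen = set()
    unique = []
    for request in requests:
        key = request["unique_key"]  # assumed field name for this sketch
        if key in seen:
            continue
        seen.add(key)
        unique.append(request)
    return unique
```

Filtering on the client side like this means the queue never receives two entries with the same key in a single batch.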
0.3.2 (2024-09-04)
🐛 Bug Fixes
- Double incrementation of item_count (#443, closes #442) (cd9adf1) by @cadlagtrader
- Field alias in BatchRequestsOperationResponse (#485) (126a862) by @janbuchar
- JSON handling with Parsel (#490, closes #488) (ebf5755) by @janbuchar