Releases: apify/crawlee-python
1.6.1
1.6.1 (2026-03-30)
🐛 Bug Fixes
- Handle invalid URLs in RequestList (#1803) (0b2e3fc) by @Mantisus
- playwright: Filter unsupported context options in persistent browser (#1796) (69ad22e) by @sushant-mutnale
- Remove double usage_count increment in Session.retire() (#1816) (c40d411) by @vdusek
- Defer page object cleanup to make it accessible in error handlers (#1814) (7eeb500) by @janbuchar
⚡ Performance
1.6.0
1.6.0 (2026-03-20)
🚀 Features
- Allow non-href links extract & enqueue (#1781) (6db365d) by @kozlice
- Add post_navigation_hooks to crawlers (#1795) (38ceda6) by @Mantisus
- Add page lifecycle hooks to BrowserPool (#1791) (6f2ac13) by @Mantisus
- Expose BrowserType and CrawleePage (#1798) (b50b9f2) by @Mantisus
- Expose use_state in BasicCrawler (#1799) (d121873) by @Mantisus
🐛 Bug Fixes
1.5.0
1.5.0 (2026-03-06)
🚀 Features
- Use specialized Playwright docker images in templates (#1757) (747c0cf) by @Pijukatel
- Add discover_valid_sitemaps utility (#1777) (872447b) by @Mantisus
🐛 Bug Fixes
- Prevent list modification during iteration in BrowserPool (#1703) (70309d9) by @vdusek
- Fix max_requests_per_crawl excluding failed requests (#1766) (d6bb0b4) by @Pijukatel
- playwright: Dispose of APIResponse body for send_request (#1771) (29d301b) by @kozlice
- Return None from add_request when storage client fails to enqueue request (#1775) (944753a) by @Mantisus
- Re-use pre-existing browser context in PlaywrightBrowserController (#1778) (4487543) by @Pijukatel
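
As an illustration of the class of bug named in the BrowserPool fix above (#1703), here is a minimal sketch in plain Python. It is not Crawlee's code: the function names and the dictionary layout are hypothetical, chosen only to show why removing items from a list while iterating over it skips elements, and how iterating over a copy avoids that.

```python
def remove_idle_buggy(browsers: list) -> None:
    """Hypothetical example: mutates the list it is iterating over."""
    for browser in browsers:
        if browser['idle']:
            # Removing shifts later items left, so the next item is skipped.
            browsers.remove(browser)


def remove_idle_fixed(browsers: list) -> None:
    """Hypothetical example: iterates over a snapshot of the list."""
    for browser in list(browsers):
        if browser['idle']:
            browsers.remove(browser)


def make_pool() -> list:
    # Stand-in data; real browser objects would look different.
    return [
        {'name': 'a', 'idle': True},
        {'name': 'b', 'idle': True},
        {'name': 'c', 'idle': False},
    ]


buggy_pool = make_pool()
remove_idle_buggy(buggy_pool)
buggy_names = [b['name'] for b in buggy_pool]  # 'b' is idle but survives

fixed_pool = make_pool()
remove_idle_fixed(fixed_pool)
fixed_names = [b['name'] for b in fixed_pool]
```

In the buggy variant the second idle entry is never visited, which is why the sketch compares the two resulting name lists.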
1.4.0
1.4.0 (2026-02-17)
🚀 Features
- Dynamic memory snapshots (#1715) (568a7b1) by @Pijukatel
- Add MySQL and MariaDB support for SqlStorageClient (#1749) (202b500) by @Mantisus
🐛 Bug Fixes
- Make log levels consistent in ServiceLocator (#1746) (4163413) by @janbuchar
- Fix PlaywrightCrawler unintentionally setting the global configuration (#1747) (fa58438) by @Pijukatel
- Fix Snapshotter handling of out of order samples (#1735) (387c712) by @Pijukatel
⚡ Performance
1.3.2
1.3.2 (2026-02-09)
🐛 Bug Fixes
- Use max() instead of min() for request_max_duration statistic (#1701) (85c4335) by @vdusek
- Prevent mutation of default URL patterns list in block_requests (#1702) (fcf9adb) by @vdusek
- Keep None values for user_data in Request (#1707) (3c575bc) by @Mantisus
- Respect max_open_pages_per_browser limit for PlaywrightBrowserController on concurrent new_page calls (#1712) (2e5534b) by @Mantisus
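
The block_requests entry above (#1702) describes a common Python pitfall: extending a shared default list in place. The following is a minimal sketch in plain Python, not the library's implementation. DEFAULT_PATTERNS, build_patterns_buggy and build_patterns_fixed are hypothetical names, and the pattern values are placeholders.

```python
from collections.abc import Iterable

# Hypothetical shared default, standing in for a library-level pattern list.
DEFAULT_PATTERNS = ['.css', '.png']


def build_patterns_buggy(extra: Iterable = ()) -> list:
    patterns = DEFAULT_PATTERNS  # alias of the shared list, not a copy
    patterns.extend(extra)       # mutates the default for every later caller
    return patterns


def build_patterns_fixed(extra: Iterable = ()) -> list:
    # Build a fresh list so the shared default is never modified.
    return [*DEFAULT_PATTERNS, *extra]


fixed_result = build_patterns_fixed(['.jpg'])
default_after_fixed = list(DEFAULT_PATTERNS)   # still the original two entries

build_patterns_buggy(['.jpg'])
default_after_buggy = list(DEFAULT_PATTERNS)   # now contains '.jpg' as well
```

Copying on read keeps each call independent, at the cost of one small list allocation per call.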
1.3.1
1.3.1 (2026-01-30)
🐛 Bug Fixes
- Reset all counter in metadata with purge for RequestQueue (#1686) (ee09260) by @Mantisus
- Set default http3=False for ImpitHttpClient (#1685) (3f390f6) by @Mantisus
- Prevent get_request from permanently blocking requests (#1684) (da416f9) by @Mirza-Samad-Ahmed-Baig
- Do not share state between different crawlers unless requested (#1669) (64c246b) by @Pijukatel
1.3.0
1.2.1
1.2.1 (2025-12-16)
🐛 Bug Fixes
1.2.0
1.1.1
1.1.1 (2025-12-02)
🐛 Bug Fixes
- Unify separators in unique_key construction (#1569) (af46a37) by @vdusek
- Fix same-domain strategy ignoring public suffix (#1572) (3d018b2) by @Pijukatel
- Make context helpers work in FailedRequestHandler and ErrorHandler (#1570) (b830019) by @Pijukatel
- Fix non-ASCII character corruption in FileSystemStorageClient on systems without UTF-8 default encoding (#1580) (f179f86) by @Mantisus
- Respect <base> when enqueuing (#1590) (de517a1) by @Mantisus