crawl4ai

mirror of https://github.com/unclecode/crawl4ai.git synced 2026-06-11 00:08:01 +00:00

Files

UncleCode d11c004fbb Enhanced BFS Strategy: Improved monitoring, resource management & configuration

- Added CrawlStats for comprehensive crawl monitoring
- Implemented proper resource cleanup with shutdown mechanism
- Enhanced URL processing with better validation and politeness controls
- Added configuration options (max_concurrent, timeout, external_links)
- Improved error handling with retry logic
- Added domain-specific queues for better performance
- Created comprehensive documentation

Note: URL normalization needs review - potential duplicate processing
with core crawler for internal links. Currently commented out pending
further investigation of edge cases.

2024-11-08 15:57:23 +08:00

examples

feat: Enhance crawler flexibility and LLM extraction capabilities

2024-10-14 21:03:28 +08:00

Update documents and README

2024-09-25 16:52:11 +08:00

md _sync

Push async version last changes for merge to main branch

2024-09-24 20:52:08 +08:00

scrapper

Enhanced BFS Strategy: Improved monitoring, resource management & configuration

2024-11-08 15:57:23 +08:00