Commit Graph

  • 9d694da939 fix(models): make model fields optional with default values UncleCode 2025-01-15 22:58:14 +08:00
  • 20c027b79c chore(cleanup): remove unused files and improve type hints UncleCode 2025-01-14 13:07:18 +08:00
  • 8878b3d032 Updated the correct link for "Contribution guidelines" in README.md (#445) devatbosch 2025-01-13 18:27:31 +05:30
  • 1ab9d115cf Fixing minor typos in README (#440) Jōnin bingi 2025-01-13 04:23:52 -08:00
  • 8ec12d7d68 Apply Ruff Corrections UncleCode 2025-01-13 19:19:58 +08:00
  • c3370ec5da refactor(scraping): replace ScrapingMode enum with strategy pattern UncleCode 2025-01-13 17:53:12 +08:00
  • f3ae5a657c feat(scraping): add LXML-based scraping mode for improved performance UncleCode 2025-01-12 20:46:23 +08:00
  • 825c78a048 refactor(dispatcher): migrate to modular dispatcher system with enhanced monitoring UncleCode 2025-01-11 21:10:27 +08:00
  • 3865342c93 Merge branch 'next' into next-cdp UncleCode 2025-01-10 16:01:49 +08:00
  • ac5f461d40 feat(crawler): add memory-adaptive dispatcher with rate limiting UncleCode 2025-01-10 16:01:18 +08:00
  • f9c601eb7e docs(urls): update documentation URLs to new domain UncleCode 2025-01-09 16:24:41 +08:00
  • e8b4ac6046 docs(urls): update documentation URLs to new domain UncleCode 2025-01-09 16:22:41 +08:00
  • 051a6cf974 docs(readme): update personal story and project vision UncleCode 2025-01-08 21:13:31 +08:00
  • 1c9464b988 Update all documents UncleCode 2025-01-08 19:31:31 +08:00
  • 6838901788 Update All docs 2025 8th Jan UncleCode 2025-01-08 19:31:17 +08:00
  • ad5e5d21ca Remove .codeiumignore from version control and add to .gitignore UncleCode 2025-01-08 13:09:23 +08:00
  • 26d821c0de Remove .codeiumignore from version control and add to .gitignore UncleCode 2025-01-08 13:08:19 +08:00
  • 010677cbee chore: add .gitattributes file UncleCode 2025-01-08 13:05:00 +08:00
  • c110d459fb Update .gitattributes UncleCode 2025-01-07 21:20:17 +08:00
  • 4d1975e0a7 Update .gitattributes UncleCode 2025-01-07 21:18:45 +08:00
  • 82734a750c Update .gitattributes UncleCode 2025-01-07 21:11:45 +08:00
  • 56fa4e1e42 refactor(doc) UncleCode 2025-01-07 20:53:10 +08:00
  • ca3e33122e refactor(docs): reorganize documentation structure and update styles UncleCode 2025-01-07 20:49:50 +08:00
  • b53835d34f Delete .codeiumignore unclecode-patch-6 UncleCode 2025-01-06 19:17:31 +08:00
  • fe52311bf4 Merge branch 'main' of https://github.com/unclecode/crawl4ai UncleCode 2025-01-06 15:20:30 +08:00
  • 01b73950ee Merge branch 'vr0.4.267' UncleCode 2025-01-06 15:20:28 +08:00
  • 12880f1ffa Update gitignore vr0.4.267 UncleCode 2025-01-06 15:19:01 +08:00
  • 53be88b677 Update gitignore UncleCode 2025-01-06 15:18:37 +08:00
  • 3427ead8b8 Update CHANGELOG UncleCode 2025-01-06 15:13:43 +08:00
  • 32652189b0 Docs: Add Code of Conduct for the project (#410) aravind 2025-01-06 10:22:51 +05:30
  • ae376f15fb docs(extraction): add clarifying comments for CSS selector behavior UncleCode 2025-01-05 19:39:15 +08:00
  • 72fbdac467 fix(extraction): JsonCss selector and crawler improvements UncleCode 2025-01-05 19:26:46 +08:00
  • 0857c7b448 Merge branch 'main' of https://github.com/unclecode/crawl4ai into next UncleCode 2025-01-05 17:05:59 +08:00
  • 07b4c1c0ed fix: not working long page screenshot (#403) Guilume 2025-01-05 17:04:34 +08:00
  • b11a91e1dd Update gitignore next-browser-farm UncleCode 2025-01-04 16:07:18 +08:00
  • 196dc79ec7 fix: prevent memory leaks by ensuring proper closure of Playwright pages UncleCode 2025-01-03 21:17:23 +08:00
  • 7aaaaae461 feat(browser-farm): Add Docker browser support for remote crawling UncleCode 2025-01-02 18:41:36 +08:00
  • 24b3da717a refactor(): UncleCode 2025-01-02 17:53:30 +08:00
  • 98acc4254d refactor: UncleCode 2025-01-01 19:47:22 +08:00
  • eac78c7993 Merge branch 'vr0.4.246' UncleCode 2025-01-01 19:43:01 +08:00
  • da1bc0f7bf Update version file vr0.4.246 UncleCode 2025-01-01 19:42:35 +08:00
  • aa4f92f458 refactor(crawler): UncleCode 2025-01-01 19:39:42 +08:00
  • a96e05d4ae refactor(crawler): optimize response handling and default settings UncleCode 2025-01-01 19:39:02 +08:00
  • 5c95fd92b4 fix(browser): resolve merge conflicts in browser channel configuration UncleCode 2025-01-01 19:05:47 +08:00
  • 4cb2a62551 Update README vr0.4.245 UncleCode 2025-01-01 18:59:55 +08:00
  • 5b4fad9e25 - Bump version to 0.4.244 UncleCode 2025-01-01 18:58:43 +08:00
  • ea0ac25f38 refactor(browser): vr0.4.244 UncleCode 2025-01-01 18:58:15 +08:00
  • 7688aca7d6 Update Version UncleCode 2025-01-01 18:44:27 +08:00
  • a7215ad972 fix(browser): update default browser channel to chromium and simplify channel selection logic UncleCode 2025-01-01 18:38:33 +08:00
  • 8e2403a7da fix(browser)!: default to Chromium channel for new headless mode (#387) Arno.Edwards 2025-01-01 18:37:50 +08:00
  • 318554e6bf Merge branch 'v0.4.243' v0.4.243 UncleCode 2025-01-01 18:11:15 +08:00
  • c64979b8dd docs: update README v0.4.243 UncleCode 2025-01-01 18:10:38 +08:00
  • bfe21b29d4 build: streamline package discovery and bump to v0.4.243 UncleCode 2025-01-01 17:53:51 +08:00
  • f76886b32b build: streamline package discovery and bump to v0.4.244 v0.4.242 UncleCode 2025-01-01 17:53:51 +08:00
  • e9d9a6ffe8 fix: ensure js_snippet files are included in package UncleCode 2025-01-01 17:38:59 +08:00
  • 5313c71a0d docs: update REAME browser installation command v0.4.241 UncleCode 2025-01-01 17:24:44 +08:00
  • d36ef3d424 refactor(install): use chromium as default browser UncleCode 2025-01-01 17:19:54 +08:00
  • 4a4f613238 docs: simplify installation instructions UncleCode 2025-01-01 16:54:03 +08:00
  • dc6a24618e feat(install): add doctor command and force browser install UncleCode 2025-01-01 16:33:43 +08:00
  • 74a7c6dbb6 feat(install): specify chrome and chromium for playwright UncleCode 2025-01-01 16:10:08 +08:00
  • 67f65f958b refactor(build): simplify setup.py configuration UncleCode 2025-01-01 15:52:01 +08:00
  • 78b6ba5cef build: modernize package configuration with pyproject.toml UncleCode 2025-01-01 15:45:27 +08:00
  • 3f019d34cc docs: update project description emojis UncleCode 2025-01-01 15:39:33 +08:00
  • 304260e484 refactor(install): simplify Playwright installation error handling UncleCode 2025-01-01 15:33:36 +08:00
  • 704bd66b63 Uphrade plawyright installation command to install dependencies UncleCode 2025-01-01 15:23:16 +08:00
  • 1acc162c18 Bumb version v0.4.241 UncleCode 2025-01-01 15:16:06 +08:00
  • 553c97a0c1 Fix bug reported in issue https://github.com/unclecode/crawl4ai/issues/396 UncleCode 2025-01-01 15:15:14 +08:00
  • bd66befcf0 Fix issue in 0.4.24 walkthrough UncleCode 2024-12-31 21:07:58 +08:00
  • 3e769a9c6c Fix issue in 0.4.24 walkthrough UncleCode 2024-12-31 21:07:33 +08:00
  • 19b0a5ae82 Update 0.4.24 walkthrough UncleCode 2024-12-31 21:01:46 +08:00
  • bd71f7f4ea Add 0.4.24 walkthrough v0.4.24 UncleCode 2024-12-31 20:22:33 +08:00
  • 171ce25ba6 Fixe typo in CHANGELOG UncleCode 2024-12-31 19:49:00 +08:00
  • 6c5a44f774 chore: bump version to 0.4.25 UncleCode 2024-12-31 19:45:48 +08:00
  • 5c3c05bf93 docs: update README badges and Docker section, reorganize documentation structure UncleCode 2024-12-31 19:45:02 +08:00
  • 67d0999bc3 chore: resolve merge conflicts for v0.4.24 v0.4.24 UncleCode 2024-12-31 19:24:03 +08:00
  • 553a4622bf chore: prepare for version 0.4.24 UncleCode 2024-12-31 19:18:36 +08:00
  • 6f81ef006d Remove .local folder from remote repository UncleCode 2024-12-31 17:37:50 +08:00
  • a04870a662 Remove .do folder UncleCode 2024-12-31 17:37:14 +08:00
  • f7d26390c5 Remove .do folder UncleCode 2024-12-31 17:36:22 +08:00
  • 141783fb2d Remove .do folder from remote repository UncleCode 2024-12-31 17:35:57 +08:00
  • 2fedd4876e Update gitignore UncleCode 2024-12-31 17:35:34 +08:00
  • e187b0aaf0 update gitignore UncleCode 2024-12-31 17:34:31 +08:00
  • e95374d7c6 Delete .do/deploy.template.yaml (#394) UncleCode 2024-12-31 10:33:59 +01:00
  • 406702a77f Delete .do/deploy.template.yaml unclecode-patch-5 UncleCode 2024-12-31 17:33:39 +08:00
  • 8f2d0cda2f Remove .do folder from remote UncleCode 2024-12-31 17:32:55 +08:00
  • 9d261d2b9c Recreate .do folder with temporary file UncleCode 2024-12-31 17:32:44 +08:00
  • 7792fe0e4c Recreate .do folder for removal UncleCode 2024-12-31 17:31:51 +08:00
  • 86259244e4 Add ".do" to gitignore UncleCode 2024-12-31 17:30:09 +08:00
  • 0ec593fa90 Update the Tutorial section for new document version UncleCode 2024-12-31 17:27:31 +08:00
  • 7391d6be73 Update README.md (#390) UncleCode 2024-12-30 14:24:43 +01:00
  • 494ee32619 Update README.md unclecode-patch-4 UncleCode 2024-12-30 21:24:30 +08:00
  • e4e23065f1 Update README.md (#389) UncleCode 2024-12-30 14:24:06 +01:00
  • 8a4952c128 Update README.md unclecode-patch-3 UncleCode 2024-12-30 21:23:19 +08:00
  • fb33a24891 Commit Message: - Added examples for Amazon product data extraction methods - Updated configuration options and enhance documentation - Minor refactoring for improved performance and readability - Cleaned up version control settings. UncleCode 2024-12-29 20:05:18 +08:00
  • 78768fd714 Update simple-crawling.md (#379) Robin Singh 2024-12-27 09:42:59 +00:00
  • f2d9912697 Renames browser_config param to config in AsyncWebCrawler UncleCode 2024-12-26 16:34:36 +08:00
  • 9a4ed6bbd7 Commit Message: Enhance crawler capabilities and documentation UncleCode 2024-12-26 15:17:07 +08:00
  • d5ed451299 Enhance crawler capabilities and documentation - Add llm.txt generator - Added SSL certificate extraction in AsyncWebCrawler. - Introduced new content filters and chunking strategies for more robust data extraction. - Updated documentation. UncleCode 2024-12-25 21:34:31 +08:00
  • d97a075082 Delete a.md unclecode-patch-2 UncleCode 2024-12-25 19:43:39 +08:00
  • bacbeb3ed4 Fix #340 example llm_extraction (#358) Haopeng138 2024-12-24 12:56:07 +01:00