Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature Request: Implement Question-Based Crawler with Search Engine Integration #413

Open
TheCutestCat opened this issue Jan 6, 2025 · 0 comments

Comments

@TheCutestCat
Copy link
Contributor

Hi,
I'd like to contribute to the Question-Based Crawler feature listed in the roadmap under "Natural language driven web discovery and content extraction". I propose implementing this using free search engine APIs (like DuckDuckGo) combined with LLM-based filtering of the retrieved results. The functionality could be implemented similarly to EXA's approach.

Questions/Discussion Points:

  1. What would be the most appropriate location in the current codebase structure to implement these advanced features?
  2. Which branch should this feature development be based on?

I would like to take on this task and contribute to its development. Looking forward to guidance on the architectural decisions before proceeding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant