Finish challenge #291

Open
wants to merge 5 commits into
base: master
from

Conversation


@marselmustafin marselmustafin commented Dec 16, 2024

Solution Overview

The core idea of the solution is a scraper that uses multiple strategies to parse the image gallery displayed at the top of the search results page. Because Google may return differently structured results for similar queries, each strategy relies on its own specific selectors and is encapsulated within its own class.
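
The strategy-per-layout idea can be sketched roughly as follows. This is a minimal illustration, not the code from this pull request: the module and class names, the markup, and the patterns are all hypothetical, and plain string scanning stands in for real selectors only to keep the sketch dependency-free.

```ruby
# Hypothetical sketch: names, markup, and patterns are illustrative only
# and are not taken from the pull request.
module Sketch
  # Base strategy: each subclass knows how to handle one gallery layout.
  class Strategy
    def initialize(html)
      @html = html
    end

    # True when this strategy recognises the layout in the document.
    def applicable?
      raise NotImplementedError
    end

    # Returns an array of hashes such as { name: "..." }.
    def parse
      raise NotImplementedError
    end
  end

  # Assumed carousel markup: <a class="carousel-item" title="NAME">
  class CarouselStrategy < Strategy
    ITEM = /<a class="carousel-item" title="([^"]*)"/

    def applicable?
      @html.match?(ITEM)
    end

    def parse
      @html.scan(ITEM).flatten.map { |name| { name: name } }
    end
  end

  # Assumed grid markup: <div class="grid-cell" data-name="NAME">
  class GridStrategy < Strategy
    ITEM = /<div class="grid-cell" data-name="([^"]*)"/

    def applicable?
      @html.match?(ITEM)
    end

    def parse
      @html.scan(ITEM).flatten.map { |name| { name: name } }
    end
  end

  # Strategies are tried in order; the first applicable one wins.
  STRATEGIES = [CarouselStrategy, GridStrategy].freeze

  def self.scrape(html)
    strategy = STRATEGIES.map { |klass| klass.new(html) }.find(&:applicable?)
    strategy ? strategy.parse : []
  end
end
```

With this shape, supporting a new layout would mean adding one class and appending it to the list, without touching the existing strategies.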

Supported Cases

The solution currently handles the following cases:

  • Van Gogh Paintings (challenge case)
  • List of Popes (another type of carousel)
  • Red Hot Chili Peppers Members (grid)
  • Van Gogh Paintings (new request, another grid)

Commands

  • ruby scrape.rb - runs the scraper on all HTML documents and saves the results in the files dir.
  • bundle exec rspec --color spec/scraper/knowledge_graph_spec.rb - runs the tests.
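
For orientation, a batch run like the first command could look something like the sketch below. It is an assumption-laden illustration rather than the contents of scrape.rb: the helper name scrape_all, the directory arguments, and the JSON output format are all hypothetical.

```ruby
require "json"
require "fileutils"

# Hypothetical batch runner: the method name, directory layout, and JSON
# output are assumptions, not the behaviour of the real scrape.rb.
# Yields each document's HTML to the block and writes the block's result
# to a JSON file named after the input file.
def scrape_all(input_dir, output_dir)
  FileUtils.mkdir_p(output_dir)
  Dir.glob(File.join(input_dir, "*.html")).sort.map do |path|
    result = yield File.read(path)
    target = File.join(output_dir, "#{File.basename(path, '.html')}.json")
    File.write(target, JSON.pretty_generate(result))
    target # return the written path so callers can report it
  end
end
```

A caller would pass the actual parsing step as the block, for example scrape_all("html", "files") { |html| parse(html) }, where both directory names and parse are placeholders.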
