Google System Design | L5 | Reject
- Author
- Shared Anonymously
You are building a company that scrapes events from news sources, social media, and other sources.
We also need to provide the ability to search for events, book tickets, and rate events.
Basically a web crawler + ticket booking website.
Deep dive was focused on:
- How do you turn the unstructured data scraped from news sources into structured data at scale? We don't want to show duplicate events.
- How can you identify that two events from two different news sources are the same event?
- How will you change the status of an event based on a future news event?
- How will you identify the correctness of an event?
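For the last two questions, one possible approach (not something confirmed in the interview) is to treat each later news item as a claim about an event and only change the stored status once enough independent sources agree. The sketch below assumes this; `EventRecord`, `apply_report`, and the `min_sources` threshold are all hypothetical names and values chosen for illustration.

```python
from dataclasses import dataclass, field
from enum import Enum


class Status(Enum):
    SCHEDULED = "scheduled"
    POSTPONED = "postponed"
    CANCELLED = "cancelled"


@dataclass
class EventRecord:
    name: str
    status: Status = Status.SCHEDULED
    # Distinct sources backing each claimed status, keyed by status.
    reports: dict = field(default_factory=dict)


def apply_report(event: EventRecord, claimed: Status, source: str,
                 min_sources: int = 2) -> Status:
    """Record a claim and switch status once enough distinct sources agree.

    min_sources is an assumed threshold, not a value from the post.
    """
    event.reports.setdefault(claimed, set()).add(source)
    if len(event.reports[claimed]) >= min_sources:
        event.status = claimed
    return event.status
```

Because sources are stored in a set, the same outlet repeating a claim does not count twice, which is a crude stand-in for a correctness check.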
Came up with a design for the web crawler, discovery of events, and booking logic.
However, the major conversation was around the above topics. Gave a few ideas like vector search and similarity.
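To make the similarity idea concrete, here is a minimal sketch of how two scraped events might be flagged as duplicates. It assumes events have already been extracted into a title, venue, and date, and it uses plain bag-of-words cosine similarity as a stand-in for the embedding-based vector search mentioned above. The `Event` fields, `same_event`, and the 0.6 threshold are assumptions for illustration, not details from the interview.

```python
import math
import re
from collections import Counter
from dataclasses import dataclass
from datetime import date


@dataclass
class Event:
    title: str
    venue: str
    day: date


def tokens(text: str) -> Counter:
    # Lowercased bag-of-words vector; a real system would likely use learned embeddings.
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    # Cosine similarity between two sparse term-count vectors.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = (math.sqrt(sum(v * v for v in a.values()))
            * math.sqrt(sum(v * v for v in b.values())))
    return dot / norm if norm else 0.0


def same_event(x: Event, y: Event, threshold: float = 0.6) -> bool:
    # Assumed rule: same day, and title + venue text is similar enough.
    if x.day != y.day:
        return False
    return cosine(tokens(x.title + " " + x.venue),
                  tokens(y.title + " " + y.venue)) >= threshold
```

At scale, comparing every pair is too expensive, so the date check would typically become a blocking key and the cosine step would be replaced by an approximate nearest-neighbor lookup in a vector index.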