Creating Companies Table
Let’s say that you’re in charge of creating a new companies table at your new startup’s database. Your boss wants you to track all major companies that could be interesting leads for salespeople.
You’re given a huge list of scraped company names from many different websites with a few fields such as:
No. of Employees
The scraped list may include duplicates and misspellings. How would you create a new
production_companies table prioritizing correctness while also ensuring we do not include duplicates?