Comparing Search Engines

Let’s say you’re working on building a better search engine for Google. You build it and want to see whether it serves better results than the one currently in production.

How would you determine which search engine performs better? Which metrics would you track?

