Metaweb has built a large database to support collaborative web applications. Now we need to help people find all the information stored inside it, so we're looking for a principal engineer/architect to play a key role in our relevance, ranking and fuzzy matching efforts.
This isn't the typical "search the web" job. It's a whole new way of thinking about how data is stored, accessed and used. It's a very hands-on position: your chance to help us revolutionize data storage and search by driving, designing and implementing ranking solutions and by improving our database and search platform.
If you're interested, send us answers to some of the following questions:
1. What's the difference between a (traditional SQL) database and a search engine? What do you see as some of the issues involved with integrating them?
2. Which aspects of the search results on www.freebase.com could use improvement? How would you make these improvements?
3. Wikipedia is full of structured data expressed in text form. How would you go about extracting some of this data?
Ideally, you'd already have worked on relevance at a major search engine and have experience optimizing ranking. Here you'll be able to put that knowledge to work on our extremely densely cross-linked dataset and also come up with brand new techniques for improving results, techniques that our database makes possible.
Experience with clustering and classification (linear classifiers, k-means, SVM) would be really helpful. The ability to moonlight in text mining and structured data extraction is a huge plus.