We are looking for an experienced Architect to work with our Engineering team to map the future of our big data infrastructure. Over the past year, our Engineering team has built an amazing infrastructure for hosting and distributing the world’s data, and now it’s time to take it to the next level. We use more than half a dozen best-in-class databases and tools, including HBase, Elasticsearch, Flume, Chef, Pig, and Hadoop. Together, these technologies form a world-class platform for collecting and distributing data.
Core to our philosophy, and our primary mission, is the democratization of the world’s data. This backend infrastructure is critical to our product and to our progress toward that goal. Your contributions would help the rest of the world by taking the monkey-work out of dealing with data.
For some public examples of our projects, see our labs page at http://www.infochimps.com/labs
You are an ideal candidate if you enjoy working on big problems and having a big impact early in a company’s life. You should have a deep understanding of design patterns and the Unix way, and an intuitive feel for maintaining infrastructure, untangling bugs, and simplifying systems.
Natural Language Processing algorithms
ETL (Extract, Transform, and Load) experience
Unsupervised clustering algorithms
Large-scale data processing