Are billions of transactions, terabytes of data and 100’s of mbps of bandwidth the reason you wake up in the morning? Yeah, us too.
Have you built web-scale infrastructure, monitoring and escalation procedures before and doing it again, but this time better, faster and more reliably is all you can think about? The ideal candidate will have maintained proprietary applications in a high-availability production environment and have the ability to rapidly self-educate on new concepts and methods.
Responsibilities (what you get to do every day!):
Principle: 100% customer satisfaction
Work with Engineering to continuously refine our architecture at scale
Responsible for planning, designing, and implementing the appropriate infrastructure needed to ensure the stability, integrity, and efficient operation of enterprise information systems that support Disaster Recovery functions.
This position requires the ability to take the vision of the IT Architecture group and translate it into reliable Disaster Recovery technology solutions.
Own and manage our multiple data centers across the US and internationally
Manage outside vendor relationships—much of our infrastructure is managed services
Scale up systems/centers to maintain a steady-state maximum @ < 40% of capacity
00’s of servers, gbit’s of bandwidth, 10’s of T’s of data/day
Manage any outages, emergencies removing or reducing customer impact
Responsible for all facets of servers and Java-application servers for applications.
Oversee internal systems—e-mail, Wiki, tickets, etc (Jira, Confluence
Monitor system performance from bandwidth to response times
Participate in on-call rotation
Investigate and recommend ways to more elegantly/efficiently enhance processing time, reliability, scalability and ease of deployment
Produce and maintain documentation on installations, incidents and FAQs through Confluence
Contribute to planning efforts for disaster recovery, capacity expansion, component upgrading and system hardening
Maintain data center operation procedures in collaboration with Engineering staff and Client Services staff (Sales & Account Management)
As a senior IT engineer, this position requires all of the below skills & experiences
BS/MS in CS or Engineering discipline
7+ years production internet systems administration
Configuration/administration of apache
Configuration/administration of tomcat
Apache ant—it’s EVERYWHERE
Advanced networking experience
Working with IPSEC VPN’s
Load-balancer configuration—F5, Cisco, Foundry
Experience with IP telephony systems
Familiarity with storage solutions from attached, JBOD, NAS, SAN
Managed data centers with 1000’s of nodes
Experience working with outside IT vendors—Hardware, Rackspace, Akamai, etc.
Team player—it’s a small team, and at 1:00 AM, we eat pizza together!
Self-starter & fast-learner
Aggressive—we’re in it to win it!
Great communicator—we act as a team, we celebrate successes together and we brainstorm our way together through challenges
Additional language skills, one or more of: perl, php, Java, c/c+
Proficient in SQL (Postgres a plus)
Oracle DBA would be sweet
Performance testing and tuning experience
Software development experience a plus
Working with Fortune 500/1000 a plus
• BS/MS in CS or Engineering discipline