Although Riak is open source, Basho currently has many commercial clients. Our client opportunities are growing in both number and scope, and we are looking for a talented, highly motivated Operations Engineer to join the Professional Services team. The Operations Engineer must combine outstanding technical expertise with the communication skills required to architect, design, and develop scalable distributed storage applications. The Operations Engineer provides customer-facing support of database technologies for a variety of customers in designated regions.
This is an exciting early-stage opportunity with a privately held startup company!
Assist customers with the planning and design of system architecture to allow rapid scaling to meet demand.
Design and implement systems and procedures for customers to monitor and measure servers, processes, networks and other important metrics.
Lead the storage hardware evaluation process for customers, should they require it.
Consult with internal and external engineers on design and quality standards for building system tools to support infrastructure (custom monitoring scripts, automation, etc.), always keeping in mind that "Infrastructure is Code".
Define and implement procedures to facilitate knowledge sharing and training for internal and external customers. Ensure adequate support knowledge and/or documentation is available at all times.
Respond to issue alerts by following appropriate problem management procedures. Handle escalation and communication with senior management.
Work with customers to plan and coordinate the implementation of moderate to complex system and software migrations.
Write maintainable code with extensive test coverage, working in a professional services engineering environment (with source control, dev/stage/prod release cycle, continuous deployment) - cowboy coders need not apply.
Distill extremely complex infrastructure environments into supportable, elegant solutions.
Change gears quickly, adapt rapidly, and perform under pressure.
Develop processes and frameworks to implement solutions.
3 – 8 years of Linux / Unix systems administration experience
Extremely fluent in systems automation tools such as Puppet, Chef, and Capistrano and scripting languages such as Python or Bash
Skilled in Linux or Sun system administration, including configuration, troubleshooting, and automation
Experienced with architecting, deploying, and maintaining monitoring and data collection/trending solutions (Nagios / Cacti / Munin / Graphite).
Experienced with web servers and load balancers (nginx, HAProxy and Varnish) and high availability systems.
An expert with running high-use services, including requirements and capacity analysis, service configuration, and change management
An excellent communicator (especially in writing) with serious time management skills
A team player who is willing to do whatever it takes to make the team successful
Experienced with automating software installation and administration of all descriptions on remote servers
A track record of troubleshooting and creating effective, enduring solutions
Knowledgeable about common network protocols and able to find your way around an RFC
Passionate about DevOps / WebOps culture
Familiarity with supporting database servers in a high-availability, high-performance clustered environment.
Strong customer relations & presentation skills - must be able to speak well in front of an audience and with customers
Self-motivated and self-directed
Travel is required for this position
Preferred but not required:
Programming/Scripting skills (Ruby, Python, Perl, Erlang, Java, Bash, etc.)
Large scale infrastructure experience
Knowledge of data center management, systems management, monitoring, networking, and security.
BS/BA degree in Computer Science or related field or equivalent, relevant work experience