Senior Systems Reliability / K8s Engineer

Company Background:
Kinetic Data is an emerging leader at the forefront of the $300 billion+ market for digital workflow and service portal automation. The Kinetic Platform offers unrivaled flexibility to launch and manage continuous process improvement as needs change and evolve across the enterprise...and not just in IT. 
The highly-scalable, multi-tenant platform is built for the most complex and demanding environments and has proven value for the most stringent demands of multiple federal agencies and U.S. military and commercial organizations for cross-platform integration and a unified “single pane of glass” customer experience.
The Kinetic Platform is made up of a React front-end, Java + Ruby back-end, and a mix of Cassandra and relational databases for persistence. Our customers can choose to install the platform on-prem or leverage our AWS-hosted solution called kinops.
Our Core Values:
High Passion- Involved and invested to help shape Kinetic’s future in your role
Pride in Work- Striving to become the best at your work
Customer Success Driven- Delivers the right solution with a customer-first mindset
Self Motivated- Identifies, prioritizes and takes ownership of meaningful work
Honesty and Integrity- Does the right thing even when no one is watching
Position Summary:

As the Senior SRE, you will be working directly with the development team in order to maintain and mature our AWS-hosted kinops offering, Kubernetes container strategy and overall application lifecycle practices.

Responsibilities shall include in whole or in part:
  • Maintaining and maturing the hosted offering
  • Maintaining and improving the monitoring and observability of the hosted offering
  • Coordinating upgrades of the underlying infrastructure
  • Coordinating upgrades or migrations related to the Kinetic Platform itself
  • Participate in the complete application development lifecycle
  • Maintaining and maturing Kubernetes patterns and practices
  • Participate in crisis management and disaster recovery response efforts
Skills and Expertise (Required):
  • Strong ability to work independently or with a group
  • Deep working knowledge of Docker, Kubernetes, and Helm
  • Working knowledge of AWS cloud services
  • Working knowledge of networking
  • Experience and working knowledge of Linux
  • Proficiency with Bash scripting
  • Familiarity with Git
Skills and Expertise (Desired):
  • Experience with the Java language, tooling, and basic JVM tuning
  • Experience with the Ruby language and tooling
  • Experience with performance testing
  • Hands-on experience with SAML, OAuth, and other SSO
  • Hands-on experience with SSL certificate management
  • Familiarity with Elasticsearch, Kibana, and structured logging
  • Familiarity with Cassandra and PostgreSQL/MSSQL
  • Familiarity with AWS and AWS EKS
  • A strong knowledge of Enterprise Software deployment and management
What you'll need to apply:
  • Post-secondary degree or equivalent experience in related fields for computer science
  • Proven ability to work effectively in a fast-paced, high-growth, rapidly changing environment
  • Proof of US citizenship
  • Comprehensive Health insurance includes full premium coverage and 90% of out of pocket, in network expenses
  • SIMPLE IRA match
  • Remote first, flexible working hours
If this sounds like you, email your resume to us at