Site Reliability Engineer at Freed managing cloud infrastructure and ensuring system reliability. Collaborating with engineers on infrastructure needs while implementing security best practices.
Responsibilities
Manage and expand cloud infrastructure (Azure, Kubernetes), putting in place IaC best practices
Implement observability instrumentation, dashboards and alerts to monitor product health
Collaborate with engineers on product teams for new infrastructure needs or dev experience improvements and make application-level changes that improve system reliability (e.g. database configuration, JS bundle delivery, caching or network resiliency)
Implement security requirements (HITRUST and SOC 2) and collaborate with the Technical Program Manager on audits
Maintain databases (PostgreSQL, Redis) with backups, migrations, and security/privacy controls while monitoring performance and stability
Requirements
7+ years of experience in an SRE, Production Engineering, Infrastructure Engineering, or related role
Strong proficiency with SQL, Git, Kubernetes, Bash, and Networking (DNS, SSL, IP)
Familiarity with Azure, JavaScript/TypeScript, Python, GitHub, and VS Code
Security-oriented mindset and experience implementing security best practices
Benefits
Competitive salary and equity in a high-growth company
DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting-edge technologies in a dynamic environment.
Software Quality and Release Engineer developing and maintaining C++/Python software solutions for the aerospace and defense industry. Collaborating on CI/CD automation and feedback documentation.
Site Reliability / DevOps Engineer developing Big Data platforms for clients in Telco and Retail industries. Focus on stability, scalability, and performance of large-scale data processing systems.
Senior DevOps Engineer building and managing big data platforms for clients in telecommunications and finance industries. Ensuring stability, scalability, and performance across cloud and on-premises environments.
Site Reliability Engineer ensuring reliability, automation, and observability across cloud infrastructures for Diligent. Leading initiatives to improve performance in fast-paced environments.
Senior DevOps Engineer leading DevOps design and implementation for gaming projects at Stillfront. Collaborating with international teams to enhance gaming infrastructure and reduce costs.
Mainframe DevOps Engineer at Kyndryl enhancing mainframe delivery practices and migrating SCM to Azure DevOps. Requires extensive Mainframe development experience and DevOps skills.
DevOps/MLOps Engineer designing, automating, and maintaining scalable infrastructure for a federal client. Collaborating with software engineers and data scientists on resilient solutions.