Software Engineer developing observability systems at Airtable. Designing and evolving logging, metrics, and tracing pipelines for reliability and performance.
Responsibilities
Architect and scale core observability
Lead the design and evolution of logging, metrics, and tracing pipelines to handle massive data volumes
Evaluate and integrate new technologies (e.g., OpenTelemetry, ClickHouse, ELK Stack) that enhance Airtable’s observability posture
Guide and mentor a growing team of infrastructure engineers; share best practices in distributed tracing, monitoring, and logging
Define and uphold coding standards and operational excellence across the org
Partner with Deploy Infrastructure, Service Orchestration, and Product teams to embed observability throughout the development lifecycle
Align infrastructure decisions with business goals to detect issues before they impact customers
Own end-to-end reliability for observability tools and establish SLAs, SLOs, and error budgets
Optimize performance and cost of large-scale data pipelines and storage
Shape the observability roadmap, prioritizing initiatives like improved tracing coverage, advanced monitoring dashboards, and next-gen logging pipelines
Continuously explore emerging trends to keep Airtable’s monitoring capabilities at the cutting edge
Extend observability to LLM and AI features
Instrument prompts, model calls, and RAG pipelines to capture latency, reliability, cost, and safety signals
Design online and offline evaluation loops for LLM quality, including canary analysis and drift detection
Build dashboards and alerts for token usage, error rates, guardrail triggers, and model performance; connect these signals to tracing for prompt lineage
Partner with AI and Product teams to define SLOs for AI features and close the feedback loop from incidents to model and prompt improvements
Requirements
6+ years of software engineering experience, with 3+ years focused on observability or infrastructure at scale.
Demonstrated success implementing and running production-grade logging, metrics, or tracing systems.
Proficiency in distributed systems concepts, data streaming pipelines, and container orchestration (Kubernetes).
Deep hands-on knowledge of tools such as Prometheus, Grafana, Datadog, OpenTelemetry, ELK Stack, Loki, or ClickHouse.
Comfort with at least one programming language (e.g., Go, Python, Java) to build and maintain observability tooling.
Experience mentoring engineers and collaborating across multiple teams.
Strong communication skills to effectively present technical trade-offs and architectural plans.
Eagerness to own high-impact initiatives from design through production and maintenance.
Proven ability to balance short-term fixes with long-term strategic vision.
A passion for enabling Airtable’s entire engineering organization through reliable, intuitive observability tools.
Commitment to measuring success by the velocity and confidence with which product teams can ship.