Senior Data Architect responsible for building data infrastructure at Trexquant, integrating diverse datasets for research and simulation applications. Collaborating with teams to enhance data accessibility and quality.
Responsibilities
Architect and implement a unified data platform that integrates hundreds of vendor datasets, providing consistent, accessible, and high-quality data to simulators and researchers.**
Design efficient storage and retrieval systems to support both large-scale historical backtesting and high-frequency research workflows.**
Develop intuitive researcher interfaces and APIs that allow users to easily discover variables, explore metadata, and assemble data into standardized stocks × values matrices for rapid hypothesis testing.**
Collaborate closely with quantitative researchers and simulation teams to understand their workflows, ensuring the data platform meets real-world analytical and performance needs.**
Establish best practices for data modeling, normalization, versioning, and quality control across asset classes and data vendors.**
Work with infrastructure and DevOps teams to optimize data pipelines, caching, and distributed storage for scalability and reliability.**
Prototype and deploy internal data applications that enhance research productivity and data transparency.**
Mentor and guide data engineers to maintain robust, maintainable, and well-documented data systems.**
Requirements
7+ years of experience in data architecture, quantitative research infrastructure, or large-scale data engineering in a financial or research-driven environment.**
Proven experience designing and implementing scalable data storage solutions (e.g., columnar databases, time-series systems, object stores, or data lakes).**
Strong proficiency in Python and familiarity with modern data stack technologies (e.g., Parquet, Arrow, Spark, SQL/NoSQL, distributed file systems).**
Deep understanding of time-series and financial data modeling, including handling multiple vendors, instruments, and frequencies.**
Experience building data interfaces, APIs, or tools that serve researchers, data scientists, or quantitative analysts.**
Ability to translate research needs into efficient data schemas and access patterns.****
Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, Mathematics, or a related quantitative field.**
Strong collaboration, communication, and documentation skills.**
Familiarity with cloud-based architectures (e.g., AWS, GCP, Azure) and modern data governance practices is a plus.
Benefits
Competitive salary plus bonus based on individual and company performance.**
Collaborative, casual, and friendly work environment.**
PPO health, dental, and vision insurance premiums fully covered for you and your dependents.**
Data Engineer managing customer datasets to enhance industrial research and development. Responsible for ETL pipelines and data ingestion for the Uncountable Web Platform.
Data Engineer designing and maintaining scalable data solutions on Databricks for clinical trials. Collaborating with teams to overcome data challenges and ensure the smooth logistics of clinical supplies.
Senior Manager leading a team of database engineers to manage CCC's data platform. Overseeing mission - critical applications and collaborating with cross - functional teams in a hybrid environment.
As a Principal Data Architect at Solstice, lead the design and implementation of data architecture solutions. Ensure data integrity, security, and accessibility to meet strategic organizational goals.
Data Platform Specialist overseeing data workflows and enhancing data quality for Stackgini's AI - driven IT solutions. Collaborating with teams to drive improvements and stakeholder support.
Data Engineer designing data pipelines in Python for a major railway industry client. Collaborate with Data Scientists and ensure code quality with agile methodologies.
Senior Data Engineer responsible for building and optimizing data pipelines for banking analytics initiatives. Collaborating with data teams to ensure data quality and readiness for enterprise use.
Senior Data Engineer developing scalable data solutions on Databricks for analytics and operational workloads. Collaborating with cross - functional teams to modernize the data ecosystem.
Data Engineer focused on analytics and data pipeline development for network optimisation. Collaborating with teams to deliver high - quality data solutions with Python and SQL.
Senior Product Manager defining platform capabilities for Data Cloud in Salesforce. Collaborating with R&D teams while shaping product strategy for Data 360 integration.