Senior Machine Learning Systems Engineer, Ads ML Experience Platform

Remote - United States·Posted today
developer-toolsaimlspark
<div class="content-intro"><div class="c-message_kit__blocks c-message_kit__blocks--rich_text"> <div class="c-message__message_blocks c-message__message_blocks--rich_text" data-qa="message-text"> <div class="p-block_kit_renderer" data-qa="block-kit-renderer"> <div class="p-block_kit_renderer__block_wrapper p-block_kit_renderer__block_wrapper--first"> <div class="p-rich_text_block"> <div class="p-rich_text_section">Reddit is a community of communities. It’s built on shared interests, passion, and trust, and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 126 million daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit <a class="c-link" href="http://www.redditinc.com/" target="_blank" data-stringify-link="http://redditinc.com" data-sk="tooltip_parent">www.redditinc.com</a>.</div> </div> </div> </div> </div> </div></div><p>Reddit has a flexible workforce! If you happen to live close to one of our physical office locations our doors are open for you to come into the office as often as you'd like. Don't live near one of our offices? No worries: You can apply to work remotely in any country in which we have a physical presence</p> <p><strong>Team Overview</strong></p> <p>We are building the next generation of ML research tools and agentic AI platforms that power machine learning development across Reddit. Our mission is to accelerate the Ads ML lifecycle – from experimentation and training to deployment, evaluation, and autonomous operations – through scalable platform services, intelligent automation, and developer-centric tooling.</p> <p>Our team owns critical platform capabilities including offline ML experimentation systems, production training orchestration frameworks, ML lifecycle automation and, agentic ML frameworks that enable faster model iterations.</p> <p>We are looking for an experienced engineer with deep expertise in large-scale distributed systems, ML platforms, and emerging agentic architectures to help define and build the foundational tooling for the next generation of our machine learning devX tooling.</p> <p><strong>What You’ll Do</strong></p> <ul> <li>Design and build large-scale offline ML experimentation platforms that enable reproducible research, model development, evaluation, and promotion workflows.</li> <li>Develop production-grade training orchestration frameworks supporting distributed training, hyperparameter optimization, model evaluation, and automated retraining.</li> <li>Build infrastructure for experiment tracking, metadata management, lineage, artifact versioning, model registries, and reproducibility.</li> <li>Partner with ML engineers and researchers to improve experimentation velocity and operational efficiency.</li> <li>Build automated workflows for model promotion, rollback, compliance validation, and continuous evaluation.</li> <li>Design and build an agentic AI execution platform supporting autonomous and human-in-the-loop workflows, including multi-agent orchestration, memory/context systems, and scalable workflow infrastructure.</li> </ul> <p><strong>What You Bring</strong></p> <ul> <li>5+ years in infrastructure/platform engineering or large-scale distributed systems.</li> <li>2+ years of hands-on experience building and operating production ML infrastructure, developer SDKs, platform APIs, or self-service AI tooling.</li> <li>Experience building workflow orchestration systems, developer platforms, or large-scale automation frameworks.</li> <li>Experience with distributed data processing systems such as Spark, Flink, Ray, or equivalent technologies.</li> <li>Experience with modern orchestration and workflow technologies such as Kubeflow, Argo, Airflow, or similar frameworks.</li> <li>Experience building offline ML experimentation platforms, model registries, experiment tracking systems, or training orchestration frameworks.</li> <li>Experience building and operating agentic AI systems, including multi-agent orchestration, autonomous workflows, and agent communication/runtime frameworks (e.g., MCP, A2A, and orchestration systems) is a strong plus</li> <li>Experience running end-to-end model development and iteration cycles at scale is a plus</li> </ul><div class="content-pay-transparency"><div class="pay-input"><div class="description"><p><strong>Pay Transparency:</strong></p> <p>This job posting may span more than one career level.</p> <p>In addition to base salary, this job is eligible to receive equity in the form of restricted stock units, and depending on the position offered, it may also be eligible to receive a commission. Additionally, Reddit offers a wide range of benefits to U.S.-based employees, including medical, dental, and vision insurance, 401(k) program with employer match, generous time off for vacation, and parental leave. To learn more, please visit <a href="https://www.redditinc.com/careers/" target="_blank">https://www.redditinc.com/careers/</a>.</p> <p>To provide greater transparency to candidates, we share base salary ranges for all US-based job postings regardless of state. We set standard base pay ranges for all roles based on function, level, and country location, benchmarked against similar stage growth companies. Final offer amounts are determined by multiple factors including, skills, depth of work experience and relevant licenses/credentials, and may vary from the amounts listed below.</p></div><div class="title">The base salary range for this position is:</div><div class="pay-range"><span>$216,700</span><span class="divider">&mdash;</span><span>$303,400 USD</span></div></div></div><div class="content-conclusion"><p>In select roles and locations, the interviews will be recorded, transcribed and summarized by artificial intelligence (AI). You will have the opportunity to opt out of recording, transcription and summarization prior to any scheduled interviews.</p> <p><span style="font-weight: 400;">During the interview, we will collect the following categories of personal information: Identifiers, Professional and Employment-Related Information, Sensory Information (audio/video recording), and any other categories of personal information you choose to share with us. We will use this information to evaluate your application for employment or an independent contractor role, as applicable.&nbsp; We will not sell your personal information or disclose it to any third party for their marketing purposes.&nbsp; We will delete any recording of your interview promptly after making a hiring decision.&nbsp; For more information about how we will handle your personal information, including our retention of it, please refer to our <a href="https://redditinc.com/policies/candidate-privacy-policy">Candidate Privacy Policy for Potential Employees and Contractors</a>.</span></p> <p><em><span style="font-weight: 400;">Reddit is proud to be an equal opportunity employer, and is committed to building a workforce representative of the diverse communities we serve.&nbsp; Reddit is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If, due to a disability, you need an accommodation during the interview process, please let your recruiter know.</span></em></p></div>