Find your next LLM, RAG & AI Agent engineering role
316+ open roles · 10+ companies hiring
Research Scientist, Frontier Risk Evaluations
Scale AI· San Francisco, CA; New York, NY
Scale Labs, Research Scientist — Frontier Risk Evaluations As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems. Building on this expertise, Scale Labs has launched a new team focused on policy research, to bridge the gap between AI research and global policymakers to make informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. This team collaborates broadly across industry, the public sector, and academia and regularly publishes our findings. We are actively seeking talented researchers to join us in shaping this vision. As a Research Scientist focused on Frontier Risk Evaluations, you will design and create evaluation measures, harnesses and datasets for measuring the risks posed by frontier AI systems. For example, you might do any or all of the following: Design and build harnesses to test AI models and systems (including agents) for dangerous capabilities such as security vulnerability exploitation, CBRN uplift, and other high-risk activities; Work with government agencies or other labs to collectively scope and design evaluations to measure and mitigate risks posed by advanced AI systems; Publish evaluation methodologies and write technical reports for policymakers. Ideally you’d have: Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance. Practical experience conducting technical research collaboratively. You should be comfortable building and instrumenting ML pipelines, writing evaluation harnesses, and quickly turning new ideas from the research literature into working prototypes. 
A track record of published research in machine learning, particularly in generative AI. At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development. Strong written and verbal communication skills to operate in a cross-functional team. Nice to have: Experience in crafting evaluations and benchmarks, or a background in data science roles related to LLM technologies. Experience with red-teaming or adversarial testing of AI systems. Familiarity with AI safety policy frameworks (e.g., NIST AI RMF, EU AI Act, Korea AI Basic Act). Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any LeetCode-style questions. If you’re excited about advancing AI safety and contributing to our mission, we encourage you to apply, even if your experience doesn’t perfectly align with every requirement. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. 
Additionally, this role may be eligible for other benefits such as a commuter stipend.
Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, and Seattle is $216,000–$270,000 USD.
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications.
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
20d ago
Research Scientist, AI Controls and Monitoring
Scale AI· San Francisco, CA; New York, NY
Scale Labs, Research Scientist — AI Controls and Monitoring As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems. Building on this expertise, Scale Labs has launched a new team focused on policy research, to bridge the gap between AI research and global policymakers to make informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. This team collaborates broadly across industry, the public sector, and academia and regularly publishes our findings. We are actively seeking talented researchers to join us in shaping this vision. As a Research Scientist focused on AI Controls and Monitoring, you will design methods, systems, and experiments to ensure that advanced AI models and agents remain aligned with intended goals, even in high-stakes or adversarial environments. For example, you might: Develop monitoring techniques and observability methods that track AI behavior in real time to identify and flag deviations, emergent capabilities, or anomalous outputs; Research mechanisms for layered control, including fail-safes, oversight protocols, and intervention methods that can halt or redirect AI systems when risks are detected; Design red-team simulations to probe weaknesses in oversight and control mechanisms, and build mitigations to close identified gaps; Collaborate with policymakers, engineers, and other researchers to establish standards and benchmarks for AI monitoring and escalation. Ideally you’d have: Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance. Practical experience conducting technical research collaboratively. 
You should be comfortable designing control and monitoring experiments for AI systems, building prototype systems, and quickly turning new ideas from the research literature into working prototypes. A track record of published research in machine learning, particularly in generative AI. At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development. Strong written and verbal communication skills to operate in a cross-functional team. Nice to have: Experience with runtime monitoring, anomaly detection, or observability for ML systems. Familiarity with AI control or alignment research (e.g., scalable oversight, interpretability, debate). Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any LeetCode-style questions. If you’re excited about advancing AI safety and contributing to our mission, we encourage you to apply, even if your experience doesn’t perfectly align with every requirement. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. 
Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000 — $270,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. 
20d ago
Research Scientist, Agent Robustness
Scale AI· San Francisco, CA; New York, NY
Scale Labs, Research Scientist — Agent Robustness As the leading data and evaluation partner for frontier AI companies, Scale plays an integral role in understanding the capabilities and safeguarding AI models and systems. Building on this expertise, Scale Labs has launched a new team focused on policy research, to bridge the gap between AI research and global policymakers to make informed, scientific decisions about AI risks and capabilities. Our research tackles the hardest problems in agent robustness, AI control protocols, and AI risk evaluations to help governments, industry, and the public understand and mitigate AI risk while maximizing AI adoption. This team collaborates broadly across industry, the public sector, and academia and regularly publishes our findings. We are actively seeking talented researchers to join us in shaping this vision. As a Research Scientist working on Agent Robustness you will work on the fundamental challenges of building AI agents that are safe and aligned with humans. For example, you might: Research the science of AI agent capabilities with a focus on how they relate to safety, risk factors, and methodologies for benchmarking them; Design and build harnesses to test AI agents’ tendency to take harmful actions when pressured to do so by users or tricked into doing so by elements of their environment; Design and build exploits and mitigations for new and unique failure modes that arise as AI agents gain affordances like coding, web browsing, and computer use; Characterize and design mitigations for potential failure modes or broader risks of systems involving multiple interacting AI agents. Ideally you’d have: Commitment to our mission of promoting safe, secure, and trustworthy AI deployments in the industry as frontier AI capabilities continue to advance. Practical experience conducting technical research collaboratively. 
You should be comfortable building and leveraging agent scaffolding, designing evaluation harnesses, and quickly turning new ideas from the research literature into working prototypes. Experience with post-training and RL techniques such as RLHF, DPO, GRPO, and similar approaches. A track record of published research in machine learning, particularly in generative AI. At least three years of experience addressing sophisticated ML problems, whether in a research setting or in product development. Strong written and verbal communication skills to operate in a cross-functional team. Nice to have: Hands-on experience with agent evaluation frameworks such as SWE-bench, WebArena, OSWorld, Inspect, or similar tools. Experience with red-teaming, prompt injection, or adversarial testing of AI systems. Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any LeetCode-style questions. If you’re excited about advancing AI safety and contributing to our mission, we encourage you to apply, even if your experience doesn’t perfectly align with every requirement. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. 
Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000 — $270,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. 
20d ago
Product Manager, Gen AI
Scale AI· New York, NY; San Francisco, CA
Scale AI builds the data infrastructure that powers the world’s most advanced AI. We are the trusted data partner behind frontier model makers and enterprise AI teams, providing the high-quality training data, evaluation frameworks, and human-feedback systems that make models smarter, safer, and more capable.
Scale operates as a two-sided marketplace. On the demand side, our customers (leading AI labs and enterprises) need precisely labeled, expert-curated data to train and evaluate their models. On the supply side, we work with a global network of 500,000+ skilled contributors across 100+ countries who perform the complex annotation, evaluation, and data-generation tasks that fuel AI progress. Product Managers at Scale sit at the intersection of these two sides, shaping the systems, tooling, and experiences that make this marketplace work at unprecedented quality and scale.
We are hiring Product Managers across multiple teams within our GenAI organization. These roles span both demand-side products (the tools and platforms our customers interact with) and supply-side products (the systems that power our contributor ecosystem). Each role offers the opportunity to work on high-impact, technically complex problems at the frontier of AI, with dedicated engineering, design, and data science teams.
About the Role
Scale’s GenAI platform is how the world’s leading AI companies, from frontier model labs to Fortune 500 enterprises, create the training data that makes their models best-in-class. As a PM here, you are building the systems that directly shape AI model quality. These roles are deeply cross-functional. You will work with dedicated engineering, design, and data science teams, as well as operations, finance, growth, and customer-facing stakeholders. The problems are technically complex, the pace is fast, and the impact is measurable. Whether you are on the demand side (shaping the products customers use to create and evaluate training data) or the supply side (building the systems that power our global contributor marketplace), you will own your product area end-to-end, from strategy to execution to instrumentation. Scale is a growth-stage company with the resources of a well-funded leader and the urgency of a startup. PMs here operate with significant autonomy, ship frequently, and are expected to be deeply analytical and hands-on.
Example Product Management Openings:
Task UX (Demand Side): Own the end-to-end tasking product. You will define how tasks are designed, how contributors interact with multi-turn chat interfaces, and how in-task quality is measured and enforced across diverse conversational modalities. This is a critical surface area for training the next generation of models.
Multi-Dimensional Quality (Demand Side): Own Scale’s MDQ measurement framework, the CoPilot assisted-annotation experience, and our data pipeline connectivity layer. You will drive the core data quality infrastructure that customers depend on, defining how quality is decomposed, measured, and surfaced, while building AI-assisted tooling that helps contributors produce higher-quality outputs faster.
Pay & Incentives (Supply Side): This PM owns the payment and incentive systems that serve Scale’s global contributor base. You will ensure 500,000+ contributors across 100+ countries are paid accurately and on time, set pay rates methodically by skill and geography, and design incentive structures that balance cost efficiency, data quality, and contributor satisfaction. This role sits at the intersection of marketplace economics, global payments operations, and contributor experience.
You Will
Set the product strategy and roadmap for your area, grounded in customer needs, data analysis, and business impact
Develop and execute a data-driven product roadmap through close collaboration with senior leadership, engineering, operations, data science, analytics, and design
Translate customer and internal-user needs into clear, well-defined functional and technical requirements backed by data analysis and deep understanding of your users
Guide and interface closely with engineering and data teams to define scope, review and refine technical capabilities, prioritize projects for release, and identify new opportunities
Build long-term instrumentation, monitoring, and evaluation capabilities for product performance tracking and insight generation
Establish business cases and projected return on investment to identify and prioritize opportunities
Partner with finance and business leaders to manage impact on the profitability and growth of the overall business
Communicate product vision, strategy, and progress to executive stakeholders and cross-functional partners
Ideally, You’d Have
4–10 years of experience in Product Management in the tech industry, with scope appropriate to level (L4: 4–6 yrs, L5: 6–8 yrs, L6: 8–10+ yrs)
Strong business acumen and analytical rigor, with demonstrated success driving products in ambiguous, high-growth environments
Experience translating complex technical systems into clear product strategies, and comfort engaging deeply with engineering and data science teams
Excellent communication and stakeholder management skills, capable of influencing across technical and non-technical audiences
Experience building products from the ground up and iterating through the scaling journey of a business
Bachelor’s or advanced degree in a quantitative, engineering, or related discipline
Nice to Have:
Experience in AI/ML, data infrastructure, or marketplace businesses
Strong understanding of the AI landscape: model training workflows, data labeling, evaluation, and deployment
Experience with global payment systems, contributor/gig-economy platforms, or trust & safety domains
Experience working at high-growth startups or scaling consumer/enterprise platforms
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.
Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, and Seattle is $205,600–$257,000 USD.
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions.
20d ago
Staff Software Engineer, Full-Stack - Enterprise Gen AI
Scale AI· New York, NY; San Francisco, CA
Scale GP (Scale Generative AI Platform) is an enterprise-grade AI platform providing APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a frontend-focused full-stack engineer to help build AI-powered applications that redefine enterprise workflows and push the boundaries of interactive AI. This role is ideal for someone who thrives in a fast-paced environment, enjoys working on a diverse set of projects, and has a passion for crafting high-quality, intuitive user experiences.
At Scale, you'll work on a mix of cutting-edge customer-facing AI applications and internal SaaS products. Our engineering team powers projects like TIME’s Person of the Year AI experience, where our AI technology helped shape one of the most iconic features in media. You'll also contribute to Scale’s GenAI Platform (SGP), a powerful system that enables businesses to build and deploy AI agents at scale. Whether it’s developing interactive AI assistants, enterprise-grade web applications, or refining our core SaaS platform, you’ll play a crucial role in shaping how AI integrates into real-world applications.
You Will:
Build and enhance user-facing AI applications for major enterprise customers, including high-profile media and Fortune 500 companies
Develop and refine features for Scale’s GenAI Platform, empowering businesses to build, deploy, and manage AI-driven agents
Design, build, and optimize polished, high-performance UIs using Next.js, React, TypeScript, and Tailwind
Work closely with product managers, designers, and AI/ML teams to create seamless, intuitive, and impactful user experiences
Integrate frontend applications with backend services, working with APIs, authentication systems, and cloud-based infrastructure
Ship features at a rapid pace while maintaining a high level of code quality, performance, and accessibility
Ideally, You Have:
5+ years of experience developing frontend or fullstack applications in a modern tech stack
Strong proficiency in Next.js, React, TypeScript, and Tailwind, with an eye for building polished, user-friendly interfaces
Experience working on high-visibility, customer-facing applications and making trade-offs between speed and quality in fast-paced environments
A passion for AI and experience working on interactive AI applications, agent-based systems, or data-rich web platforms
Familiarity with backend technologies such as FastAPI, PostgreSQL, GraphQL, and cloud infrastructure like AWS, Azure, or GCP
A track record of collaborating cross-functionally with design, product, and ML teams to bring AI-powered applications to life
This role is a unique opportunity to shape the future of AI-powered user experiences, working on projects that impact millions of users while developing tools that empower businesses to deploy AI at scale. If you’re excited by the intersection of AI, frontend engineering, and product design, we’d love to hear from you.
The base salary range for this full-time position in our hub locations of San Francisco, New York, or Seattle is $248,400–$310,500 USD.
Compensation packages at Scale include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Scale employees are also granted Stock Options that are awarded upon board of director approval. You’ll also receive benefits including, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. 
For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $248,400–$310,500 USD. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates.
We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
20d ago
Staff Product Manager, Agentic Platform
Scale AI· New York, NY; San Francisco, CA; Washington, DC
Role: Scale is at the forefront of the AI revolution, working across the US government, partners and allies around the world to unlock the potential of generative AI (GenAI). We are seeking a product leader to join our team and play a pivotal role in building Agentic AI platforms to support national-level decisions, including for some of the nation’s most important national security challenges. The ideal candidate will have a strong understanding of product leadership and software engineering principles and practices, and deep experience with ML/AI application development, coupled with proven experience in managing complex projects with multiple stakeholders, or AI-related projects within a government or highly regulated setting, emphasizing ethical AI deployment and robust risk management practices. This role requires a strategic leader adept at navigating the complexities of government GenAI projects, ensuring Scale’s public sector AI solution aligns with agency objectives and adheres to stringent security and compliance mandates. The product manager will be responsible for the entire lifecycle of the generative AI platform, including product design, cross-program execution, capability prioritization, stakeholder engagement with various government entities, defining and managing engineering scope, developing detailed project plans, and overseeing resource allocation and budget management. A key focus will be on ensuring that Scale’s public sector AI solution operates securely within controlled network environments, and is configured properly to support government workflows, specifically those that relate to national defense.
Some examples of GenAI applications we build are: Agentic warfare and scenario planning; Indications and warnings integration for the protection of critical continental level assets; Deep research capability that can help evaluate thousands of pages of classified information; Report generation for multiple customized report templates; Text2SQL intelligence applications to make analysts more efficient and embed a culture of data-driven decision-making. You will: Develop enterprise grade solutions that leverage cutting edge AI and AI agents to drive value for public sector customers; Work with executives at Scale and our customers to determine and execute the product strategy of the business; Own end-to-end product development by understanding customer pain points, defining product requirements, managing development, testing, and launches; Lead cross-functional teams including engineering, product design, operations, marketing, go-to-market and finance; Develop a point of view and execute on turning the solutions we build into scalable software that we can commercialize across the industry; Maintain a Top Secret security clearance. Ideally you'd have: Technical degree in computer science, engineering, or equivalent experience; 4+ years of experience in building ML-powered and / or enterprise-facing products; Strong understanding of generative AI technologies and their applications in public or large-scale private sector settings; Experience operating in a fast-paced environment with high ambiguity; Exceptional leadership, presentation, and communication skills with the ability to influence cross-functional teams; Data literacy and experience with data analytics; Prior military or government experience; Coding experience (e.g. Python). Nice to haves: Experience building infrastructure and tooling to develop and support agentic applications. Experience working in startup environments building solutions for public sector / federal customers.
Understanding of public / federal networks, infrastructure, and deployment constraints. TS/SCI Security Clearance. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $237,600–$297,000 USD. For pay transparency purposes, the base salary range for this full-time position in the locations of Washington DC, Texas, Colorado, Hawaii is: $213,400–$267,300 USD.
20d ago
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As a Staff Agent Post-Training MLRE, you will build out our next-gen Agent RL training platform. You’ll build out the platform that will train best-in-class Agents that achieve state of the art results on real enterprise use-cases. You’ll integrate cutting edge research into our training stack, enabling MLREs on the Enterprise AI team to deploy use-cases ranging from next-generation AI cybersecurity firewall LLMs to training foundation healthtech search models. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers. Research cutting edge algorithms to integrate directly into our training stack. Design solutions that enable complex multi-agent systems to directly learn from both process + outcome based rewards. Ideally you’d have: 5+ years of LLM training in a production environment; Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years; PhD or Masters in Computer Science or a related field. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800–$331,000 USD.
20d ago
Senior AI Infrastructure Engineer, Model Serving Platform
Scale AI· San Francisco, CA; New York, NY
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and production systems, supporting both internal and external use cases across various environments. The ideal candidate combines strong ML fundamentals with deep expertise in backend system design. You’ll work in a highly collaborative environment, bridging research and engineering to deliver seamless experiences to our customers and accelerate innovation across the company. You will: Build and maintain fault-tolerant, high-performance systems for serving LLM workloads at scale. Build an internal platform to empower LLM capability discovery. Collaborate with researchers and engineers to integrate and optimize models for production and research use cases. Conduct architecture and design reviews to uphold best practices in system design and scalability. Develop monitoring and observability solutions to ensure system health and performance. Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment. Ideally you'd have: 5+ years of experience building large-scale, high-performance backend systems. Strong programming skills in one or more languages (e.g., Python, Go, Rust, C++). Experience with LLM serving and routing fundamentals (e.g. rate limiting, token streaming, load balancing, budgets, etc.). Experience with LLM capabilities and concepts such as reasoning, tool calling, prompt templates, etc. Experience with containers and orchestration tools (e.g., Docker, Kubernetes). Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e.g., Terraform). Proven ability to solve complex problems and work independently in fast-moving environments. Nice to haves: Experience with modern LLM serving frameworks such as vLLM, SGLang, TensorRT-LLM, or text-generation-inference.
For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000–$270,000 USD.
20d ago
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research and resources that serve all of our enterprise clients. As an ML Sys Research Engineer, you’ll work on building out the algorithms for our next-gen Agent RL training platform, support large scale training, and research and integrate state-of-the-art technologies to optimize our ML system. Your customers will be other MLREs and AAIs on the Enterprise AI team who are taking the training algorithms and applying them to client use-cases ranging from next-generation AI cybersecurity firewall LLMs to training foundation healthtech search models. If you are excited about shaping the future of the modern AI movement, we would love to hear from you! You will: Build, profile and optimize our training and inference framework. Post-train state of the art models, developed both internally and from the community, to define stable post-training recipes for our enterprise engagements. Collaborate with ML teams to accelerate their research and development, and enable them to develop the next generation of models and data curation. Create a next-gen agent training algorithm for multi-agent/multi-tool rollouts. Ideally you’d have: At least 1-3 years of LLM training in a production environment; A passion for system optimization; Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
Ability to demonstrate know-how on how to operate the architecture of the modern GPU cluster; Experience with multi-node LLM training and inference; Strong software engineering skills, proficient in frameworks and tools such as CUDA, PyTorch, transformers, flash attention, etc. Strong written and verbal communication skills to operate in a cross functional team environment. PhD or Masters in Computer Science or a related field. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800–$331,000 USD.
20d ago
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state of the art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As an Agent MLRE, you will be working on applying our Agent RL Training + Building algorithms to real life enterprise datasets across our clients + benchmarks. This will involve creating best-in-class Agents that achieve state of the art results through a combination of post-training + agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Train state of the art models, developed both internally and from the community, to deploy to our enterprise customers. Research cutting edge algorithms to integrate directly into our training stack. Build agents that leverage our proprietary agent-building algorithms to automatically hill climb datasets – including defining highly performant tools, multi-agent systems, and complex rewards. Ideally you’d have: 1-3 years of building with LLMs in a production environment; Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.; Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years; PhD or Masters in Computer Science or a related field. Compensation packages at Scale for eligible roles include base salary, equity, and benefits.
For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800–$331,000 USD.
20d ago
Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state-of-the-art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As an MLRE on the Data Foundation team, you’ll work on cutting-edge research to define the data flywheel that makes the whole machine move. This includes research on generating synthetic environments from task definitions, building agents for trace analysis, and contributing to a cutting-edge framework that automatically hill-climbs agent-building from an eval set. This will involve creating best-in-class Agents that achieve state-of-the-art results through a combination of post-training and agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Build synthetic data pipelines to generate enterprise environments for RL post-training; Create agents that convert production traces into actionable insights for improving agents; Contribute to our agent-building product, which can construct other agents using coding agents and proprietary algorithms; Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers.
Ideally you’d have: 3+ years of building with LLMs in a production environment; Clear experience constructing high-quality data to improve an LLM/Agent; Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years; PhD or Master's in Computer Science or a related field. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 to $331,000 USD.
20d ago
Staff Software Engineer, Full-Stack - Enterprise Gen AI
Scale AI· New York, NY; San Francisco, CA
Scale GP (Scale Generative AI Platform) is an enterprise-grade AI platform providing APIs for knowledge retrieval, inference, evaluation, and more. We are looking for a frontend-focused full-stack engineer to help build AI-powered applications that redefine enterprise workflows and push the boundaries of interactive AI. This role is ideal for someone who thrives in a fast-paced environment, enjoys working on a diverse set of projects, and has a passion for crafting high-quality, intuitive user experiences. At Scale, you'll work on a mix of cutting-edge customer-facing AI applications and internal SaaS products. Our engineering team powers projects like TIME’s Person of the Year AI experience, where our AI technology helped shape one of the most iconic features in media. You'll also contribute to Scale’s GenAI Platform (SGP), a powerful system that enables businesses to build and deploy AI agents at scale. Whether it’s developing interactive AI assistants, enterprise-grade web applications, or refining our core SaaS platform, you’ll play a crucial role in shaping how AI integrates into real-world applications.
You Will: Build and enhance user-facing AI applications for major enterprise customers, including high-profile media and Fortune 500 companies; Develop and refine features for Scale’s GenAI Platform, empowering businesses to build, deploy, and manage AI-driven agents; Design, build, and optimize polished, high-performance UIs using Next.js, React, TypeScript, and Tailwind; Work closely with product managers, designers, and AI/ML teams to create seamless, intuitive, and impactful user experiences; Integrate frontend applications with backend services, working with APIs, authentication systems, and cloud-based infrastructure; Ship features at a rapid pace while maintaining a high level of code quality, performance, and accessibility. Ideally, You Have: 5+ years of experience developing frontend or fullstack applications in a modern tech stack; Strong proficiency in Next.js, React, TypeScript, and Tailwind, with an eye for building polished, user-friendly interfaces; Experience working on high-visibility, customer-facing applications and making trade-offs between speed and quality in fast-paced environments; A passion for AI and experience working on interactive AI applications, agent-based systems, or data-rich web platforms; Familiarity with backend technologies such as FastAPI, PostgreSQL, GraphQL, and cloud infrastructure like AWS, Azure, or GCP; A track record of collaborating cross-functionally with design, product, and ML teams to bring AI-powered applications to life. This role is a unique opportunity to shape the future of AI-powered user experiences, working on projects that impact millions of users while developing tools that empower businesses to deploy AI at scale. If you’re excited by the intersection of AI, frontend engineering, and product design, we’d love to hear from you. The base salary range for this full-time position in our hub locations of San Francisco, New York, or Seattle is $248,400 to $310,500 USD.
Compensation packages at Scale include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position, determined by work location and additional factors, including job-related skills, experience, interview performance, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Scale employees are also granted Stock Options that are awarded upon board of director approval. You’ll also receive benefits including, but not limited to: Comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend.
20d ago
Staff Product Manager, Agentic Platform
Scale AI· New York, NY; San Francisco, CA; Washington, DC
Role: Scale is at the forefront of the AI revolution, working across the US government, partners and allies around the world to unlock the potential of generative AI (GenAI). We are seeking a product leader to join our team and play a pivotal role in building Agentic AI platforms to support national-level decisions, including for some of the nation’s most important national security challenges. The ideal candidate will have a strong understanding of product leadership and software engineering principles and practices, and deep experience with ML/AI application development, coupled with proven experience in managing complex projects with multiple stakeholders or AI-related projects within a government or highly regulated setting, emphasizing ethical AI deployment and robust risk management practices. This role requires a strategic leader adept at navigating the complexities of government GenAI projects, ensuring Scale’s public sector AI solution aligns with agency objectives and adheres to stringent security and compliance mandates. The product manager will be responsible for the entire lifecycle of the generative AI platform, including product design, cross-program execution, capability prioritization, stakeholder engagement with various government entities, defining and managing engineering scope, developing detailed project plans, and overseeing resource allocation and budget management. A key focus will be on ensuring that Scale’s public sector AI solution operates securely within controlled network environments and is configured properly to support government workflows, specifically those that relate to national defense.
Some examples of GenAI applications we build are: Agentic warfare and scenario planning; Indications and warnings integration for the protection of critical continental-level assets; Deep research capability that can help evaluate thousands of pages of classified information; Report generation for multiple customized report templates; Text2SQL intelligence applications to make analysts more efficient and embed a culture of data-driven decision-making. You will: Develop enterprise-grade solutions that leverage cutting-edge AI and AI agents to drive value for public sector customers; Work with executives at Scale and our customers to determine and execute the product strategy of the business; Own end-to-end product development by understanding customer pain points, defining product requirements, and managing development, testing, and launches; Lead cross-functional teams including engineering, product design, operations, marketing, go-to-market and finance; Develop a point of view and execute on turning the solutions we build into scalable software that we can commercialize across the industry; Maintain a Top Secret security clearance. Ideally you'd have: Technical degree in computer science, engineering, or equivalent experience; 4+ years of experience in building ML-powered and/or enterprise-facing products; Strong understanding of generative AI technologies and their applications in public or large-scale private sector settings; Experience operating in a fast-paced environment with high ambiguity; Exceptional leadership, presentation, and communication skills with the ability to influence cross-functional teams; Data literacy and experience with data analytics; Prior military or government experience; Coding experience (e.g. Python). Nice to haves: Experience building infrastructure and tooling to develop and support agentic applications. Experience working in startup environments building solutions for public sector / federal customers.
Understanding of public / federal networks, infrastructure, and deployment constraints. TS/SCI Security Clearance. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $237,600 to $297,000 USD. For pay transparency purposes, the base salary range for this full-time position in the locations of Washington DC, Texas, Colorado, Hawaii is: $213,400 to $267,300 USD.
20d ago
Staff Machine Learning Research Engineer, Agent Post-training - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state-of-the-art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As a Staff Agent Post-Training MLRE, you will build out our next-gen Agent RL training platform, which will train best-in-class Agents that achieve state-of-the-art results on real enterprise use-cases. You’ll integrate cutting-edge research into our training stack, enabling MLREs on the Enterprise AI team to deploy use-cases ranging from next-generation AI cybersecurity firewall LLMs to training foundation healthtech search models. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers; Research cutting-edge algorithms to integrate directly into our training stack; Design solutions that enable complex multi-agent systems to directly learn from both process- and outcome-based rewards. Ideally you’d have: 5+ years of LLM training in a production environment; Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO etc.
Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years; PhD or Master's in Computer Science or a related field. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 to $331,000 USD.
20d ago
Senior AI Infrastructure Engineer, Model Serving Platform
Scale AI· San Francisco, CA; New York, NY
As a Software Engineer on the ML Infrastructure team, you will design and build platforms for scalable, reliable, and efficient serving of LLMs. Our platform powers cutting-edge research and production systems, supporting both internal and external use cases across various environments. The ideal candidate combines strong ML fundamentals with deep expertise in backend system design. You’ll work in a highly collaborative environment, bridging research and engineering to deliver seamless experiences to our customers and accelerate innovation across the company. You will: Build and maintain fault-tolerant, high-performance systems for serving LLM workloads at scale. Build an internal platform to empower LLM capability discovery. Collaborate with researchers and engineers to integrate and optimize models for production and research use cases. Conduct architecture and design reviews to uphold best practices in system design and scalability. Develop monitoring and observability solutions to ensure system health and performance. Lead projects end-to-end, from requirements gathering to implementation, in a cross-functional environment. Ideally you'd have: 5+ years of experience building large-scale, high-performance backend systems. Strong programming skills in one or more languages (e.g., Python, Go, Rust, C++). Experience with LLM serving and routing fundamentals (e.g., rate limiting, token streaming, load balancing, budgets). Experience with LLM capabilities and concepts such as reasoning, tool calling, and prompt templates. Experience with containers and orchestration tools (e.g., Docker, Kubernetes). Familiarity with cloud infrastructure (AWS, GCP) and infrastructure as code (e.g., Terraform). Proven ability to solve complex problems and work independently in fast-moving environments. Nice to haves: Experience with modern LLM serving frameworks such as vLLM, SGLang, TensorRT-LLM, or text-generation-inference.
Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $216,000 — $270,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. 
We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
20d ago
Machine Learning Systems Research Engineer, Agent Post-training - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state-of-the-art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research and resources that serve all of our enterprise clients. As an ML Sys Research Engineer, you’ll work on building out the algorithms for our next-gen Agent RL training platform, support large-scale training, and research and integrate state-of-the-art technologies to optimize our ML system. Your customers will be other MLREs and AAIs on the Enterprise AI team who are taking the training algorithms and applying them to client use cases ranging from next-generation AI cybersecurity firewall LLMs to training foundation healthtech search models. If you are excited about shaping the future of the modern AI movement, we would love to hear from you! You will: Build, profile and optimize our training and inference framework. Post-train state-of-the-art models, developed both internally and from the community, to define stable post-training recipes for our enterprise engagements. Collaborate with ML teams to accelerate their research and development, and enable them to develop the next generation of models and data curation. Create a next-gen agent training algorithm for multi-agent/multi-tool rollouts. Ideally you’d have: At least 1-3 years of LLM training in a production environment. Passion for system optimization. Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO.
Demonstrated ability to operate a modern GPU cluster architecture. Experience with multi-node LLM training and inference. Strong software engineering skills, with proficiency in frameworks and tools such as CUDA, PyTorch, transformers, and flash attention. Strong written and verbal communication skills to operate in a cross-functional team environment. PhD or Masters in Computer Science or a related field. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 to $331,000 USD. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
20d ago
Machine Learning Research Engineer, Agents - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state-of-the-art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As an Agent MLRE, you will apply our Agent RL Training + Building algorithms to real-life enterprise datasets across our clients + benchmarks. This will involve creating best-in-class Agents that achieve state-of-the-art results through a combination of post-training + agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers. Research cutting-edge algorithms to integrate directly into our training stack. Build agents that leverage our proprietary agent-building algorithms to automatically hill-climb datasets, including defining highly performant tools, multi-agent systems, and complex rewards. Ideally you’d have: 1-3 years of building with LLMs in a production environment. Experience with post-training methods like RLHF/RLVR and related algorithms like PPO/GRPO. Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years. PhD or Masters in Computer Science or a related field. Compensation packages at Scale for eligible roles include base salary, equity, and benefits.
The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 — $331,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. 
20d ago
Machine Learning Research Engineer, Agent Data Foundation - Enterprise GenAI
Scale AI· San Francisco, CA; New York, NY
AI is becoming vitally important in every function of our society. At Scale, our mission is to accelerate the development of AI applications. For 9 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent investment from Meta, we are doubling down on building out state-of-the-art post-training algorithms to reach the performance necessary for complex agents in enterprises around the world. The Enterprise ML Research Lab works on the front lines of this AI revolution. We are working on an arsenal of proprietary research, tools, and resources that serve all of our enterprise clients. As an MLRE on the Data Foundation team, you’ll work on cutting-edge research to define the data flywheel that makes the whole machine move. This includes research around synthetic environments from task definitions, building agents for trace analysis, and contributing to a cutting-edge framework that automatically hill-climbs agent-building from an eval set. This will involve creating best-in-class Agents that achieve state-of-the-art results through a combination of post-training + agent-building algorithms. If you are excited about shaping the future of the modern GenAI movement, we would love to hear from you! You will: Build synthetic data pipelines to generate enterprise environments to use for RL post-training. Create agents to convert traces from production into actionable insights to use to improve agents. Contribute to our agent-building product, which can construct other agents using coding agents + proprietary algorithms. Train state-of-the-art models, developed both internally and from the community, to deploy to our enterprise customers.
Ideally you’d have: 3+ years of building with LLMs in a production environment. Clear experience constructing high-quality data to improve an LLM/Agent. Publications in top conferences such as NeurIPS, ICLR, or ICML within the last two years. PhD or Masters in Computer Science or a related field. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 to $331,000 USD. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions.
20d ago
Software Engineer, Frontier AI Infrastructure
Scale AI· San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
Scale AI is seeking a highly skilled and motivated Software Engineer, Frontier AI Infrastructure to join our dynamic Public Sector Engineering team. As a part of this team, you will own the model inference layer: enabling state-of-the-art models, debugging the latest AI tools, managing networking, debugging latency, and tracking pricing/usage metrics for AI models. You will lead technical discussions on the frontlines with cloud vendors and customers to deliver on critical contracts and to debug platform issues. You will also work upstream with Product to understand features before they break, moving us from "infra-only debugging" to proactive integration testing. You will: Design and implement secure, scalable backend systems for Public Sector customers, leveraging Scale's modern and cloud-native AI infrastructure. Own services or systems and define their long-term health goals, while also improving the health of surrounding components. Re-architect the stack to run in compliant or restrictive environments. This requires designing swappable components (auth, storage, logging) to meet government/security mandates without breaking the product. Work with Product to build integration tests that catch issues early, shifting the focus from "infra-only debugging" to preventing failures upstream. Participate actively in customer engagements, working closely with stakeholders to understand requirements and deliver innovative solutions. Contribute to the platform roadmap and product strategy for Scale AI's Public Sector business, playing a key role in shaping the future direction of our offerings. Must have: At least an active Secret clearance and the ability and willingness to upgrade to TS/SCI with CI Poly.
This is a requirement; candidates who do not hold at least a Secret clearance will not be considered. Ideally you'd have: Full Stack Development: Proficiency in both front-end and back-end development, including experience with modern web development frameworks, programming languages, and databases. Experience with developing and delivering software to air-gapped and isolated environments is a plus. Cloud-Native Technologies: Understanding of containerization (e.g., Docker) and container orchestration (e.g., Kubernetes) is desired. Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and experience in developing and deploying applications in a cloud-native environment. Security Focused: Experience with Federal Compliance frameworks and requirements (e.g., Cloud SRG, FedRAMP, STIG Benchmarks). Experience developing software and technical solutions that meet strict security and regulatory compliance requirements. Problem Solving: Strong analytical and problem-solving skills to understand complex challenges and devise effective solutions. Ability to think critically, identify root causes, and propose innovative approaches to overcome technical obstacles. Collaboration and Communication: Excellent interpersonal and communication skills to effectively collaborate with cross-functional teams, stakeholders, and customers. Ability to clearly articulate technical concepts to non-technical audiences and foster a collaborative work environment. Adaptability and Learning Agility: Willingness to embrace new technologies, learn new skills, and adapt to evolving project requirements. Ability to quickly grasp and apply new concepts and stay up-to-date with emerging trends in software engineering. Must be able to support work 3-4 days a week from the DC, SF, NYC, or STL office. Compensation packages at Scale for eligible roles include base salary, equity, and benefits.
The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $184,000 — $259,440 USD The base salary range for this full-time position in the locations of Hawaii, Washington DC, Texas, Colorado is: $165,600 — $233,496 USD The base salary range for this full-time position in the location of St. Louis is: $138,000 — $194,580 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. 
20d ago
Machine Learning Engineer, Global Public Sector
Scale AI· Doha, Qatar; London, UK
Scale’s mission is to develop reliable AI systems for the world's most important decisions. Our core work consists of: creating custom AI applications that will impact millions of citizens; generating high-quality training data for national LLMs; and upskilling and advisory services to spread the impact of AI. Scale is hiring ML Research Engineers to bridge the gap between emerging AI capabilities and mission-critical, real-world impact. In our Global Public Sector (GPS) division, we don’t just implement tools; we conduct applied research to solve the unique challenges of sovereign AI. Your role is to move beyond off-the-shelf implementations. You will lead the research into Agent Design, Reliability, and AI Safety, developing novel system architectures that power high-stakes government applications. You will be the bridge between a research paper and a production-ready system that functions at scale. The Mission: Applied Agent Research: Leading the design of reliable, multi-step agentic systems and long-horizon reasoning frameworks that can solve complex problems for national security and public policy. Systemic Evaluation & Red-Teaming: Developing rigorous benchmarks and evaluation protocols to ensure AI systems are safe, unbiased, and performant in high-stakes, non-commercial environments. Model Optimisation & Selection: Conducting deep-dive research into model performance (both open-weight and closed) to identify the best tools for niche domains, optimising them through context engineering, RAG, and other inference-time techniques. What You Will Do: Architect Agentic Systems: Design and build agent architectures, including the harnesses, tool-use protocols, and logic flows that allow LLMs to function as reliable, autonomous agents in complex workflows. Drive Reliability & Safety: Research and implement robust evaluation frameworks. This includes red-teaming for sovereign AI requirements and developing strategies to mitigate hallucinations in regulated data environments.
Synthesise Deep Research: Build agents capable of autonomous information synthesis and long-horizon reasoning, enabling users to analyse massive datasets and extract actionable insights. Optimise for Niche Domains: Evaluate and adapt models for specialised use cases, such as LLM reasoning for low-resource languages, complex OCR tasks, or working in GPU-constrained environments. Build Evaluation Frontiers: Create new, automated benchmarks that define what success looks like for AI in the public sector, ensuring our systems meet the highest standards of accuracy and sovereignty. Consult as a Technical Authority: Act as a subject matter expert for public sector leaders, advising on the practical limits, safety requirements, and performance trade-offs of emerging AI technologies. Ideally, You Have: Engineering Rigour: Exceptional proficiency in Python and experience building agentic harnesses or AI infrastructure. You write production-ready code that is modular, scalable, and reliable. Applied Research Mindset: A track record of taking theoretical AI concepts and turning them into functional prototypes or products. You know how to read a paper and determine if its methods are actually viable for a production system. Evaluation Expertise: Experience in LLM benchmarking, red-teaming, or building evaluations that go beyond standard academic datasets. Advanced Degree: A Master’s or PhD in Computer Science, Mathematics, or a related field (with a focus on ML) is preferred, but we value demonstrated impact and engineering excellence. Nice to Haves: Agentic Systems Expert: Deep experience in building multi-agent systems, including chain-of-thought optimisation and tool-calling reliability. Sovereign AI Experience: Experience working with highly regulated data environments, on-premise deployments, or sensitive government use cases. Inference Optimisation: Knowledge of how to optimise model performance for environments with limited GPU capacity or specific latency requirements.
Zero-to-One Mindset: You are comfortable navigating ambiguity and enjoy defining research directions from scratch to solve a specific product or mission need. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
20d ago
Applied AI Engineer, Global Public Sector
Scale AI· Doha, Qatar; London, UK
Scale’s rapidly growing Global Public Sector team is focused on using AI to address critical challenges facing the public sector around the world. Our core work consists of:
- Creating custom AI applications that will impact millions of citizens
- Generating high-quality training data for national LLMs
- Upskilling and advisory services to spread the impact of AI
We are hiring Applied AI Engineers to build custom end-to-end AI applications for our public sector clients using the latest developments in the field of AI. You will also have the opportunity to help create custom datasets and evaluations and to fine-tune these sophisticated models to maximize performance on real-world use cases with global reach. At Scale, we’re not just building AI solutions; we are building repeatable blocks to enable the public sector to transform their operations and better serve citizens through cutting-edge technology. If you’re ready to shape the future of AI in the public sector and be a member of our rapidly expanding team, we’d love to hear from you.
You will:
- Partner with public sector clients to deeply understand their challenges and define AI-driven solutions
- Build and deploy end-to-end AI applications into production, leveraging the latest developments from the biggest AI labs and open-source models
- Collaborate with cross-functional teams, including data annotation specialists, to create high-quality training datasets
- Design and maintain robust evaluation frameworks to ensure the reliability and effectiveness of AI models
- Participate in customer engagements, including occasional travel (approximately two weeks per quarter)
- Contribute to the scaling of AI capabilities in the public sector through hands-on knowledge sharing
Ideally you’d have:
- A strong engineering background, with a Bachelor’s degree in Computer Science, Mathematics, or a related quantitative field (or equivalent practical experience)
- 7+ years of post-graduation engineering experience, with demonstrated proficiency in languages such as Python, TypeScript/JavaScript, Java, or C++
- 2+ years of experience applying AI/ML in production environments, such as deploying deep learning solutions, building generative/agentic AI applications, or setting up evaluation pipelines
- Familiarity with cloud-based machine learning tools and platforms (e.g. AWS, GCP, Azure)
- Strong problem-solving skills, with a data-driven approach to iterating on machine learning models and datasets
- Excellent written and verbal communication skills to collaborate effectively in a cross-functional environment
Nice to haves:
- Experience working at a startup, particularly as a founding engineer
- Experience building and deploying large-scale AI solutions
- Strong written and verbal communication skills to operate in a cross-functional team environment
- Proficiency in Arabic (if focused on language models)
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
20d ago
Software Engineer, Frontier AI Infrastructure
Scale AI· San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
Scale AI is seeking a highly skilled and motivated Software Engineer, Frontier AI Infrastructure to join our dynamic Public Sector Engineering team. As a part of this team, you will own the model inference layer: enabling state-of-the-art models, debugging the latest AI tools, managing networking, debugging latency, and tracking pricing/usage metrics for AI models. You will lead technical discussions on the frontlines with cloud vendors and customers to deliver on critical contracts and to debug platform issues. You will also work upstream with Product to understand features before they break, moving us from "infra-only debugging" to proactive integration testing.
You will:
- Design and implement secure, scalable backend systems for Public Sector customers, leveraging Scale's modern and cloud-native AI infrastructure.
- Own services or systems and define their long-term health goals, while also improving the health of surrounding components.
- Re-architect the stack to run in compliant or restrictive environments. This requires designing swappable components (auth, storage, logging) to meet government/security mandates without breaking the product.
- Work with Product to build integration tests that catch issues early, shifting the focus from "infra-only debugging" to preventing failures upstream.
- Participate actively in customer engagements, working closely with stakeholders to understand requirements and deliver innovative solutions.
- Contribute to the platform roadmap and product strategy for Scale AI's Public Sector business, playing a key role in shaping the future direction of our offerings.
Must have:
- At least an active Secret clearance and the ability and willingness to upgrade to TS/SCI with CI Poly. This is a requirement; candidates who do not hold at least a Secret clearance will not be considered.
Ideally you'd have:
- Full Stack Development: Proficiency in both front-end and back-end development, including experience with modern web development frameworks, programming languages, and databases. Experience with developing and delivering software to air-gapped and isolated environments is a plus.
- Cloud-Native Technologies: Understanding of containerization (e.g., Docker) and container orchestration (e.g., Kubernetes) is desired. Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and experience in developing and deploying applications in a cloud-native environment.
- Security Focused: Experience with Federal Compliance frameworks and requirements (e.g., Cloud SRG, FedRAMP, STIG Benchmarks, etc.). Experience developing software and technical solutions that meet strict security and regulatory compliance requirements.
- Problem Solving: Strong analytical and problem-solving skills to understand complex challenges and devise effective solutions. Ability to think critically, identify root causes, and propose innovative approaches to overcome technical obstacles.
- Collaboration and Communication: Excellent interpersonal and communication skills to effectively collaborate with cross-functional teams, stakeholders, and customers. Ability to clearly articulate technical concepts to non-technical audiences and foster a collaborative work environment.
- Adaptability and Learning Agility: Willingness to embrace new technologies, learn new skills, and adapt to evolving project requirements. Ability to quickly grasp and apply new concepts and stay up-to-date with emerging trends in software engineering.
Must be able to work 3-4 days a week from the DC, SF, NYC, or STL office.
Compensation packages at Scale for eligible roles include base salary, equity, and benefits.
The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located.
For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $184,000 to $259,440 USD
The base salary range for this full-time position in the locations of Hawaii, Washington DC, Texas, Colorado is: $165,600 to $233,496 USD
The base salary range for this full-time position in the location of St. Louis is: $138,000 to $194,580 USD
PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants.
About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision.
PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
20d ago
Research Engineer, Knowledge Foundations
Anthropic· San Francisco, CA
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.
About the role
The Knowledge Work team builds the training environments and evaluations that make Claude effective at real-world professional workflows: searching, analyzing, and creating across the tools and documents knowledge workers use every day. As that work scales, the systems behind it need to be as rigorous as the research itself. As a Research Engineer on Knowledge, you'll design and run experiments that improve how Claude searches, retrieves, and reasons over information at scale. The work spans environment design, data curation, RL training, evaluation, and the infrastructure that supports it all. You'll move fluidly between these depending on what's blocking progress. You'll partner closely with researchers and other RL teams to ship capabilities that show up directly in Claude's behavior.
As our training and evaluations continue to scale, we see a strong synergy between the capabilities our models learn, the tools we build for them to use, and the tools we build for ourselves to understand it all. We own the science behind superhuman epistemics and we ensure the quality of the stack that drives it. We understand that real ownership and impact comes as much through hardening and iterating on environments as it does creating new ones.
Responsibilities
- Design, build, and iterate on training environments and data pipelines that improve Claude's ability to reason over knowledge-intensive tasks
- Run experiments end-to-end: form a hypothesis, build the infrastructure, train models, analyze results, and decide what to try next
- Develop evaluations that meaningfully capture progress on search, retrieval, and reasoning quality
- Identify failure modes in current model behavior and translate them into concrete training signals
- Collaborate closely with researchers across RL Data, post-training, and product teams to align on priorities and ship improvements
- Contribute to shared infrastructure and tooling that compounds the team's velocity over time
- Own a clean, canonical set of evaluation tools and processes for Knowledge Work capabilities, including the process used for model releases
- Build and automate observability, dashboards, and operational tooling for our training environments and evaluation systems, with an emphasis on high signal-to-noise: a small set of trusted metrics and alerts rather than sprawling instrumentation
You may be a good fit if you
- Are a highly experienced Python engineer who ships reliable, well-instrumented code that teammates trust in production
- Have experience designing, running, and analyzing ML experiments
- Can work across the stack, from data pipelines to model training to evaluation
- Have 5+ years of experience operating ML or distributed systems at scale
- Are comfortable working with ambiguity and choosing the most impactful problem to tackle next
- Communicate clearly in writing and in speech, especially when collaborating across time zones
- Find genuine satisfaction and impact in making existing critical systems dependable
Preferred qualifications
- Hands-on experience training, fine-tuning, or doing RL on large language models
- Experience building evaluations for LLMs, particularly in open-ended or knowledge-intensive domains
- Prior work in a research-heavy environment such as a frontier AI lab, quant research firm, or domain-focused AI startup
- Published research on LLMs, RL, retrieval, or related areas
- Experience with distributed training systems
- Comfort being the long-term, context-rich owner of a system and its operational health
Representative projects
- Building a training environment that teaches Claude to plan and execute multi-step research tasks against real document corpora
- Designing an evaluation suite that distinguishes genuine reasoning over evidence from plausible-sounding pattern matching
- Scaling long-running evals and fickle training environments that use many different tools
- Curating and validating a high-quality dataset of expert research workflows for use in post-training
- Diagnosing why Claude fails on a class of long-horizon retrieval tasks and proposing a training intervention, tool, or infrastructure change to fix it
The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.
Annual Salary: $350,000 to $850,000 USD
Logistics
- Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
- Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
- Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
- Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
- Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.
We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.
Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.
How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.
Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
20d ago
Applied AI Architect, Applied AI (Digital Natives Business)
Anthropic· Munich, Germany
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role As an Applied AI team member at Anthropic, you will be a Pre-Sales architect focused on becoming a trusted technical advisor helping large enterprises understand the value of Claude and paint the vision on how they can successfully integrate and deploy Claude into their technology stack. You'll combine your deep technical expertise with customer-facing skills to architect innovative LLM solutions that address complex business challenges while maintaining our high standards for safety and reliability. Working closely with our Sales, Product, and Engineering teams, you'll guide customers from initial technical discovery through successful deployment. You'll leverage your expertise to help customers understand Claude's capabilities, develop evals, and design scalable architectures that maximize the value of our AI systems. Responsibilities: Partner with account executives to deeply understand customer requirements and translate them into technical solutions, ensuring alignment between business objectives and technical implementation Serve as the primary technical advisor to enterprise customers throughout their Claude adoption journey, from discovery to initial evaluation through deployment. You will need to coordinate internally across multiple teams & stakeholders to drive customer success Support customers building with both the Claude API and Claude for Work Create and deliver compelling technical content tailored to different audiences. 
You will need to run the gamut from technical deep dives for engineering & development teams to business-value-focused conversations with executives Guide technical architecture decisions and help customers integrate Claude effectively into their existing technology stack Help customers develop evaluation frameworks to measure Claude's performance for their specific use cases Identify common integration patterns and contribute insights back to our Product and Engineering teams Travel occasionally to customer sites for workshops, technical deep dives, and relationship building Maintain strong knowledge of the latest developments in LLM capabilities and implementation patterns You may be a good fit if you have: 5+ years of experience in technical customer-facing roles such as Solutions Architect, Sales Engineer, or Technical Account Manager Native German speaker with fluent English proficiency Experience working with enterprise customers, navigating complex buying cycles involving multiple stakeholders Exceptional ability to build relationships with and communicate technical concepts to diverse stakeholders, including C-suite executives, engineering & IT teams, and more Strong technical communication skills with the ability to translate customer requirements between technical and business stakeholders Experience designing scalable cloud architectures and integrating with enterprise systems Comfortable with Python Familiarity with common LLM frameworks and tools or a background in machine learning or data science Excitement for engaging in cross-organizational collaboration, working through trade-offs, and balancing competing priorities A love of teaching, mentoring, and helping others succeed Excellent communication and interpersonal skills, able to convey complicated topics in easily understandable terms to a diverse set of external and internal stakeholders. 
Passion for thinking creatively about how to use technology in a way that is safe and beneficial, and ultimately furthers the goal of advancing safe AI systems Deadline to apply: None. Applications will be reviewed on a rolling basis. Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. 
To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
25d ago
Solutions Architect, Applied AI
Anthropic· Bangalore, India
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

Location: Mumbai

About the role
As an Applied AI team member at Anthropic India, you will be a Pre-Sales architect focused on becoming a trusted technical advisor, helping large enterprises across India and the Asia-Pacific region understand the value of Claude and painting the vision of how they can successfully integrate and deploy Claude into their technology stack. You'll combine your deep technical expertise with customer-facing skills to architect innovative LLM solutions that address complex business challenges while maintaining our high standards for safety and reliability. Working closely with our Sales, Product, and Engineering teams, you'll guide customers from initial technical discovery through successful deployment. You'll leverage your expertise to help customers understand Claude's capabilities, develop evals, and design scalable architectures that maximize the value of our AI systems.

Responsibilities:
Partner with account executives to deeply understand customer requirements and translate them into technical solutions, ensuring alignment between business objectives and technical implementation
Serve as the primary technical advisor to enterprise customers throughout their Claude adoption journey, from discovery through initial evaluation to deployment. You will need to coordinate internally across multiple teams & stakeholders to drive customer success
Support customers building with both the Claude API and Claude for Work
Create and deliver compelling technical content tailored to different audiences. You will need to be able to run the gamut from technical deep dives for engineering & development teams to business-value-focused conversations with executives
Guide technical architecture decisions and help customers integrate Claude effectively into their existing technology stack
Help customers develop evaluation frameworks to measure Claude's performance for their specific use cases
Identify common integration patterns and contribute insights back to our Product and Engineering teams
Travel occasionally within India and the APAC region to customer sites for workshops, technical deep dives, and relationship building
Maintain strong knowledge of the latest developments in LLM capabilities and implementation patterns
Collaborate across time zones with global teams while serving as the technical expert for the India market

You may be a good fit if you have:
10+ years of experience in technical customer-facing roles such as Solutions Architect, Sales Engineer, or Technical Account Manager
Experience working with enterprise customers in India or the APAC region, navigating complex buying cycles involving multiple stakeholders
Exceptional ability to build relationships with and communicate technical concepts to diverse stakeholders, including C-suite executives, engineering & IT teams, and more
Strong technical communication skills with the ability to translate customer requirements between technical and business stakeholders
Experience designing scalable cloud architectures and integrating with enterprise systems
Comfort with Python
Familiarity with common LLM frameworks and tools or a background in machine learning or data science
Excitement for engaging in cross-organizational collaboration, working through trade-offs, and balancing competing priorities
A love of teaching, mentoring, and helping others succeed
Excellent communication and interpersonal skills, able to convey complicated topics in easily understandable terms to a diverse set of external and internal stakeholders. You enjoy engaging in cross-organizational collaboration, working through trade-offs, and balancing competing priorities
Passion for thinking creatively about how to use technology in a way that is safe and beneficial, and ultimately furthers the goal of advancing safe AI systems
Ability to work effectively across cultures and time zones as part of a global team

Logistics
Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.

How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
29d ago
Forward Deployed Engineer, Applied AI
Anthropic· Boston, MA; New York City, NY | Seattle, WA; San Francisco, CA | New York City, NY; Washington, DC
About Anthropic
Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems.

About the role:
As a member of the Applied AI team at Anthropic, you will be a Forward Deployed Engineer (FDE) who embeds directly with our most strategic customers to drive transformational AI adoption. You will collaborate closely with customer teams to ship advanced AI applications that solve real-world business problems. Our FDEs engage with customers to accelerate the adoption of existing products and create new applications built on our models. Working closely with our Post-Sales, Product, and Engineering teams, you'll combine engineering expertise, an understanding of frontier AI applications, and customer-facing skills to understand customer workflows and develop innovative solutions that address complex business challenges while maintaining our high standards for safety and reliability.
You will sit at the frontier of enterprise AI deployments and serve as one of our founding FDEs who helps to shape our forward-deployed motion. We expect our FDEs to operate autonomously, thrive under ambiguity, and represent Anthropic at the highest level in customer environments. This is a significant responsibility: you’ll play a key role in championing our mission in the enterprise.

Responsibilities:
Work within customer systems to build production applications with Claude models, ensuring that these products meet customer requirements.
Deliver technical artifacts for customers like MCP servers, sub-agents, and agent skills that will be used in production workflows.
Provide white-glove deployment support for Anthropic products in enterprise environments.
Identify and codify repeatable deployment patterns and contribute insights back to our Product and Engineering teams.
Maintain strong knowledge of the latest developments in LLM capabilities, implementation patterns, and AI product development stacks.
Build long-term relationships with customers and proactively identify new opportunities for AI deployment throughout the lifecycle of an engagement.
Potential travel (based on location) to customer sites to build in person with customers (estimated 25%).
Be a champion for Anthropic’s mission in the field.

You May Be a Good Fit If You Have:
3+ years of experience in a technical, customer-facing role such as Forward Deployed Engineer, or as a Software Engineer with consulting experience. Former technical founders are also encouraged to apply.
Production experience with LLMs including advanced prompt engineering, agent development, evaluation frameworks, and deployment at scale.
Strong programming skills with proficiency in Python (and ideally in one or more additional languages like TypeScript, Java, etc.) and experience shipping production applications.
High agency with an ability to navigate ambiguity present in complex organizations.
High cooperation mindset for cross-organizational collaboration, balancing competing priorities with integrity.
Passion for advancing safe, beneficial AI systems through creative technical applications.
Strong communication skills to conduct discovery with customers and to convey technical concepts to diverse stakeholders while maintaining a low ego and collaborative approach.
A background in financial services, healthcare/life sciences, or another enterprise vertical is a plus.
Experience with enterprise IT systems and/or AI deployment patterns is a plus.
Experience working as an FDE or in a professional services context is a plus.

The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role.
Annual Salary: $200,000–$300,000 USD

Logistics
Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience
Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience
Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position
Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices.
Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this.

We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team.

Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links; visit anthropic.com/careers directly for confirmed position openings.

How we're different
We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact (advancing our long-term goals of steerable, trustworthy AI) rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills.
The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences.

Come work with us!
Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.
Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
1mo ago
Forward Deployed Engineer, Applied AI
Anthropic· Boston, MA; New York City, NY | Seattle, WA; San Francisco, CA | New York City, NY; Washington, DC
About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: As a member of the Applied AI team at Anthropic, you will be a Forward Deployed Engineer (FDE) who embeds directly with our most strategic customers to drive transformational AI adoption. You will collaborate closely with customer teams to ship advanced AI applications that solve real world business problems. Our FDEs engage with customers to accelerate the adoption of existing products and create new applications built on our models. Working closely with our Post-Sales, Product, and Engineering teams, you'll combine engineering expertise, an understanding of frontier AI applications, and customer-facing skills to understand customer workflows and develop innovative solutions that address complex business challenges while maintaining our high standards for safety and reliability. You will sit at the frontier of enterprise AI deployments and serve as one of our founding FDEs who helps to shape our forward-deployed motion. We expect our FDEs to operate autonomously, thrive under ambiguity, and represent Anthropic at the highest level in customer environments. This is a significant responsibility: you’ll play a key role in championing our mission in the enterprise. Responsibilities: Work within customer systems to build production applications with Claude models, ensuring that these products meet customer requirements. Deliver technical artifacts for customers like MCP servers, sub-agents, and agent skills that will be used in production workflows. Provide white glove deployment support for Anthropic products in enterprise environments. 
Identify and codify repeatable deployment patterns and contribute insights back to our Product and Engineering teams. Maintain strong knowledge of the latest developments in LLM capabilities, implementation patterns, and AI product development stacks. Build long-term relationships with customers and proactively identify new opportunities for AI deployment throughout the lifecycle of an engagement. Potential travel (based on location) to customer sites to build in person with customers (estimated 25%). Be a champion for Anthropic’s mission in the field. You May Be a Good Fit If You Have: 3+ years of experience in a technical, customer-facing role such as Forward Deployed Engineer, or as a Software Engineer with consulting experience. Former technical founders are also encouraged to apply. Production experience with LLMs including advanced prompt engineering, agent development, evaluation frameworks, and deployment at scale. Strong programming skills with proficiency in Python (and ideally in one or more additional languages like TypeScript, Java, etc.) and experience shipping production applications. High agency with an ability to navigate ambiguity present in complex organizations. High cooperation mindset for cross-organizational collaboration, balancing competing priorities with integrity. Passion for advancing safe, beneficial AI systems through creative technical applications. Strong communication skills to conduct discovery with customers and to convey technical concepts to diverse stakeholders while maintaining a low ego and collaborative approach. A background in financial services, healthcare/life sciences, or another enterprise vertical is a plus. Experience with enterprise IT systems and/or AI deployment patterns is a plus. Experience working as an FDE or in a professional services context is a plus. The annual compensation range for this role is listed below. 
For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $200,000 — $300,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. 
In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.
1mo ago
Strategic Projects Lead, Generative AI
Scale AI· India
Scale’s Generative AI business unit is currently seeing historic levels of growth. As a Strategic Projects Lead (SPL), you will lead initiatives that will drive $XXM+ in new revenue for the business. This is a demanding role, and as an SPL, you should be prepared to wear many hats such as Operator, Product Manager and customer-facing Engagement Manager. The ideal SPL should have a strong entrepreneurial mindset, be comfortable getting into the weeds, and be excited about intense, impactful work that leads to an accelerated career progression. You will: Lead cross-functional projects with diverse stakeholders (Engineering + Ops + Go-to-Market) Partner with product and engineering teams to enhance products to fulfill needs of strategic customers and initiatives Own the execution of our data labeling operations for strategic projects Give regular progress updates to Scale’s executive team Work on some of the most impactful problems at the company Ideally, you’d have: Strong technical background (a degree in computer science is ideal, and at minimum the role requires the ability to do data analytics using SQL or Python). 2+ years of experience leading a team, developing product or operational processes, or as a SWE. Strong problem-solving capabilities (experience working on operational challenges or as a consultant is a plus). Entrepreneurial experience and mindset - you are excited about building things from scratch. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. 
We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
1mo ago
Research Engineer, Voice
Inflection AI· Palo Alto, California, United States
About Inflection AI Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential. Inflection AI created Pi, the world’s first emotionally intelligent AI, to help people work through decisions, emotions, and challenges. Pi is a personal AI agent powered by Inflection AI’s foundation model, proving that AI can be personal, empathetic, and contextually aware. About the Role We’re looking for a Member of Technical Staff (MTS), Research Engineer focused on voice and audio to help advance the spoken intelligence behind Pi. In this role, you’ll work at the intersection of research and production—developing, training, and shipping neural models across the full spectrum of voice: speech synthesis, recognition, audio generation, and real-time spoken dialogue. You’ll collaborate closely with ML engineers, product teams, and infrastructure to turn cutting-edge ideas in areas like neural audio codecs, diffusion-based TTS, and multimodal foundation models into the natural, expressive voice experiences that millions of Pi users interact with every day. What You’ll Do Research, develop, and optimize neural models for voice and audio—including text-to-speech, automatic speech recognition, audio generation, and spoken dialogue systems. Build and maintain production-grade training and inference pipelines for voice models, with close attention to latency, naturalness, and scalability. Run experiments end-to-end: data curation, model architecture design, training, evaluation, and ablation studies. Collaborate with ML engineers, product teams, and infrastructure to integrate voice models into Pi’s real-time conversational stack. Explore and apply advances in neural audio codecs, diffusion-based synthesis, streaming architectures, and multimodal foundation models to improve Pi’s voice experience. 
Develop robust evaluation frameworks combining perceptual metrics, automated benchmarks, and user-facing quality signals. Contribute to Inflection’s research culture through publications, internal reviews, and knowledge sharing. What We’re Looking For 2-5 years of research or engineering experience (including graduate work) in audio, speech, or multimodal ML. Strong proficiency in PyTorch and hands-on experience training and debugging large-scale neural models on GPU/accelerator clusters. Solid understanding of audio and speech fundamentals: spectrograms, mel features, vocoders, codec-based representations, and signal processing. Demonstrated ability to take a research idea from prototype to production: equally comfortable reading papers and writing efficient, CUDA-aware training loops. Familiarity with modern generative architectures for audio (e.g., diffusion models, autoregressive codecs, flow-matching) and their trade-offs. Clear, collaborative communication, with the ability to distill complex research into actionable insights for cross-functional partners. A bachelor’s degree or equivalent in Computer Science, Electrical Engineering, Linguistics, or a related field; MS or PhD strongly preferred. Employee Pay Disclosures At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary to fall within the range of $225,000 to $325,000, depending on a candidate’s qualifications and level of experience. This role also includes a meaningful equity component, allowing employees to share in the long-term success of the company. Benefits Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. 
Our benefits include: Diverse medical, dental and vision options 401k matching program Unlimited paid time off Parental leave and flexibility for all parents and caregivers Support of country-specific visa needs for international employees living in the Bay Area
1mo ago
Strategic Projects Lead, Generative AI
Scale AI· San Francisco, CA; New York, NY
Scale’s Generative AI business unit is currently seeing historic levels of growth. As a Strategic Projects Lead (SPL), you will lead initiatives that will drive $XXM+ in new revenue for the business. This is a demanding role, and as an SPL, you should be prepared to wear many hats such as Operator, Product Manager and customer-facing Engagement Manager. The ideal SPL should have a strong entrepreneurial mindset, be comfortable getting into the weeds, and be excited about intense, impactful work that leads to an accelerated career progression. You will: Lead cross-functional projects with diverse stakeholders (Engineering + Ops + Go-to-Market) Partner with product and engineering teams to enhance products to fulfill needs of strategic customers and initiatives Own the execution of our data labeling operations for strategic projects Give regular progress updates to Scale’s executive team Work on some of the most impactful problems at the company Ideally, you’d have: Strong technical background (a degree in computer science is ideal, and at minimum the role requires the ability to do data analytics using SQL or Python). 2+ years of experience leading a team, developing product or operational processes, or as a SWE. Strong problem-solving capabilities (experience working on operational challenges or as a consultant is a plus). Entrepreneurial experience and mindset - you are excited about building things from scratch. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. 
Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $112,000 — $190,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. 
We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
1mo ago
Senior / Staff Machine Learning Research Scientist, Agents
Scale AI· San Francisco, CA; Seattle, WA; New York, NY
About Scale At Scale AI, our mission is to accelerate the development of AI applications. For 8 years, Scale has been the leading AI data foundry, helping fuel the most exciting advancements in AI, including generative AI, defense applications, and autonomous vehicles. With our recent Series F round, we’re accelerating the abundance of frontier data to pave the road to Artificial General Intelligence (AGI), and building upon our prior model evaluation work with enterprise customers and governments to deepen our capabilities and offerings for both public and private evaluations. About the ACE team The Agent Capabilities & Environments (ACE) team, part of Scale’s Research organization, brings together customer-facing Researchers and Applied AI Engineers. Our core mission includes research on agent environments and RL reward signals, benchmarking autonomous agent performance across real-world scenarios and environments, creating robust data programs to improve the agentic capabilities of Large Language Models (LLMs), and building foundational tools and frameworks for evaluating models as agents. ACE focuses on autonomous agents that dynamically interact with diverse external environments, including code repositories, GUI interfaces, browsers, and more. About This Role This role is at the intersection of cutting-edge AI research and practical application, with a focus on studying the data types essential for building state-of-the-art agents, such as browser and SWE agents. The ideal candidate will explore the data landscape needed to advance intelligent, adaptable AI agents, guiding the data strategy at Scale to drive innovation. This position requires not only expertise in LLM agents and planning algorithms but also creativity in addressing novel challenges related to data, interaction, and evaluation. 
You will contribute to impactful research publications on agents, collaborate with customer researchers, and work alongside the engineering team to translate these advancements into real-world, scalable solutions. Ideally you’d have: Practical experience working with LLMs, with proficiency in frameworks like PyTorch, JAX, or TensorFlow. You should also be adept at interpreting research literature and quickly turning new ideas into prototypes. A track record of published research in top ML venues (e.g., ACL, EMNLP, NAACL, NeurIPS, ICML, ICLR, COLM, etc.). At least three years of experience addressing sophisticated ML problems, either in a research setting or product development. Strong written and verbal communication skills and the ability to operate cross-functionally. Nice to have: Hands-on experience with open source LLM fine-tuning or involvement in bespoke LLM fine-tuning projects using PyTorch/JAX. Hands-on experience and publications in building applications and evaluations related to AI agents such as tool-use, text2SQL, browser agents, coding agents and GUI agents. Hands-on experience with agent frameworks such as OpenHands, Swarm, LangGraph, etc. Familiarity with agentic reasoning methods such as STaR and PLANSEARCH. Experience working with a cloud technology stack (e.g., AWS or GCP) and developing machine learning models in a cloud environment. Our research interviews are crafted to assess candidates' skills in practical ML prototyping and debugging, their grasp of research concepts, and their alignment with our organizational culture. We will not ask any LeetCode-style questions. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. 
The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity-based compensation, subject to Board of Directors approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for an equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $302,400 - $378,000 USD. PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. 
We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision. PLEASE NOTE: We collect, retain, and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.
1mo ago