Find your next LLM, RAG & AI Agent engineering role

164+ open roles · 10+ companies hiring

164 open positions

Senior LLM Engineer

Featured
O

OpenAI· San Francisco, CA

We are looking for a Senior LLM Engineer to fine-tune and deploy GPT-4 class models. You will work with PyTorch, Python, and Hugging Face to build training pipelines, and collaborate with the research team on prompt engineering and fine-tuning. Salary range: $180,000 - $260,000. This role is fully remote.

$180k - $260k

Remote Full-time OpenAIPythonPrompt Engineering +3 more

2d ago

RAG Engineer

Featured
A

Anthropic· Remote - US

Build retrieval augmented generation (RAG) systems powering Claude-based products. Experience with LangChain, LlamaIndex, Pinecone, and vector databases required. Strong Python and TypeScript skills a plus. $160,000 - $220,000. Work from anywhere.

$160k - $220k

Remote Full-time AnthropicLangChainLlamaIndex +6 more

3d ago

Generative AI Engineer

Featured
P

Perplexity· San Francisco, CA

Build next-generation search experiences powered by RAG and LLMs. Tech stack includes Python, TypeScript, OpenAI, Weaviate, and GraphQL APIs. $165,000 - $230,000. Hybrid - 3 days in office.

$165k - $230k

Full-time OpenAIWeaviatePython +3 more

6d ago

Founding AI Engineer

Featured
P

Perplexity· Remote

Join as a founding engineer to build our core AI agent and RAG infrastructure from the ground up. LangChain, LlamaIndex, OpenAI, Claude, and TypeScript experience preferred. Fully remote. $190,000 - $270,000 + significant equity.

$190k - $270k

Remote Full-time OpenAIAnthropicLangGraph +6 more

12d ago

Research Engineer, RL Engineering

A

Anthropic· San Francisco, CA | New York City, NY | Seattle, WA

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You want to build the cutting-edge systems that train AI models like Claude. You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI. As an ML Systems Engineer on our Reinforcement Learning Engineering team, you'll be responsible for the critical algorithms and infrastructure that our researchers depend on to train models. Your work will directly enable breakthroughs in AI capabilities and safety. You'll focus obsessively on improving the performance, robustness, and usability of these systems so our research can progress as quickly as possible. You're energized by the challenge of supporting and empowering our research team in the mission to build beneficial AI systems. Our finetuning researchers train our production Claude models, and internal research models, using RLHF and other related methods. Your job will be to build, maintain, and improve the algorithms and systems that these researchers use to train models. You’ll be responsible for improving the speed, reliability, and ease-of-use of these systems. You may be a good fit if you: Have 4+ years of software engineering experience Like working on systems and tools that make other people more productive Are results-oriented, with a bias towards flexibility and impact Pick up slack, even if it goes outside your job description Enjoy pair programming (we love to pair!) Want to learn more about machine learning research Care about the societal impacts of your work Strong candidates may also have experience with: High performance, large scale distributed systems Large scale LLM training Python Implementing LLM finetuning algorithms, such as RLHF Representative projects: Profiling our reinforcement learning pipeline to find opportunities for improvement Building a system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training pipeline Making changes to our finetuning systems so they work on new model architectures Building instrumentation to detect and eliminate Python GIL contention in our training code Diagnosing why training runs have started slowing down after some number of steps, and fixing it Implementing a stable, fast version of a new training algorithm proposed by a researcher Deadline to apply: None. Applications will be reviewed on a rolling basis. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $500,000 — $850,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

12h ago

Systems Research Engineer Intern - GPU Programming (Fall 2026)

T

Together AI· San Francisco

About The Role As a Systems Research Engineer Intern specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation. Responsibilities Optimize and fine-tune GPU code to achieve better performance and scalability Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems Stay up-to-date with the latest advancements in GPU programming techniques and technologies Requirements Strong background in GPU programming and parallel computing, such as CUDA and/or Triton. Knowledge of ML/AI applications and models Knowledge of performance profiling and optimization tools for GPU programming Excellent problem-solving and analytical skills Internship Program Details Our fall internship program spans over 12 to 16 weeks where you’ll have the opportunity to work with industry-leading engineers building a cloud from the ground up and possibly contribute to influential open source projects. Our internship dates are September 14th to December 18th. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancements such as FlashAttention, Mamba, FlexGen, Petals, Mixture of Agents, and RedPajama. Compensation We offer competitive compensation, housing stipends, and other competitive benefits. The estimated US hourly rate for this role is $58 to $63. Our hourly rates are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Systems Research Engineer, GPU Programming

T

Together AI· San Francisco

About the Role As a Systems Research Engineer specialized in GPU Programming, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. Working closely with the modeling and algorithm team, you will co-design GPU kernels and model architecture to enhance the performance and efficiency of our AI systems. Collaborating with the hardware and software teams, you will contribute to the co-design of efficient GPU architectures and programming models, leveraging your expertise in GPU programming and parallel computing. Your research skills will be vital in staying up-to-date with the latest advancements in GPU programming techniques, ensuring that our AI infrastructure remains at the forefront of innovation. Requirements Strong background in GPU programming and parallel computing, such as CUDA and/or Triton. Knowledge of ML/AI applications and models Knowledge of performance profiling and optimization tools for GPU programming Excellent problem-solving and analytical skills Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experiences Responsibilities Optimize and fine-tune GPU code to achieve better performance and scalability Collaborate with cross-functional teams to integrate GPU-accelerated solutions into existing software systems Stay up-to-date with the latest advancements in GPU programming techniques and technologies About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Staff Machine Learning Engineer, Voice AI

T

Together AI· San Francisco

About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a Staff ML Engineer to drive the model serving layer for voice workloads. You'll work hands-on with inference engines like TRT-LLM and SGLang to optimize how we serve models like Whisper, Parakeet, Orpheus, and Kokoro — pushing latency and throughput to the frontier. You'll profile GPU utilization, design batching strategies for streaming audio, and ensure new model architectures can go from research to production quickly. This is a foundational hire on a small, high-impact team. Voice inference has unique challenges — streaming audio, tokenization, real-time latency budgets — that require dedicated ML engineering focus. You'll shape how Together serves voice models as the industry moves from pipeline architectures (ASR → LLM → TTS) toward end-to-end speech-to-speech. Own the model serving stack that powers Together's voice platform across STT, TTS, and speech-to-speech. Work directly with state-of-the-art accelerators (H100s, H200s, B200s) to optimize voice model inference. Collaborate with model partners (Cartesia, Deepgram, Rime, and others) to bring their models to production on Together's infrastructure. Build quality evaluation frameworks that guide model selection for customers and inform the roadmap. Join a small, early-stage team with outsized impact on a fast-growing product area. Responsibilities Own the voice inference roadmap end-to-end — define and execute the technical strategy for optimizing STT, TTS, and speech-to-speech models across Together's infrastructure, with a clear-eyed view of where the field is heading and how to position the platform ahead of it. Drive best-in-class inference performance — architect and implement systems targeting leading TTFB, throughput, and GPU utilization for voice workloads; set the performance bar others in the industry measure against, not just catch up to. Lead productionization of voice models at scale — design the serving architecture for serverless and dedicated endpoints, including batching strategies, streaming inference pipelines, and memory management tailored to real-time audio; own reliability and latency SLAs. Build the voice evaluation platform — design a rigorous, extensible evaluation framework covering WER across accents, languages, and noise conditions for STT; naturalness, latency, and pronunciation fidelity for TTS; establish the internal benchmark methodology that informs model selection and roadmap decisions. Shape the architecture for next-generation model support — anticipate and enable emerging model paradigms — audio-native LLMs, codec-based architectures (SNAC, Encodec), and end-to-end speech-to-speech systems — before they're mainstream, not after. Serve as the technical DRI for model partner integrations — lead deep collaboration with partners such as Cartesia, Deepgram, and Rime; own the full lifecycle from integration to optimization to ongoing performance accountability. Diagnose and resolve the hardest performance problems in the stack — conduct systematic profiling and root-cause analysis from GPU kernel behavior to framework-level bottlenecks; drive shipped improvements with documented, measurable impact. Influence platform architecture across the organization — partner with platform engineering leadership to ensure the serving layer is built for the latency and reliability demands of real-time voice APIs; your technical decisions should raise the ceiling for the whole team. Define and scale voice fine-tuning capabilities — lead the technical direction for enabling customers to fine-tune STT and TTS models on Together's infrastructure, establishing the primitives for differentiated voice experiences. Lay technical foundations for a category-defining product surface — architect systems with enough foresight that they support multiple new voice products with minimal rework; think in terms of platforms, not point solutions. Requirements 8+ years of ML engineering experience, with a demonstrated focus on model serving, inference optimization, or ML infrastructure at production scale — including systems you've owned from design through live traffic. Deep, practical expertise in LLM serving engines (vLLM, SGLang, TensorRT-LLM, or equivalent) — you've modified engine internals, debugged edge cases under load, and contributed improvements back; you don't stop at the API surface. Expert-level Python and PyTorch proficiency, with a strong command of GPU optimization — CUDA kernels, memory hierarchies, profiling toolchains — and a track record of turning that knowledge into shipped latency or throughput wins. Proven system design judgment — you've made architectural decisions that held up at scale and influenced how a team or platform evolved; you can articulate the tradeoffs you made and why. Strong technical leadership — you operate with high autonomy, define the right problems before solving them, and raise the bar for engineering quality around you without requiring process overhead. Sharp product intuition for developer tooling — you understand what voice application developers actually need to ship great products, and you let that shape your technical priorities, not just the other way around. Proven ability to move fast in ambiguous environments — you've thrived on early-stage or platform teams where scope is wide, ownership is deep, and the roadmap you build is the one you execute. Strong foundation in speech and audio ML (ASR/TTS architectures, audio signal processing) — directly relevant experience is strongly preferred; exceptional ML engineering fundamentals with genuine curiosity about the domain is also considered. Familiarity with audio codec and tokenization schemes (SNAC, Encodec, DAC) is a meaningful plus at this level. Experience training or fine-tuning speech models at scale is a significant advantage. Bachelor's or Master's in Computer Science, Electrical Engineering, or related field — or equivalent depth demonstrated through your work. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $220,000 - $280,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Staff Engineer, Distributed Storage and HPC & AI Infrastructure

T

Together AI· San Francisco

About the Role In this role, you will design and deliver multi-petabyte storage systems purpose-built for the world’s largest AI training and inference workloads. You’ll architect high-performance parallel filesystems and object stores, evaluate and integrate cutting-edge technologies such as WekaFS, Ceph, and Lustre, and drive aggressive cost optimization-routinely achieving 30-50% savings through intelligent tiering, lifecycle policies, capacity forecasting, and right-sizing. You will also build Kubernetes-native storage operators and self-service platforms that provide automated provisioning, strict multi-tenancy, performance isolation, and quota enforcement at cluster scale. Day-to-day, you’ll optimize end-to-end data paths for 10-50 GB/s per node, design multi-tier caching architectures, implement intelligent prefetching and model-weight distribution, and tune parallel filesystems for AI workloads. Responsibilities Design multi-petabyte AI/ML storage systems; integrate WekaFS, Ceph, etc.; lead capacity planning and cost optimization (30-50% savings via tiering, lifecycle policies, right-sizing). Design/optimize RDMA, InfiniBand, 400GbE networks; tune for max throughput/min latency; implement NVMe-oF/iSCSI; troubleshoot bottlenecks; optimize TCP/IP for storage. Build Kubernetes storage operators/controllers; enable automated provisioning, self-service abstractions, multi-tenant isolation, quotas; create reusable Helm/Terraform patterns. Deliver 10-50 GB/s per GPU node; optimize caching (weights/datasets/checkpoints), parallel filesystems, and data paths; troubleshoot with profiling tools; scale to thousands of nodes. Build multi-tier caches (local NVMe, distributed, object); optimize data locality and model-weight distribution; implement smart prefetching/eviction. Implement monitoring, alerting, SLOs; design DR/backups with runbooks; run chaos engineering; ensure 99.9%+ uptime via proactive/automated remediation. Partner with ML/SRE teams; mentor on storage best practices; contribute to open-source; write docs, postmortems, and public learnings. Requirements 8+ years in storage engineering with 3+ years managing distributed storage at multi-petabyte scale Proven track record deploying and operating high-performance storage for GPU/HPC clusters Deep Kubernetes and cloud-native storage experience in production environments Strong coding skills in Go and Python with demonstrated ability to build production-grade tools BS/MS in Computer Science, Engineering, or equivalent practical experience History of technical leadership: designing systems that significantly improved performance (>3x), reliability (99.9%+ uptime), or cost efficiency Distributed Storage Systems: Deep expertise in WekaFS, Lustre, GPFS, BeeGFS, or similar parallel filesystems at multi-petabyte scale Object Storage: Production experience with S3, MinIO, Ceph, or R2 including performance optimization and cost management Kubernetes Storage: CSI drivers, StatefulSets, PersistentVolumes, storage operators, and custom controllers Storage optimization for GPU workloads, RDMA/InfiniBand networking, parallel filesystem optimization (100+ GB/s aggregate cluster throughput) Programming: Go and Python for automation, operators, and tooling Infrastructure as Code: Terraform, Ansible, Helm, GitOps (ArgoCD) Linux Storage Stack: Advanced knowledge of filesystems (ext4, xfs), LVM, NVMe optimization, RAID configurations Observability: Prometheus, Grafana, Thanos architecture and operations Nice to Have Skills GPU Direct Storage (GDS), NVMe-oF, storage networking (100GbE/400GbE) ML/AI storage patterns (model weights, checkpointing, dataset caching) Kubernetes operator development (controller-runtime, kubebuilder) Storage snapshots, cloning, and thin provisioning Backup and disaster recovery (Velero, Restic, cross-region replication) Storage encryption (at-rest and in-transit), security and compliance Storage benchmarking and profiling tools (fio, iperf3, iostat, blktrace) About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other benefits, as well as flexibility in terms of remote work. The US base salary range for this full-time position is: $250,000 - $300,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Senior Machine Learning Engineer, Voice AI

T

Together AI· San Francisco

About the Role Together AI is building the best inference infrastructure for voice applications. Our Voice AI platform powers production-grade, real-time voice agents and applications — serving speech-to-text and text-to-speech models with best-in-class latency and reliability. We're looking for a Senior ML Engineer to drive the model serving layer for voice workloads. You'll work hands-on with inference engines like TRT-LLM and SGLang to optimize how we serve models like Whisper, Parakeet, Orpheus, and Kokoro — pushing latency and throughput to the frontier. You'll profile GPU utilization, design batching strategies for streaming audio, and ensure new model architectures can go from research to production quickly. This is a foundational hire on a small, high-impact team. Voice inference has unique challenges — streaming audio, tokenization, real-time latency budgets — that require dedicated ML engineering focus. You'll shape how Together serves voice models as the industry moves from pipeline architectures (ASR → LLM → TTS) toward end-to-end speech-to-speech. Own the model serving stack that powers Together's voice platform across STT, TTS, and speech-to-speech. Work directly with state-of-the-art accelerators (H100s, H200s, B200s) to optimize voice model inference. Collaborate with model partners (Cartesia, Deepgram, Rime, and others) to bring their models to production on Together's infrastructure. Build quality evaluation frameworks that guide model selection for customers and inform the roadmap. Join a small, early-stage team with outsized impact on a fast-growing product area. Responsibilities Optimize inference performance for voice models (STT, TTS, speech-to-speech) — targeting best-in-class TTFB, throughput, and GPU utilization across our curated model set. Productionize voice models on serverless and dedicated endpoints, including batching strategies, streaming inference, and memory management tailored to audio workloads. Build and maintain a voice model evaluation framework — measuring WER across accents, languages, and noise conditions for STT; naturalness, latency, and pronunciation accuracy for TTS. Enable new model architectures in our serving stack as the field evolves, including audio-native LLMs, codec-based models (SNAC), and speech-to-speech systems. Collaborate with model partners to integrate and optimize their models (Cartesia, Deepgram, Rime, and others) running on Together's infrastructure. Profile and debug performance across the full inference stack — from GPU kernels to framework-level bottlenecks — and ship measurable improvements. Work with the platform engineering side of the team to ensure the serving layer meets the latency and reliability requirements of real-time voice APIs. Contribute to voice model fine-tuning capabilities (STT and TTS) as we enable customers to build differentiated voice experiences on Together. Lay the groundwork for multiple new products down the line. Requirements 5+ years of experience in ML engineering, with a focus on model serving, inference optimization, or ML infrastructure. Hands-on experience with LLM serving engines (vLLM, SGLang, TensorRT-LLM, or similar) — comfortable reading and modifying engine internals, not just using APIs. Strong proficiency in Python and PyTorch; experience with GPU profiling and optimization (CUDA, memory management, kernel-level debugging). Track record of shipping ML systems to production with measurable performance improvements. Strong product sense — you think about what developers building voice apps actually need, not just what's technically interesting. Comfort working on a small, early-stage team where you'll wear multiple hats and move fast. Experience with speech and audio ML (ASR, TTS architectures, audio signal processing) is a strong plus but not required — you can learn this quickly if you have strong ML engineering fundamentals. Familiarity with audio codecs and tokenization schemes (SNAC, Encodec, DAC) is a plus. Experience training or fine-tuning speech models is a plus. Bachelor's or Master's degree in Computer Science, Electrical Engineering, or related field, or equivalent practical experience About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $200,000 - $260,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Research Engineer, Frontier Speculative Decoding

T

Together AI· San Francisco, New York City

About the Role Together AI is building the Inference Platform that powers the world's most advanced generative AI models. Your role will be a critical bridge between cutting-edge research and real-world applications, focusing on making translating our internal model training research to production-ready deployment for our customers. This involves a deep commitment to data-centric development, meticulous hyperparameter tuning, and rigorous checkpoint evaluation before models ever hit production. This role will involve understanding customer specific needs and fine-tuning models on our internal data recipe and their proprietary data. The goal is to transform general-purpose models into highly performant, specialized tools that solve real business problems. You will not be training foundation models from scratch but rather focusing on creating highly efficient, specialized models by working with dedicated GPU clusters. Responsibilities Design and iterate on novel speculator algorithms, combining architectural innovations with carefully curated data to push the frontier of accuracy–efficiency tradeoffs. Be the critical link between raw data and a production-ready model, seeing your work directly impact our customers' success. Work in a fast-paced, high-impact role at the cutting edge of generative AI. Collaborate with a team of experts dedicated to solving real-world, high-performance challenges. You'll collaborate directly with customers to understand their needs, and work closely with our core inference and Applied ML research teams to integrate your work into the production platform. A culture of deep technical ownership where you are empowered to take on and solve challenging problems Requirements A genuine love for data curation and processing, with a meticulous attention to detail. You believe that great models start with great data. Demonstrated ability to perform effective hyperparameter searches and understand the trade-offs involved in tuning models for specific tasks. Experience working with and building on top of existing training codebases. You are comfortable navigating complex code and contributing to its improvement. Strong attention-to-detail in evaluating model checkpoints to ensure they meet strict quality, performance, and reliability standards. Experience with Python and PyTorch. Familiarity with SLURM and/or Kubernetes clusters and experience submitting and managing jobs in a high-performance computing environment. Familiarity with modern LLMs and generative models. Basic understanding of distributed training frameworks (e.g., FSDP, DeepSpeed). Bachelor’s, Master’s degree, or Ph.D. in Computer Science, Computer Engineering, or a related field, or equivalent practical experience. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, ATLAS, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $190,000 - $270,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Research Engineer, Core ML

T

Together AI· San Francisco

About the Role This is a research engineering role with direct production impact. You won’t be publishing ideas in isolation—you will translate new RL algorithms, scheduling methods, and inference optimizations into production-grade systems that power Together’s API. Success in this role means shipping measurable improvements in latency, throughput, cost, and model quality at scale. We are looking for researchers who enjoy owning systems end-to-end and turning frontier ideas into robust infrastructure. The Core ML (Turbo) at Together AI team sits at the intersection of efficient inference (algorithms, architectures, engines) and post‑training / RL systems. We build and operate the systems behind Together’s API, including high‑performance inference and RL/post‑training engines that can run at production scale. Our mandate is to push the frontier of efficient inference and RL‑driven training: making models dramatically faster and cheaper to run, while improving their capabilities through RL‑based post‑training (e.g., GRPO‑style objectives). This work lives at the interface of algorithms and systems: asynchronous RL, rollout collection, scheduling, and batching all interact with engine design, creating many knobs to tune across the RL algorithm, training loop, and inference stack. Much of the job is modifying production inference systems—for example, SGLang‑ or vLLM‑style serving stacks and speculative decoding systems such as ATLAS—grounded in a strong understanding of post‑training and inference theory, rather than purely theoretical algorithm design. You’ll work across the stack—from RL algorithms and training engines to kernels and serving systems—to build and improve frontier models via RL pipelines. People on this team are often spiky: some are more RL‑first, some are more systems‑first. Depth in one of these areas plus appetite to collaborate across (and grow toward more full‑stack ownership over time) is ideal. Responsibilities Advance inference efficiency end‑to‑end Design and prototype algorithms, architectures, and scheduling strategies for low‑latency, high‑throughput inference. Implement and maintain changes in high‑performance inference engines (e.g., SGLang‑ or vLLM‑style systems and Together’s inference stack), including kernel backends, speculative decoding (e.g., ATLAS), quantization, etc. Profile and optimize performance across GPU, networking, and memory layers to improve latency, throughput, and cost. Unify inference with RL / post‑training Design and operate RL and post‑training pipelines (e.g., RLHF, RLAIF, GRPO, DPO‑style methods, reward modeling) where 90+% of the cost is inference, jointly optimizing algorithms and systems. Make RL and post‑training workloads more efficient with inference‑aware training loops—for example, async RL rollouts, speculative decoding, and other techniques that make large‑scale rollout collection and evaluation cheaper. Use these pipelines to train, evaluate, and iterate on frontier models on top of our inference stack. Co‑design algorithms and infrastructure so that objectives, rollout collection, and evaluation are tightly coupled to efficient inference, and quickly identify bottlenecks across the training engine, inference engine, data pipeline, and user‑facing layers. Run ablations and scale‑up experiments to understand trade‑offs between model quality, latency, throughput, and cost, and feed these insights back into model, RL, and system design. Own critical systems at production scale Profile, debug, and optimize inference and post-training services under real production workloads, taking research ideas all the way to stable, measurable improvements in deployed systems. Drive roadmap items that require real engine modification—changing kernels, memory layouts, scheduling logic, and APIs as needed. Establish metrics, benchmarks, and experimentation frameworks to validate improvements rigorously. Provide technical leadership (Staff level) Set technical direction for cross‑team efforts at the intersection of inference, RL, and post‑training. Mentor other engineers and researchers on full‑stack ML systems work and performance engineering. Requirements We don’t expect anyone to check every box below. People on this team typically have deep expertise in one or more areas and enough breadth (or interest) to work effectively across the stack. The closer you are to full‑stack (inference + post‑training/RL + systems), the stronger the fit—but being spiky in one area and eager to grow is absolutely okay. You might be a good fit if you: Have a bias toward implementation and shipping —you are excited to modify real engines and services, not just prototype in research code. Have strong expertise in at least one of the following, and are excited to collaborate across (and grow into) the others: Systems‑first profile: Large‑scale inference systems (e.g., SGLang, vLLM, FasterTransformer, TensorRT, custom engines, or similar), GPU performance, distributed serving. RL‑first profile: RL / post‑training for LLMs or large models (e.g., GRPO, RLHF/RLAIF, DPO‑like methods, reward modeling), and using these to train or fine‑tune real models. Model architecture design for Transformers or other large neural nets. Distributed systems / high‑performance computing for ML. Are comfortable working from algorithms to engines: Strong coding ability in Python Experience profiling and optimizing performance across GPU, networking, and memory layers. Able to take a new sampling method, scheduler, or RL update and turn it into a production‑grade implementation in the engine and/or training stack. Have a solid research foundation in your area(s) of depth: Track record of impactful work in ML systems, RL, or large‑scale model training (papers, open‑source projects, or production systems). Can read new RL / post‑training papers, understand their implications on the stack, and design minimal, correct changes in the right layer (training engine vs. inference engine vs. data / API). Operate well as a full‑stack problem solver: You naturally ask: “Where in the stack is this really bottlenecked?” You enjoy collaborating with infra, research, and product teams, and you care about both scientific quality and user‑visible wins. Minimum qualifications 3+ years of experience working on ML systems, large‑scale model training, inference, or adjacent areas (or equivalent experience via research / open source). Advanced degree in Computer Science, EE, or a related field, or equivalent practical experience. Demonstrated experience owning complex technical projects end‑to‑end. If you’re excited about the role and strong in some of these areas, we encourage you to apply even if you don’t meet every single requirement. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $200,000 - $280,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Machine Learning, Platform Engineer

T

Together AI· San Francisco

About the Role Our team focuses on enabling custom models and dedicated inference on Together. We are responsible for building a container platform, optimizing autoscaling, minimizing cold starts, achieving the best end-to-end model performance, and providing a best-in-class developer experience with great tooling. We often focus on video or audio generation across the stack: CUDA kernels, pytorch optimization, inference engines, container orchestration, queueing theory, etc. An ideal candidate will be great at profiling/optimization but know the word kubernetes, or be intimately familiar with multi-cluster scheduling and have some sense of ML bottlenecks. Responsibilities New hires may work on multi-cluster orchestration, portfolio optimization, predictive autoscaling, control panes, model bring-up, model optimization, APIs for managing deployments, inference worker SDKs, and CLI tools. Analyze and improve the robustness and scalability of existing distributed systems, APIs, databases, and infrastructure Partner with product teams to understand functional requirements and deliver solutions that meet business needs Write clear, well-tested, and maintainable software and IaC for both new and existing systems Conduct design and code reviews, create developer documentation, and develop testing strategies for robustness and fault tolerance Requirements 5+ years of demonstrated experience in building large scale, fault tolerant, distributed systems. Experience running serverless inference platforms, doing model bring-up on short notice, being on call, or running a cloud provider is a very big plus Good taste and ability to thoughtfully discuss how what you’ve built has failed over time Experience designing, analyzing and improving efficiency, scalability, and stability of various system resources Excellent understanding of low level operating systems concepts including concurrency, networking and storage, performance and scale Expert-level programmer in one or more of Python, Golang, Rust, C++, or Haskell Proficiency in writing and maintaining Infrastructure as Code (IaC) using tools like Terraform Experience with Kubernetes internals or other container orchestration systems Sound judgement for when to use and when to not use LLMs for code Bachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related technical field, or equivalent practical experience Writing-heavy roles or companies are a plus About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $250,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Machine Learning Engineer - Inference

T

Together AI· San Francisco

About the Role Together AI is seeking a Machine Learning Engineer to join our Inference Engine team, focusing on optimizing and enhancing the performance of our AI inference systems. This role involves working with state-of-the-art large language models models and ensuring they run efficiently and effectively at scale. If you are passionate about AI inference, PyTorch, and developing high-performance systems, we want to hear from you. This position offers the chance to collaborate closely with AI researchers and engineers to create cutting-edge AI solutions. Join us in shaping the future at Together AI! Responsibilities Design and build the production systems that power the Together AI inference engine, enabling reliability and performance at scale. Develop and optimize runtime inference services for large-scale AI applications. Collaborate with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world. Conduct design and code reviews to ensure high standards of quality. Create services, tools, and developer documentation to support the inference engine. Implement robust and fault-tolerant systems for data ingestion and processing. Requirements 3+ years of experience writing high-performance, well-tested, production-quality code. Proficiency with Python and PyTorch. Demonstrated experience in building high performance libraries and tooling. Excellent understanding of low-level operating systems concepts including multi-threading, memory management, networking, storage, performance, and scale. Preferred: Knowledge of existing AI inference systems such as TGI, vLLM, TensorRT-LLM, Optimum Preferred: Knowledge of AI inference techniques such as speculative decoding. Preferred: Knowledge of CUDA/Triton programming. Nice to have: Knowledge of Rust, Cython and compilers. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society. Together, we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI. Our team has been behind technological advancements such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey to build the next-generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance, and other competitive benefits. The US base salary range for this full-time position is $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level, and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunities to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

Machine Learning Engineer

T

Together AI· San Francisco

About the Role Together AI is looking for an ML Engineer who will develop systems and APIs that enable our customers to perform inference and fine tune LLMs. Relevant experience includes implementing runtime systems that perform inference at scale using AI/ML models from simple models up to the largest LLMs. Requirements 5+ years experience writing high-performance, well-tested, production quality code Bachelor’s degree in computer science or equivalent industry experience Familiar with LLM inference ecosystem, including frameworks and engines (e.g. vLLM, SGLang, TRT, ...) Demonstrated experience in building large scale, fault tolerant, distributed systems like storage, search, and computation Expert level programmer in one or more of Python, Go, Rust, or C/C++ Experience implementing runtime inference services at scale or similar Responsibilities Design and build the production systems that power the Together Cloud inference and fine-tuning APIs, enabling reliability and performance at scale Partner with researchers, engineers, product managers, and designers to bring new features and research capabilities to the world Analyze and improve efficiency, scalability, and stability of various system resources Conduct design and code reviews Create services, tools & developer documentation Create testing frameworks for robustness and fault-tolerance Participate in an on-call rotation to respond to critical incidents as needed About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $220,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

LLM Inference Frameworks and Optimization Engineer

T

Together AI· San Francisco, Singapore, Amsterdam

About the Role At Together.ai, we are building state-of-the-art infrastructure to enable efficient and scalable inference for large language models (LLMs). Our mission is to optimize inference frameworks, algorithms, and infrastructure, pushing the boundaries of performance, scalability, and cost-efficiency. We are seeking an Inference Frameworks and Optimization Engineer to design, develop, and optimize distributed inference engines that support multimodal and language models at scale. This role will focus on low-latency, high-throughput inference, GPU/accelerator optimizations, and software-hardware co-design, ensuring efficient large-scale deployment of LLMs and vision models. This role offers a unique opportunity to shape the future of LLM inference infrastructure, ensuring scalable, high-performance AI deployment across a diverse range of applications. If you're passionate about pushing the boundaries of AI inference, we’d love to hear from you! Responsibilities Inference Framework Development and Optimization Design and develop fault-tolerant, high-concurrency distributed inference engine for text, image, and multimodal generation models. Implement and optimize distributed inference strategies, including Mixture of Experts (MoE) parallelism, tensor parallelism, pipeline parallelism for high-performance serving. Apply CUDA graph optimizations, TensorRT/TRT-LLM graph optimizations, and PyTorch-based compilation (torch.compile), and speculative decoding to enhance efficiency and scalability. Software-Hardware Co-Design and AI Infrastructure Collaborate with hardware teams on performance bottleneck analysis, co-optimize inference performance for GPUs, TPUs, or custom accelerators. Work closely with AI researchers and infrastructure engineers to develop efficient model execution plans and optimize E2E model serving pipelines. Requirements Must-Have: Experience: 3+ years of experience in deep learning inference frameworks, distributed systems, or high-performance computing. Technical Skills: Familiar with at least one LLM inference frameworks (e.g., TensorRT-LLM, vLLM, SGLang, TGI(Text Generation Inference)). Background knowledge and experience in at least one of the following: GPU programming (CUDA/Triton/TensorRT), compiler, model quantization, and GPU cluster scheduling. Deep understanding of KV cache systems like Mooncake , PagedAttention , or custom in-house variants. Programming: Proficient in Python and C++/CUDA for high-performance deep learning inference. Optimization Techniques: Deep understanding of Transformer architectures and LLM/VLM/Diffusion model optimization. Knowledge of inference optimization, such as workload scheduling, CUDA graph, compiled, efficient kernels Soft Skills: Strong analytical problem-solving skills with a performance-driven mindset. Excellent collaboration and communication skills across teams. Nice-to-Have: Experience in developing software systems for large-scale data center networks with RDMA/RoCE Familiar with distributed filesystem(e.g., 3FS, HDFS, Ceph) Familiar with open source distributed scheduling/orchestration frameworks, such as Kubernetes (K8S) Contributions to open-source deep learning inference projects. About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $160,000 - $230,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

AI infrastructure Engineer (SRE) Amsterdam

T

Together AI· Amsterdam

As a AI Infrastructure Engineer (SRE) at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems. Requirements 7+ years of professional SRE or related experience Bachelor's degree in Computer Science or a related field or equivalent work experience Expert knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes Proficiency in programming/scripting languages Direct experience in monitoring and observability practices Advanced knowledge of cloud services Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts Responsibilities Be on an on-call (PagerDuty) rotation to respond to incidents that impact availability Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users Build monitoring systems to ensure the highest quality service for our customers Design and implement operational processes (such as deployments and upgrades) Debug production issues across all services and levels of the stack Identify improvements for the product architecture from the reliability, performance and availability perspectives Plan the growth of Together AI’s infrastructure About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more. Please see our privacy policy at https://www.together.ai/privacy

15h ago

AI Infrastructure Engineer

T

Together AI· San Francisco

As an AI Infrastructure Engineer at Together, you are responsible for keeping all user-facing services and production systems running smoothly. You are a blend of a pragmatic operator and a software engineer that applies sound engineering principles, operational discipline, and mature automation to our operating environments and codebase. You specialize in systems (operating systems, storage subsystems, networking), while implementing best practices for availability, reliability and scalability, with varied interests in algorithms and distributed systems. Responsibilities Participate in on-call rotation (Pagerduty) to respond to production incidents Build and run our infrastructure with Ansible, Terraform, and Kubernetes to enable scaling to a massive number of concurrent users Build monitoring systems to ensure the highest quality service for our customers Design and implement operational processes (such as deployments and upgrades) Debug production issues across all services and levels of the stack Identify improvements for the product architecture from the reliability, performance and availability perspectives Plan the growth of Together AI's infrastructure Requirements 5+ years of professional AI Infra or related experience Bachelor's degree in Computer Science or a related field or equivalent work experience Knowledge of Ansible (roles, playbooks), Terraform, and Kubernetes Proficiency in programming/scripting languages Direct experience in monitoring and observability practices Knowledge of cloud services Ability to thrive in a collaborative environment involving different stakeholders and subject matter experts About Together AI Together AI is a research-driven artificial intelligence company. We believe open and transparent AI systems will drive innovation and create the best outcomes for society, and together we are on a mission to significantly lower the cost of modern AI systems by co-designing software, hardware, algorithms, and models. We have contributed to leading open-source research, models, and datasets to advance the frontier of AI, and our team has been behind technological advancement such as FlashAttention, Hyena, FlexGen, and RedPajama. We invite you to join a passionate group of researchers and engineers in our journey in building the next generation AI infrastructure. Compensation We offer competitive compensation, startup equity, health insurance and other competitive benefits. The US base salary range for this full-time position is: $190,000 - $270,000 + equity + benefits. Our salary ranges are determined by location, level and role. Individual compensation will be determined by experience, skills, and job-related knowledge. Equal Opportunity Together AI is an Equal Opportunity Employer and is proud to offer equal employment opportunity to everyone regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, veteran status, and more.

15h ago

Strategic Finance Manager, Gen AI

S

Scale AI· San Francisco, CA

We are building out the Finance team to help make data-driven and financially sound decisions for Scale. The Finance team drives strategic, financial, and operational decisions by partnering with the leadership team to make critical decisions across Scale. We’re looking for a high-performing, all-rounded finance athlete to join our team and support the rapidly growing Generative AI (GenAI) business. You’ll collaborate closely with Product, Operations, Growth, and Go-to-Market leaders to bring financial rigor to decision-making, develop actionable insights that drive strategy, and build scalable systems as the business expands. This role is ideal for someone with 4-6 years of experience in a fast-paced, high-growth environment. Someone who thrives in ambiguity, can juggle multiple workstreams, and brings a mix of analytical rigor, business acumen, and strong execution. You will: Own and evolve part of the GenAI financial forecasting model, driving accuracy and insight across planning cycles Support reporting and performance management, including weekly and monthly reviews, consolidations, and ad hoc analyses Partner with GenAI leadership and cross functional teams to evaluate and execute key strategic and operational initiatives that scale the business multifold Conduct financial analyses and build business cases for new products, partnerships, and investments Collaborate with Accounting, and Corporate Finance to improve close, reporting, and planning cadences Continuously improve financial processes and systems to enhance scalability, forecast precision, and data visibility Ideally, you'd have: 4–6 years of experience in Strategic Finance, FP&A, or Business Operations, ideally within a high-growth technology company 2 years of investment banking experience at a top-tier firm Strong analytical and financial modeling skills; ability to translate complex data into actionable insights Excellent communication skills, with the ability to distill complexity into clear narratives for non-finance stakeholders Advanced proficiency in Excel, Google Sheets, and PowerPoint; strong command of financial modeling best practices Experience with SQL or Business Intelligence tools (e.g., Looker, Tableau) Familiarity with Anaplan, Adaptive Insights, or other planning systems Nice to haves: Bachelor’s degree in Finance, Accounting, Economics, Engineering, or a related field Prior experience supporting Product, Engineering, Growth, or Operations teams within a technology company Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. The base salary range for this full-time position in the location of San Francisco is: $176,400 — $220,500 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision . PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

17h ago

Applied AI Claude Evangelist, Startups

A

Anthropic· San Francisco, CA

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: You'll be the face of Anthropic in the startup ecosystem — driving awareness and activation among founders and their technical teams. Your focus is lighting up the startup developer community and getting builders excited about, onboarded onto, and actively building on top of the Claude Developer Platform. This role sits at the intersection of startup ecosystem development and hands-on technical activation. Your primary mandate is driving adoption across the startup funnel — turning ecosystem touchpoints into active, building developers on Anthropic's platform. You'll work closely with the Startups GTM team to convert ecosystem relationships — with VCs, accelerators, and founder communities — into active, building developers. Responsibilities: Drive Net New Logo Acquisition Through Developer Enablement Lead hands-on developer onboarding experiences at ecosystem events that convert startup founders and their engineers from first API interaction to committed platform adoption Build scalable enablement programs — hands-on workshops, build-a-thons, and technical office hours — that activate developers at VC and accelerator partner events Develop a playbook for turning ecosystem touchpoints (builder summits, founder days, VC partnerships) into measurable developer sign-ups and active usage Partner closely with the Startups GTM team to create seamless handoffs from relationship → activation → retention Create Compelling Technical Content Create high-quality technical content — tutorials, demo apps, and blog posts — that showcase how to build real products on Anthropic's API, with a focus on common startup use cases Develop code demos and working prototypes that showcase Claude's capabilities in ways that resonate with technical founders, particularly at ecosystem events and webinars Partner with the Product team to stay current on platform capabilities and ensure content reflects the latest developer tooling and best practices Lead Developer Programming at Startup Events Own developer-facing programming at Anthropic's builder summits and global startup activations Design and run hands-on technical sessions that move developers from curiosity to active building within a single event Build relationships with key technical communities — developer platforms, open-source contributors, and startup accelerator cohorts — to grow Anthropic's developer mindshare Represent Anthropic as a trusted technical resource in the startup ecosystem Partner Across GTM and Ecosystem Surface market signal and developer sentiment from the startup ecosystem back to internal teams Partner with GTM, Sales, and Marketing to create developer-focused activation campaigns and messaging for the startup ecosystem Work with the Startups Partnerships team to ensure VC and accelerator relationships translate into real developer engagement and usage metrics Collaborate closely with the Product team to stay aligned on platform strategy, content, and developer programs Define Success and Scale Define and track success metrics — tied to net new logos, developer activation, and ecosystem engagement — and build reporting that connects evangelist activity to business outcomes Create scalable processes and grow the team as the startup ecosystem expands You may be a good fit if you have: 7+ years of experience across a combination of founding/building startups and developer-facing roles (developer relations, evangelism, or ecosystem development) Experience as a technical founder, early startup employee, or operator who has lived the 0-to-1 journey and knows what it takes to go from idea to product-market fit The ability to write production-quality code, build compelling demos, and credibly engage with technical co-founders and their engineering teams Strong public speaking skills with comfort on stage, the ability to command a room at builder summits and ecosystem events, and deliver technical content that energizes an audience Experience building or scaling developer programs, communities, or enablement motions that drove measurable adoption Builder credibility that earns trust with founders, VCs, and accelerator communities. You've shipped products and can speak from experience Willingness to travel regularly and flexibility to support events that fall outside standard business hours, including evenings and weekends Deep enthusiasm for AI with hands-on experience building with LLMs. You stay close to the technology and genuinely care about building it responsibly The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $275,000 — $380,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

19h ago

People Research Scientist, Recruiting

A

Anthropic· New York City, NY | Seattle, WA; San Francisco, CA | New York City, NY

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role We are seeking a Recruiting Research Scientist to join our People Data Solutions team. You’ll be the research expert supporting our Recruiting organization, using rigorous scientific methods to advance our understanding of recruiting funnels, interview effectiveness, candidate experience, and recruiting capacity. This role sits at the intersection of organizational science, behavioral research, and people strategy – developing novel frameworks and conducting systematic research that drives evidence-based people decisions across our growing organization. This role offers the opportunity to make a significant impact on both our recruiting practices and the broader field of people science at a leading AI safety company. Responsibilities Research design & scientific inquiry Design and execute systematic research studies to answer fundamental questions about recruiting funnel health, assessment quality, candidate experience, and quality of hire Generate and test hypotheses about sourcing strategies, interview design, and selection decisions using rigorous experimental and quasi-experimental methods Conduct mixed-method research to understand what are the drivers and blockers to recruiting operations. Navigate research ethics considerations when studying candidate data, ensuring responsible research practices Selection & assessment research Design and execute validation studies to assess the quality of interviews and other selection tools Utilize psychometric techniques to analyze and improve interviewer calibration and rating consistency Lead investigative research into innovative approaches for candidate assessment Metrics design and governance Design the metrics framework for recruiting org health — defining the canonical KPIs, dimensions, and definitions that leadership uses to understand funnel performance, capacity, and hiring quality Establish the governance and definitional rigor that keeps metrics consistent across tools and reporting surfaces Analytical solution building Architect analytical solutions that convert research insights into actionable products, empowering stakeholders to execute data-driven scenario and strategic planning Quantify the adoption and downstream impact of deployed tools, driving iterative improvements Visualization & communication Build compelling visualizations and dashboards that make complex research findings accessible to diverse audiences Present research findings to senior leadership with clear, actionable recommendations Minimum Qualifications: Hold an advanced degree (Master’s or PhD) in I/O Psychology, Organizational Behavior, Statistics, Data Science, Economics, Behavioral Science, or a related research field Have experience with selection research, assessment validation, psychometrics, or recruiting funnel analytics Are comfortable working in the People Analytics tech stack and collaborating with data engineers Are proficient in SQL and Python/R, with experience in statistical analysis and machine learning Have experience with data visualization and can tell compelling stories with research findings Possess excellent communication skills and can influence stakeholders at all levels Thrive in ambiguity and can balance rigor with pragmatism Have a track record of challenging assumptions with data and changing long-held practices Can navigate sensitive topics diplomatically while maintaining analytical rigor Demonstrate intellectual humility and comfort with iterative discovery Use data to improve how organizations find, assess, and hire talent Preferred Qualifications: 5 + years of experience in research, people analytics, or related quantitative fields with demonstrated research methodology expertise Background in recruiting analytics specifically (not just general analytics) Experience running interview or assessment validation studies Experience building self-service analytics tools or dashboards Previous experience in high-growth technology companies or AI/ML organizations Familiarity with network analysis, machine learning, or advanced statistical methods Experience with BigQuery and modern data stack tools Experience with Greenhouse, Gem, ModernLoop, or similar recruiting tools The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $275,000 — $370,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

20h ago

People Research Scientist, People

A

Anthropic· New York City, NY | Seattle, WA; San Francisco, CA | New York City, NY

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the Role: We are seeking a People Research Scientist to join our People Data Solutions team. You’ll be the research expert supporting our broader People organization, using rigorous scientific methods to advance our understanding of the employee experience, manager effectiveness, organizational health, and workforce dynamics. This role sits at the intersection of organizational science, behavioral research, and people strategy – developing novel frameworks and conducting systematic research that drives evidence-based people decisions across our growing organization. This role offers the opportunity to make a significant impact on both our people practices and the broader field of people science at a leading AI safety company. Responsibilities: Research Design & Scientific Inquiry Design and execute systematic research studies to answer fundamental questions about employee experience, manager effectiveness, and organizational health Generate and test hypotheses about people programs, employee behavior, and workforce outcomes using rigorous experimental and quasi-experimental methods Conduct longitudinal studies tracking employee cohorts to understand long-term workforce trends and the impact of people initiatives Perform meta-analyses of people interventions across the industry to identify best practices and knowledge gaps Navigate research ethics considerations when studying employee data, ensuring responsible research practices Employee listening & survey research Design, analyze, and iterate on employee listening programs including engagement surveys, pulse surveys, and lifecycle surveys Apply psychometric methods to validate survey instruments and ensure measurement reliability Translate survey findings into strategic recommendations that drive meaningful organizational change Manager research & organizational effectiveness Conduct research on manager behaviors, competencies, and their impact on team outcomes Build measurement frameworks to evaluate and improve manager effectiveness programs Study organizational dynamics including team composition, collaboration patterns, and their relationship to performance outcomes Visualization & communication Build compelling visualizations and dashboards that make complex research findings accessible to diverse audiences Present research findings to senior leadership with clear, actionable recommendations Develop self-service analytics capabilities that empower People team partners Minimum Qualifications: Hold an advanced degree (Master’s or PhD) in I/O Psychology, Organizational Behavior, Statistics, Data Science, Economics, Behavioral Science, or a related research field Have experience with experimental design, hypothesis testing, longitudinal research methods, and causal inference Are comfortable working in the People Analytics tech stack and collaborating with data engineers Are proficient in SQL and Python/R, with experience in statistical analysis and machine learning Have experience with survey design, psychometric methods, and employee listening programs Have experience with data visualization and can tell compelling stories with research findings Possess excellent communication skills and can influence stakeholders at all levels Thrive in ambiguity and can balance rigor with pragmatism Have a track record of challenging assumptions with data and changing long-held practices Can navigate sensitive topics diplomatically while maintaining analytical rigor Demonstrate intellectual humility and comfort with iterative discovery Use data to improve how organizations develop and support their people Preferred Qualifications: 5+ years of experience in research, people analytics, or related quantitative fields with demonstrated research methodology expertise Background in people analytics specifically (not just general analytics) Experience designing and analyzing employee engagement or pulse surveys Deep knowledge of manager effectiveness research and organizational science Experience building self-service analytics tools or dashboards Understanding of employee lifecycle metrics and people KPIs Previous experience in high-growth technology companies or AI/ML organizations Familiarity with network analysis, NLP, or advanced statistical methods Experience with BigQuery and modern data stack tools Experience with Qualtrics Experience with Workday The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $275,000 — $370,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

20h ago

Operations Program Manager (Computer Vision), Public Sector

S

Scale AI· St. Louis, MO; Washington, DC

Scale's Public Sector customer base is growing rapidly, and you will be on the front lines of machine learning (ML) operations to accelerate the development of artificial intelligence (AI) applications for the Department of Defense and other national security customers. As a Public Sector Operations Program Manager, you will act as the program manager for multiple projects on the Public Sector Computer Vision team, working cross-functionally with our Delivery and Engineering teams, and collaborating with a team of high performing operations managers and subject matter experts to drive operational efficiencies. You will have team and project-level ownership of the operations of our data labeling system and are responsible for delivering data on time and at a high level of quality across diverse data modalities and levels of clearance. All of this serves an ultimate goal of delivering outsize value in supporting our Public Sector customer’s AI / ML objectives.You will also help lead how we develop and communicate our operational improvements for the customer. You will: Design and lead cross-functional initiatives with engineering, product, analytics, and delivery teams (as well as relevant geospatial experts) to achieve project outcomes Innovate on complex computer vision workflows and evaluation of data quality Communicate and own the execution of multiple data annotation workflows across diverse data types (geospatial imagery, video, etc) Optimize data annotation workflows with systems analysis to improve data quality and throughput with a goal achieving optimized machine learning outcomes Outline and deliver on data-driven operational efficiencies on these workflows to achieve business outcomes Partner with an engagement managers to communicate with and shape outcomes for Scale’s Public Sector customers You have: 5+ years of experience in a general manager / program manager position or a similar leadership role in operations The ability to regularly work onsite in both unclassified and secure facilities in either downtown Saint Louis, MO or the Washington D.C. area An analytical, automation-focused mindset, able to analyze provable and traceable data to develop robust, flexible integrations and workflows with metrics-led results Experience working in a fast-paced environment Low ego and a proven pattern of sharing wins with your team A history of successful project management and comfort in ambiguity Willingness to travel up to 25% of the time based on customer + mission needs Nice to haves: An active TS-SCI security clearance A technical background (education or professional experience with economics, statistics, computer science, or engineering) Experience working in and / or with the U.S. government A deep understanding of ML operations for Computer Vision and Generative AI workflows / products MBA or relevant technical degree Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. The base salary range for this full-time position in the location of Washington DC/Hawaii is: $139,200 — $212,000 USD The base salary range for this full-time position in the location of St. Louis/Suffolk is: $116,000 — $177,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision . PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

3d ago

Applied AI Architect

A

Anthropic· Tokyo, Japan

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role As an Applied AI team member at Anthropic, you will be a Pre-Sales architect focused on becoming a trusted technical advisor helping large enterprises understand the value of Claude and paint the vision on how they can successfully integrate and deploy Claude into their technology stack. You'll combine your deep technical expertise with customer-facing skills to architect innovative LLM solutions that address complex business challenges while maintaining our high standards for safety and reliability. Working closely with our Sales, Product, and Engineering teams, you'll guide customers from initial technical discovery through successful deployment. You'll leverage your expertise to help customers understand Claude's capabilities, develop evals, and design scalable architectures that maximize the value of our AI systems. Responsibilities: Partner with account executives across Japan to deeply understand customer requirements and translate them into technical solutions, ensuring alignment between business objectives and technical implementation Serve as the primary technical advisor to enterprise customers across Japan throughout their Claude adoption journey, from discovery to initial evaluation through deployment. You will need to coordinate internally across multiple teams & stakeholders to drive customer success Support customers building with both the Claude API and Claude for Work Create and deliver compelling technical content tailored to different audiences across Japan. You will need to be able to spread the gamut from technical deep dives for engineering & development teams up to business value focused conversations with executives Guide technical architecture decisions and help customers across Japan integrate Claude effectively into their existing technology stack Help customers develop evaluation frameworks to measure Claude's performance for their specific use cases Identify common integration patterns and contribute insights back to our Product and Engineering teams Travel occasionally to customer sites for workshops, technical deep dives, and relationship building Maintain strong knowledge of the latest developments in LLM capabilities and implementation patterns You may be a good fit if you have: 5+ years of experience in technical customer-facing roles such as Solutions Architect, Sales Engineer, or Technical Account Manager Native-level Japanese fluency and business-level English required Experience working with enterprise customers, navigating complex buying cycles involving multiple stakeholders Exceptional ability to build relationships with and communicate technical concepts to diverse stakeholders to include C-suite executives, engineering & IT teams, and more Strong technical communication skills with the ability to translate customer requirements between technical and business stakeholders Experience designing scalable cloud architectures and integrating with enterprise systems Comfortable with python Familiarity with common LLM frameworks and tools or a background in machine learning or data science Excitement for engaging in cross-organizational collaboration, working through trade-offs, and balancing competing priorities A love of teaching, mentoring, and helping others succeed Excellent communication and interpersonal skills, able to convey complicated topics in easily understandable terms to a diverse set of external and internal stakeholders. You enjoy engaging in cross-organizational collaboration, working through trade-offs, and balancing competing priorities Passion for thinking creatively about how to use technology in a way that is safe and beneficial, and ultimately furthers the goal of advancing safe AI systems Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

4d ago

AI Agent Platform Engineer

C

Cohere· Toronto, Canada

Design and build autonomous AI agents and agent orchestration infrastructure. You will work with Python, Go, Kubernetes, and Docker to scale our agent runtime. Familiarity with OpenAI and Claude APIs is a plus. $150k-$210k.

$150k - $210k

Full-time OpenAICohereLangGraph +6 more

4d ago

Research Engineer, Code RL (Reinforcement Learning)

A

Anthropic· San Francisco, CA | New York City, NY

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning teams play a critical role in advancing our AI systems. We've contributed to all Claude models, with significant impacts on the autonomy and coding capabilities of our latest Claude models. Our work spans several key areas: Developing systems that enable models to use computers effectively Advancing code generation through reinforcement learning Pioneering fundamental RL research for large language models Building scalable RL infrastructure and training methodologies Enhancing model reasoning capabilities We collaborate closely with Anthropic's alignment and frontier red teams to ensure our systems are both capable and safe. We partner with the applied production training team to bring research innovations into deployed models, and are dedicated to implement our research at scale. Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the boundaries of what AI can accomplish. About the Role We're hiring for the Code RL team within the RL organization. As a Research Engineer, you'll advance our models' ability to write, edit, test, debug, and ship real software — end to end, on real codebases, with real tools — and to do it correctly, fast, and safely. This role blends research and engineering. You'll design RL environments and coding tasks, build the reward signals and verifiers that capture what "good code" means, run training experiments on frontier models, diagnose why a model does (or doesn't) get better at a class of software-engineering work, and improve the speed and reliability of the pipelines that make all of that iterate fast. Code RL spans several focus areas — from agentic coding behaviors and code correctness, to long-horizon autonomous engineering, to high-performance code for accelerators — and we'll match you to the area where you'll have the most impact. You may be a good fit if you: Have strong software-engineering skills and deep Python expertise, including async/concurrent programming Are comfortable owning systems end to end and debugging across the stack Can balance research exploration with engineering implementation, and engage rigorously in shaping experimental design and interpreting results Care about code quality, testing, and performance Are passionate about the potential impact of AI and are committed to developing safe and beneficial systems Strong candidates may also have: Experience with reinforcement learning, RLHF, post-training, or LLM finetuning Built coding agents, code-execution sandboxes, eval harnesses, verifiers, or developer tooling Background in program analysis, testing, verification, compilers, or formal methods Experience with PyTorch and large-scale distributed training; performance profiling and optimization of ML systems CUDA / GPU or TPU kernel experience and accelerator-performance intuition Experience with virtualization and sandboxed code execution environments Related roles If your background leans toward one of these areas specifically, you may also want to look at these postings: Research Engineer, Performance RL (Reinforcement Learning) — teaching Claude to write correct, fast code for accelerators Research Engineer, Universes — long-horizon, ultra-realistic agentic training environments Research Engineer, Cybersecurity RL (Reinforcement Learning) — RL for security-relevant coding capabilities The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $500,000 — $850,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

4d ago

Staff Engineer, Agentic

I

Inflection AI· Palo Alto, California, United States

About Inflection AI Inflection AI is a Public Benefit Corporation empowering people with human-centered, emotionally intelligent AI. We’re shaping the future of AI by combining emotional intelligence (EQ) and raw intelligence (IQ) to elevate people’s potential. Inflection AI created Pi, the world’s first emotionally intelligent AI, to help people work through decisions, emotions, and challenges. Pi is a personal AI agent powered by Inflection AI’s foundation model, proving that AI can be personal, empathetic, and contextually aware. About the Role As a Staff Engineer, AGentic at Inflection, you will own the platforms, systems, and services that bring our conversational AI to life at scale. You’ll collaborate across research, product, and infrastructure teams to enable rapid iteration, high reliability, and secure delivery of novel AI features to millions of users. Your work will directly impact both the pace of product development and the stability of our production systems. What you’ll do Design and build backend systems that power agentic workflows in production: orchestration, memory, retrieval, and tool integrations. Operate high-availability inference and conversation pipelines, optimizing for latency, cost, and quality. Build internal platform pieces service templates, CI/CD, observability, evaluation harnesses, and rollout tooling that compound engineering velocity. Partner with applied researchers and frontend engineers to design and iterate on user-facing features Drive features from prototype to production with measurable user impact, ensuring reliability and performance at scale Must-haves 10-15 years of backend engineering experience shipping production systems in cloud environments. Demonstrated experience building, shipping, and operating agentic AI systems in production environments with real user impact not just prototypes. Strong production experience with Python, Postgres, Redis, and at least one major cloud (AWS, GCP, or Azure), including containers and Kubernetes. Practical experience with LLM application patterns: RAG, tool use, evaluation loops, and the latency/cost/quality tradeoffs that come with them. Strong sense of ownership, with the ability to independently drive systems to production and operate them reliably Nice-to-haves Experience with agent orchestration frameworks (LangGraph, LlamaIndex, OpenAI/Anthropic Agent SDKs, or equivalent in-house systems) and evaluation tooling (LangSmith, Braintrust, or similar). Background in streaming, queueing, or vector data infrastructure at scale. Past on-call ownership of customer-facing services with strict SLOs. Contributions to open-source agent or LLM tooling. Employee Pay Disclosures At Inflection AI, we aim to attract and retain the best employees and compensate them in a way that appropriately and fairly values their individual contributions to the company. For this role, Inflection AI estimates a starting annual base salary to fall within the range of $350,000 to $500,000 depending on a candidate’s qualifications and level of experience. This role also includes a meaningful equity component, allowing employees to share in the long-term success of the company. Benefits Inflection AI values and supports our team’s mental and physical health. We are focused on building a positive, safe, inclusive and inspiring place to work. Our benefits include: Diverse medical, dental and vision options 401k matching program Unlimited paid time off Parental leave and flexibility for all parents and caregivers Support of country-specific visa needs for international employees living in the Bay Area

4d ago

Senior/Staff Machine Learning Research Engineer, General Agents, Enterprise GenAI

S

Scale AI· San Francisco, CA; New York, NY

Scale AI is the data foundation for AI, helping organizations build and deploy reliable production AI applications. We partner with leading enterprises and government organizations to accelerate their AI initiatives through our data annotation platform, generative AI solutions, and enterprise AI capabilities. About the General Agents Team The General Agents team, part of Scale’s Enterprise organization, builds robust general agents for customer use cases and applications. The team sits at the intersection of frontier agent development and real-world deployment, translating state-of-the-art reasoning and agentic capabilities into reliable, production-grade systems that drive real economic value. Our agents are scalable systems built around recurring enterprise problem domains, with a strong emphasis on generalization, extensibility, and deployment across many customers. About the Role As a Senior/Staff Machine Learning Engineer (MLE) on the General Agents team, you’ll play a critical role in designing, building, and deploying production-ready AI agents that solve high-impact enterprise problems. You will work across the full agent lifecycle—from model and system design to evaluation, deployment, and iteration—bridging cutting-edge agentic techniques with the constraints and requirements of real customer environments. You will: Design and implement end-to-end agent systems that combine LLM reasoning, tool use, memory, and control logic to solve recurring enterprise use cases. Build scalable, reliable agent architectures that can be deployed across many customers with varying data, tools, and constraints. Develop evaluation frameworks, datasets, environments, and metrics to measure agent performance, reliability, and business impact in production settings. Collaborate closely with product managers, customers, data annotators, and other engineering teams to translate enterprise requirements into robust agent designs. Productionize frontier agent techniques (e.g., planning, multi-step reasoning and tool-use, multi-agent patterns) into maintainable, observable systems. Own deployment, monitoring, and iteration of agent systems, including failure analysis and continuous improvement based on real-world usage. Contribute to technical direction and architectural decisions for general agent development best practices and methods, with increasing scope and leadership at the Staff level. Ideally you’d have: 5+ years of experience building and deploying machine learning or AI systems for real-world, production use cases. Strong engineering fundamentals, supported by a Bachelor’s and/or Master’s degree in Computer Science, Machine Learning, AI, or equivalent practical experience. Deep understanding of modern LLMs, prompt-, context-, and system-level optimization, and agentic system design. Proven proficiency in Python, including writing production-quality, testable, and maintainable code. Experience building systems that integrate models with external tools, APIs, databases, and services. Ability to operate in ambiguous problem spaces, balancing research-driven approaches with pragmatic product constraints. Strong communication skills and comfort working in customer-facing or cross-functional environments. Nice-to-haves: Hands-on experience building AI agents using modern generative AI stacks (OpenAI APIs, commercial or open-source LLMs). Experience with agent frameworks, orchestration layers, or workflow systems (e.g., tool calling, planners, multi-agent setups). Familiarity with evaluation, monitoring, and observability for LLM-powered systems in production. Experience deploying ML systems in cloud environments and operating them at scale. Experience fine-tuning or adapting foundation models using methods like supervised fine-tuning (SFT), reinforcement learning with verifiable rewards (RLVR), and low-rank adaptation (LoRA) to improve agent performance on domain-specific tasks. Interest in shaping the future of general-purpose enterprise agents and their real-world impact. Compensation packages at Scale for eligible roles include base salary, equity, and benefits. The range displayed on each job posting reflects the minimum and maximum target for new hire salaries for the position and may be inclusive of several career levels at Scale; it will be determined during the interview process based on work location and additional factors, including job-related skills, experience, qualifications, interview performance, and relevant education or training. Scale employees in eligible roles are also granted equity based compensation, subject to Board of Director approval. Your recruiter can share more about the specific salary range for your preferred location during the hiring process, and confirm whether the hired role will be eligible for equity grant. You'll also receive benefits including, but not limited to: comprehensive health, dental and vision coverage, retirement benefits, a learning and development stipend, and generous PTO. Additionally, this role may be eligible for additional benefits such as a commuter stipend. Please reference the job posting's subtitle for where this position will be located. For pay transparency purposes, the base salary range for this full-time position in the locations of San Francisco, New York, Seattle is: $264,800 — $331,000 USD PLEASE NOTE: Our policy requires a 90-day waiting period before reconsidering candidates for the same role. This allows us to ensure a fair and thorough evaluation of all applicants. About Us: At Scale, our mission is to develop reliable AI systems for the world's most important decisions. Our products provide the high-quality data and full-stack technologies that power the world's leading models, and help enterprises and governments build, deploy, and oversee AI applications that deliver real impact. We work closely with industry leaders like Meta, Ernst & Young, Mayo Clinic, Time Inc., the Government of Qatar, and U.S. government agencies including the Army and Air Force. We are expanding our team to accelerate the development of AI applications. We believe that everyone should be able to bring their whole selves to work, which is why we are proud to be an inclusive and equal opportunity workplace. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability status, gender identity or Veteran status. We are committed to working with and providing reasonable accommodations to applicants with physical and mental disabilities. If you need assistance and/or a reasonable accommodation in the application or recruiting process due to a disability, please contact us at [email protected]. Please see the United States Department of Labor's Know Your Rights poster for additional information. We comply with the United States Department of Labor's Pay Transparency provision . PLEASE NOTE: We collect, retain and use personal data for our professional business purposes, including notifying you of job opportunities that may be of interest and sharing with our affiliates. We limit the personal data we collect to that which we believe is appropriate and necessary to manage applicants’ needs, provide our services, and comply with applicable laws. Any information we collect in connection with your application will be treated in accordance with our internal policies and programs designed to protect personal data. Please see our privacy policy for additional information.

4d ago

Product Support Specialist (Singapore - Weekend Coverage)

A

Anthropic· Singapore

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role We are hiring our first Singapore-based Product Support Specialists, who will grow our global presence as a team and serve as champions of our unique support brand. You’ll be at the front lines of safely delivering AI to the world by responding to, investigating, and tracking user needs in your day to day. Additionally, you’ll help us identify – and close – gaps in our team’s technical knowledge, provide high-touch support to strategic customers, and demonstrate deep care for how we systematically support customers at scale. Note: Specialists in this role will work Wednesday to Sunday Responsibilities: Become an expert in all Anthropic products Respond to user support cases with a variety of complexity, from simple billing or account questions for individuals to complex API debugging for large businesses Clearly and empathetically communicate with a wide range of user personas, context-switching between guiding executives in a high-touch model to assisting consumer users in a rapid pace Manage on-call tasks for high-urgency user issues with extreme ownership. This includes being on critical ‘pager duty’ during APAC hours to ensure 24/7 Support for Critical Queries. Prioritize critically and comfortably adapt to an ever-evolving product landscape Operate in ambiguity, making informed decisions even in never-before-seen situations Partner with engineers, teammates, and other internal stakeholders to diagnose and resolve user issues, both individually and at scale Suggest and drive improvements to increase user satisfaction through support processes as well as own initiatives that increase efficiency and drive down contact rates Uplevel our team’s technical knowledge by scoping gaps, working with cross-functional partners to deeply understand relevant nuances, and building resources that grow with our products You may be a good fit if you: Have experience in providing technical product support within a second tier, escalated, or priority support team. Are familiar with APIs and technical SaaS products and can deeply understand technical docs with ease Have demonstrated an ability to thrive in fast-paced, reactive situations while meeting core support metrics targets (e.g. CSAT, SLA, etc.) Possess strong user empathy and are expert in the lifecycle of a support case; you can read between the lines of a user’s question, put yourself in their shoes, and get at the heart of their needs for a speedy, satisfying resolution Have crisp but kind written communication skills and a deep care for the details Enjoy helping others learn about new features and complex concepts Are persistent and curious; you delight in the hunt of tracking down a bug or issue, and are energized by fixing this for all similar users going forward Have experience contributing to the foundations of a support team – this is essential, highly valuable, but often unglamorous work. And particularly key for this role due to it being among the first Support hires in APAC. Are proficient at working in a technical environment and are interested in Anthropic’s products Possess a deep sense of ownership, and are excited to help us build our team! Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

5d ago

AI Infrastructure Engineer

D

Databricks· Remote

Own the infrastructure that powers our ML and LLM training pipelines. Strong experience with Kubernetes, AWS, GCP, PostgreSQL, and Redis required. This is a fully remote position. Compensation: $170,000 to $240,000.

$170k - $240k

Remote Full-time AWSKubernetesPython +4 more

5d ago

Applied AI Security Architect

A

Anthropic· London, UK

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role: As an Applied AI Security Architect, you will serve as Anthropic's trusted security expert for our most demanding enterprise customers. You'll engage directly with CISOs, security architects, compliance officers, and technical leaders at the largest financial institutions, insurers, and other highly regulated enterprises across EMEA to address their most critical questions about deploying Claude safely and securely — including the security capabilities of our latest generations of Claude models. This is a pre-sales technical role focused on security, compliance, networking, and data architecture. Your job is to walk into a room full of security professionals and demonstrate deep expertise in enterprise security, regulatory compliance, and data protection. Whether you've been a Security Architect, Solutions Architect, Field CTO, or senior pre-sales engineer, what matters is that you understand how large European institutions evaluate and adopt technology, and can speak credibly to their security and compliance concerns. This is a senior role: you will own our most complex and escalated security conversations in the region, often as the decisive technical voice in front of a CISO. We are looking for someone excited to help define how European enterprises should think about security and compliance in the age of AI. How do MCP, autonomous agents, and RBAC work together? How do you deploy frontier models in line with GDPR, EU data residency, and the EU AI Act? If working at the intersection of AI adoption and regulated industries excites you, this is the role for you. Responsibilities: Serve as the primary security and compliance expert in customer engagements, addressing technical questions about Claude's architecture, data flows, encryption, access controls, and deployment models. Partner with CISOs, security architects, compliance teams, and DPOs to understand their security requirements and design solutions that meet European regulatory standards (GDPR, EU AI Act, DORA, NIS2, SOC 2, PCI-DSS, and national regulator expectations). Lead technical deep-dives on network architecture, EU data residency, data retention and Zero Data Retention (ZDR) policies, cross-border data transfers, API security, authentication/authorization, audit logging, and integration patterns for regulated environments. Support enterprise security reviews, vendor assessments, and due diligence with detailed technical documentation and expert guidance. Guide customers through EU AI Act readiness, DORA, and NIS2, and position Anthropic within the European competitive landscape. Collaborate with Sales and Applied AI teams from initial conversations through deployment. Partner closely with Anthropic’s product and engineering teams to understand Claude's security capabilities, relay customer feedback, and influence the roadmap. Develop security-focused collateral, reference architectures, and best practices for regulated industries. Travel regularly across EMEA for security workshops, architecture reviews, and strategic account meetings. You may be a good fit if you have: 7+ years of experience in enterprise security, cloud architecture, or technical pre-sales, with significant exposure to regulated industries in EMEA (financial services, insurance, healthcare). Deep technical knowledge of enterprise security concepts: network security, identity and access management, encryption (at rest and in transit), API security, and audit/logging requirements. Deep working knowledge of GDPR (data residency, data retention and Zero Data Retention, international transfers, DPIAs) and strong familiarity with the EU AI Act as it applies to foundation models. Experience navigating compliance frameworks relevant to European financial services and insurance (DORA, NIS2, SOC 2, PCI-DSS, and national regulators' guidance on AI/ML such as FCA/PRA, BaFin, ACPR, FINMA). A track record of leading complex, high-stakes engagements with CISOs, security teams, and compliance officers at large European enterprises. Strong understanding of cloud architecture and deployment models (AWS, Azure, GCP), including VPCs, private endpoints, hybrid connectivity, and the European sovereign cloud landscape. Excellent communication skills: able to explain complex security topics to technical and non-technical audiences. Fluent English; additional European languages are a strong plus. A good understanding of the EMEA enterprise AI market and competitive landscape. The ability to navigate ambiguity and move fast in a rapidly evolving market. A collaborative mindset: sales at Anthropic is a team sport. Excitement about AI's potential to transform highly regulated industries, and a genuine desire to help customers adopt it safely and responsibly. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: £190,000 — £225,000 GBP Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

5d ago

Research Engineer, Performance RL (Reinforcement Learning)

A

Anthropic· San Francisco, CA

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the RL Teams Our Reinforcement Learning teams lead Anthropic's reinforcement learning research and development, playing a critical role in advancing our AI systems. We've contributed to all Claude models, with significant impacts on the autonomy and coding capabilities of Claude Sonnet 4.6 and Opus 4.6. Our work spans several key areas: Developing systems that enable models to use computers effectively Advancing code generation through reinforcement learning Pioneering fundamental RL research for large language models Building scalable RL infrastructure and training methodologies Enhancing model reasoning capabilities We collaborate closely with Anthropic's alignment and frontier red teams to ensure our systems are both capable and safe. We partner with the applied production training team to bring research innovations into deployed models, and are dedicated to implement our research at scale. Our Reinforcement Learning teams sit at the intersection of cutting-edge research and engineering excellence, with a deep commitment to building high-quality, scalable systems that push the boundaries of what AI can accomplish. About the Role We're hiring for the Code RL team within the RL organization. As a Research Engineer, you'll advance our models' ability to safely write correct, fast code for accelerators. You'll need to know accelerator performance well to turn it into tasks and signals models can learn from. Specifically, you will: Invent, design and implement RL environments and evaluations. Conduct experiments and shape our research roadmap. Deliver your work into training runs. Collaborate with other researchers, engineers, and performance engineering specialists across and outside Anthropic. You may be a good fit if you: Have expertise with accelerators (CUDA, ROCm, Triton, Pallas), ML framework programming (JAX or PyTorch). Have worked across the stack – kernels, model code, distributed systems. Know how to balance research exploration with engineering implementation. Are passionate about AI's potential and committed to developing safe and beneficial systems. Strong candidates may also have: Experience with reinforcement learning. Experience porting ML workloads between different types of accelerators. Familiarity with LLM training methodologies. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $350,000 — $850,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

5d ago

Research Engineer, Cybersecurity RL (Reinforcement Learning)

A

Anthropic· San Francisco, CA | New York City, NY

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About Horizons The Horizons team leads Anthropic's reinforcement learning (RL) research and development, playing a critical role in advancing our AI systems. We've contributed to every Claude release, with significant impact on the autonomy, coding, and reasoning capabilities of Anthropic's models. About the role We're hiring for the Cybersecurity RL team within Horizons. As a Research Engineer, you'll help to safely advance the capabilities of our models in secure coding, vulnerability remediation, and other areas of defensive cybersecurity. This role blends research and engineering, requiring you to both develop novel approaches and realize them in code. Your work will include designing and implementing RL environments, conducting experiments and evaluations, delivering your work into production training runs, and collaborating with other researchers, engineers, and cybersecurity specialists across and outside Anthropic. The role requires domain expertise in cybersecurity paired with interest or experience in training safe AI models. For example, you might be a white hat hacker who's curious about how LLMs could augment or transform your work, a security engineer interested in how AI could help harden systems at scale, or a detection and response professional wondering how models could enhance defensive workflows. You may be a good fit if you: Have experience in cybersecurity research. Have experience with machine learning. Have strong software engineering skills. Can balance research exploration with engineering implementation. Are passionate about AI's potential and committed to developing safe and beneficial systems. Strong candidates may also have: Professional experience in security engineering, fuzzing, detection and response, or other applied defensive work. Experience participating in or building CTF competitions and cyber ranges. Academic research experience in cybersecurity. Familiarity with RL techniques and environments. Familiarity with LLM training methodologies. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $300,000 — $405,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

5d ago

Applied AI Architect, Public Sector

A

Anthropic· London, UK

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role As an Applied AI team member at Anthropic, you will be a pre-sales architect focused on becoming a trusted technical advisor to the UK and Northern Europe public sector, with a primary focus on UK Central Government departments, executive agencies, and arm's length bodies, and a reach extending across devolved administrations, local government, the NHS, and Northern European public sector markets. This includes a growing focus on defence and national security, working with the MOD and intelligence agencies on some of the UK's most sensitive and mission-critical challenges. You will help these organisations understand the value of Claude and paint the vision for how they can successfully integrate and deploy Claude into their technology estates to modernise operations, improve policy delivery, and transform citizen services. You’ll combine your deep technical expertise with customer-facing skills to architect innovative LLM solutions that address complex mission challenges while maintaining our high standards for safety and reliability. Working closely with our Sales, Product, Engineering, and Partnerships teams, you’ll guide customers from initial technical discovery through successful deployment. You’ll leverage your expertise to help customers understand Claude’s capabilities, develop evals, and design scalable, compliant architectures that maximise the value of our AI systems within the constraints that public sector organisations operate under. Responsibilities Partner with account executives to deeply understand customer requirements and translate them into technical solutions, ensuring alignment between departmental outcomes, policy objectives, and technical implementation. Serve as the primary technical advisor to public sector customers throughout their Claude adoption journey, from discovery to initial evaluation through deployment. You will need to coordinate internally across multiple teams and stakeholders to drive customer success. Support customers building with Claude Code, the Claude API, and Claude for Enterprise. Create and deliver compelling technical content tailored to different audiences. You will need to span the gamut from technical deep dives for engineering and delivery teams up to business-value conversations with senior civil servants and C-suite executives (Permanent Secretaries, Directors General, SROs, CDIOs). Support defence and national security engagements, including with the MOD and intelligence agencies, designing solutions that work within the security, classification, and accreditation constraints of these environments. Guide technical architecture decisions and help customers integrate Claude effectively into their existing technology stack, with alignment to NCSC guidelines, Cyber Essentials Plus, the Government Security Classifications framework, the Technology Code of Practice, and the Service Standard. Help customers develop evaluation frameworks to measure Claude’s performance for their specific use cases. Identify common integration patterns across the UK public sector and contribute insights back to our Product and Engineering teams. Travel frequently to customer sites across the UK (and occasionally Northern Europe) for workshops, technical deep dives, and relationship building. Maintain strong knowledge of the latest developments in LLM capabilities and implementation patterns. You may be a good fit if you have Prior experience working with UK public sector organisations — particularly UK Central Government departments, executive agencies, or arm’s length bodies. Active UK Security Check (SC) clearance, with willingness and eligibility to be put forward for higher levels of clearance (e.g. DV) where customer engagements require it. 5+ years of experience in technical customer-facing roles such as Solutions Architect, Sales Engineer, or Technical Account Manager. Experience navigating complex public sector buying cycles involving multiple stakeholders — commercial teams, digital and technology leadership, policy owners, and SROs. Familiarity with UK public procurement routes and frameworks is preferred — e.g. G-Cloud, the AI Dynamic Purchasing System (AI DPS), Digital Outcomes, and Crown Commercial Service agreements — and experience working with systems integrators and delivery partners within these frameworks. Exceptional ability to build relationships with and communicate technical concepts to diverse stakeholders, including senior civil servants, C-suite executives, engineering, and IT teams. Strong technical communication skills with the ability to translate customer requirements between technical and business stakeholders. Experience designing scalable cloud architectures and integrating with enterprise systems. Comfortable with Python. Familiarity with common LLM frameworks and tools, or a background in machine learning or data science. Excitement for engaging in cross-organisational collaboration, working through trade-offs, and balancing competing priorities. A love of teaching, mentoring, and helping others succeed. Excellent communication and interpersonal skills, able to convey complicated topics in easily understandable terms to a diverse set of external and internal stakeholders. Passion for thinking creatively about how to use technology in a way that is safe and beneficial, and ultimately furthers the goal of advancing safe AI systems. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: £165,000 — £190,000 GBP Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

6d ago

Manager of Applied AI Architecture, Enterprise Tech (Cyber)

A

Anthropic· New York City, NY; San Francisco, CA | New York City, NY; Seattle, WA

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the Role: As the manager of the Applied AI Architect, Enterprise Tech (Cyber) team at Anthropic, you will drive the adoption of frontier AI across the cybersecurity industry — enabling leading security companies to build Claude into their detection, response, and security operations products via Anthropic's API, Claude Code, and Claude for Enterprise. You'll leverage your technical skills and consultative sales experience to help security vendors ship AI-powered capabilities that meet the industry's exacting standards for reliability, accuracy, and safety. You'll be responsible for leading & growing the Applied AI Architect, Enterprise Tech (Cyber) team, establish processes and best practices for your segment's pre-sales engagements based on your years of experience, help each team member achieve success, high productivity, and career growth, and represent Anthropic as a technical lead on some of its most important partnerships. In collaboration with the Sales, Product, and Engineering teams, you'll help enterprise tech partners incorporate leading-edge AI systems into their cutting-edge products and platforms. You will employ your excellent communication skills to explain and demonstrate complex solutions persuasively to technical and non-technical audiences alike. You will play a critical role in identifying opportunities to innovate and differentiate our AI systems, while maintaining our best-in-class safety standards. Responsibilities: Manage and mentor a team of Applied AI Architects, Enterprise Tech (Cyber) providing both technical guidance and career development Set goals and reviews for your team, promoting growth and output Work with a handful of highest-value Enterprise Tech customers on their overall AI adoption strategies, focusing on pre-sales technical excellence including use case scoping, technical champion building, and POC execution Partner closely with your aligned GTM leadership to understand customer requirements & co-build GTM strategies to drive adoption for Enterprise Tech (Cyber) customers Own the technical portions of pre-sales engagements, ensuring your team provides compelling demos and validates enterprise customer ROI from Anthropic products Drive collaboration from cross-functional teams to influence and unify stakeholders at all levels of the organization to drive business outcomes Travel occasionally to customer sites for executive-level sessions, technical workshops, and building relationships Establish a shared vision for creating solutions that enable beneficial and safe AI in technology products Lead the vision, strategy, and execution of innovative solutions that leverage our latest models' capabilities for tech-forward use cases Contribute to thought leadership through conference presentations, webinars, and technical content creation Stay current with emerging AI/ML trends and competitive landscape in the enterprise tech sector You may be a good fit if you: 7+ years of experience as a Solutions Architect, Sales Engineer, or similar pre-sales technical role 3+ years of technical go-to-market management experience, specifically managing pre-sales teams Have experience working with and selling to Digital Native focused customers (Vertical Enterprise SaaS, Horizontal Enterprise SaaS, Consumer Technology Companies, PaaS, etc.) Experience with the unique technical requirements and technical procurement process of enterprise tech companies Deep technical proficiency with enterprise AI deployments, API integrations, and production LLM use cases Have an organizational mindset and enjoy building foundational teams in a relatively unstructured environment Have excellent communication, collaboration, and coaching abilities Are comfortable dealing with highly uncertain, ambiguous, and fast-moving environments typical of the tech industry Strong executive presence and ability to foster deep relationships with technical leaders and engineering teams Have at least a high level familiarity with the architecture and operation of large language models and/or ML in general Experience with prompt engineering, LLM evaluation, and architecting AI-powered systems Make ambiguous problems clear and identify core principles that can translate across scenarios Have a passion for making powerful technology safe and societally beneficial Think creatively about the risks and benefits of new technologies, and think beyond past checklists and playbooks Stay up-to-date and informed by taking an active interest in emerging research and industry trends Understanding of developer tooling, SDKs, and technical integration patterns common in enterprise tech companies Strong candidates may have: Enterprise SA Leadership at Scale : 5+ years leading solution architect teams through hypergrowth (ideally 10→50+ people), with direct experience managing both senior SAs and developing junior talent in complex enterprise software environments AI/ML Technical Depth + Executive Engagement : Hands-on experience with AI/ML platforms and enterprise integration patterns, combined with proven track record engaging C-level stakeholders in $10M+ technical evaluations and enterprise sales cycles Multi-Segment GTM Experience : Demonstrated success adapting technical approaches across customer segments (startup to Fortune 500), with experience spanning the full deal spectrum from $2M employee empowerment through $100M+ core business transformation initiatives Deadline to apply: None. Applications will be reviewed on a rolling basis. The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $315,000 — $380,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

6d ago

Machine Learning Platform Engineer

S

Scale AI· San Francisco, CA

Develop internal tooling for fine-tuning and evaluating large language models. Requires PyTorch, TensorFlow, Python, and experience with NLP pipelines. $140,000 - $190,000.

$140k - $190k

Full-time PythonFine-TuningPyTorch +2 more

7d ago

Research Engineer / Research Scientist, Tokens

A

Anthropic· New York City, NY; New York City, NY | Seattle, WA; San Francisco, CA

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. You want to build large scale ML systems from the ground up. You care about making safe, steerable, trustworthy systems. As a Research Engineer, you'll touch all parts of our code and infrastructure, whether that's making the cluster more reliable for our big jobs, improving throughput and efficiency, running and designing scientific experiments, or improving our dev tooling. You're excited to write code when you understand the research context and more broadly why it's important. Note: This is an "evergreen" role that we keep open on an ongoing basis. We receive many applications for this position, and you may not hear back from us directly if we do not currently have an open role on any of our teams that matches your skills and experience. We encourage you to apply despite this, as we are continually evaluating for top talent to join our team. You are also welcome to reapply as you gain more experience, but we suggest only reapplying once per year. We may also put up separate, team-specific job postings . In those cases, the teams will give preference to candidates who apply to the team-specific postings, so if you are interested in a specific team please make sure to check for team-specific job postings! You may be a good fit if you: Have significant software engineering experience Are results-oriented, with a bias towards flexibility and impact Pick up slack, even if it goes outside your job description Enjoy pair programming (we love to pair!) Want to learn more about machine learning research Care about the societal impacts of your work Strong candidates may also have experience with: High performance, large-scale ML systems GPUs, Kubernetes, Pytorch, or OS internals Language modeling with transformers Reinforcement learning Large-scale ETL Representative projects: Optimizing the throughput of a new attention mechanism Comparing the compute efficiency of two Transformer variants Making a Wikipedia dataset in a format models can easily consume Scaling a distributed training job to thousands of GPUs Writing a design doc for fault tolerance strategies Creating an interactive visualization of attention between tokens in a language model The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: $350,000 — $500,000 USD Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

7d ago

Applied AI Engineer, CyberSecurity

M

Mistral AI· Paris

About Mistral    At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.   We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include Vibe, the AI assistant for life and work.   We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.   Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers.   Role summary We are seeking an Applied AI Engineer to build the customer-facing cyber service and use-cases on top of our cyber harnesses. You will turn "we have agents" into "we shipped something a client uses." This role is about composing red-team/blue-team agentic workflows, configuring harnesses for real-world scenarios, and delivering value directly to clients. Your work will bridge the gap between our foundational cyber harnesses and the agentic solutions that clients deploy. While SWE-Cyber makes the platform robust and scalable, and Pentesters run the solution and guide the direction, you will focus on building the agentic use-cases and client-facing service that solve real security problems.   What you will do Client-Facing Agentic Solutions • Compose red-team / blue-team agentic workflows for production use-cases • Configure harnesses for cloud defense, vulnerability scanning, dynamic red-teaming, and penetration testing scenarios • Work directly on client use-cases, translating security requirements into agentic solutions • Turn prototype agents into deployed services that clients rely on Context Engineering & Orchestration • Design and implement context engineering that enables agents to operate effectively in cybersecurity domains • Orchestrate multi-agent systems for complex security workflows • Build the agentic layer that sits between the harness and the client Service Delivery • Ship fast and iterate based on client feedback and real-world performance • Operate independently while leveraging internal building blocks and frameworks • Collaborate with pentesters to ensure domain accuracy and effectiveness • Partner with SWE-Cyber to ensure the platform supports your use-case requirements   About you • Strong applied AI engineer with hands-on experience building agents, LLM orchestration, context engineering, evals, and RAG systems • You ship fast, with a bias toward action and delivery • Operate independently with a customer-led mindset • Able to reuse internal components and bricks effectively • Some cyber context is a real plus, but not mandatory - we can pair you with pentesters for domain depth • Strong problem-solving abilities and attention to detail • Excellent communication skills and collaborative attitude It would be ideal if you also have: • Genuine cyber or pentest knowledge • Experience building agentic harnesses or multi-agent systems end-to-end • Strong background in evals and benchmarking of agent systems • Experience with security tooling or workflows • Prior work on production AI systems in regulated or high-stakes environments

Full-time

7d ago

Applied AI Architect (Startups)

A

Anthropic· Dublin, IE

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickly growing group of committed researchers, engineers, policy experts, and business leaders working together to build beneficial AI systems. About the role As an Applied AI Architect on the Startups segment at Anthropic, you will win the trust of founders and engineers by being an exceptional technical partner, helping startups successfully build on the Claude Developer Platform as they grow from early product to scale. You'll combine deep technical expertise with a builder-first mindset to help startups architect innovative LLM solutions, win technical evaluations, and get the most out of Claude. Working closely with Account Executives and the broader Sales, Product, and Engineering teams, you'll guide startups from initial technical discovery through successful deployment and beyond. You'll leverage your expertise to help founders understand Claude's capabilities, develop evals, and design architectures that maximize the value of our AI systems. Responsibilities: Partner with Account Executives to deeply understand startup requirements, translate them into technical solutions, and serve as a trusted technical advisor throughout the Claude adoption journey from initial evaluation through deployment and expansion Win technical evaluations that demonstrate why Claude is the best foundation for what startups are building, helping them develop evaluation frameworks to measure Claude's performance for their specific use cases Build technical credibility with founders, founding engineers, and startup engineering teams by speaking their language, understanding their build patterns, and guiding technical architecture decisions Gather insights on how startups are building with Claude, identify emerging use cases and deployment patterns, and deliver feedback to Product and Engineering teams Travel to customer sites, startup-focused events, and industry conferences for workshops, technical deep dives, and relationship building You may be a good fit if you have: 5+ years of experience in technical customer-facing roles such as Solutions Architect, Sales Engineer, Forward Deployed Engineer, or technical founder who has done founder-led sales Experience working with startups or high-growth technology companies; you understand the velocity and constraints of early-stage companies Strong builder credibility through experience as a software engineer or technical founder; you speak the language of builders and have the presence to earn the trust of technical founders and startup engineering teams Deep technical proficiency with building and scaling AI products, a pulse on AI engineering best practices, and the ability to demonstrate why Claude is the best foundation Hands-on experience building and deploying LLM-powered applications in production, with expertise in context engineering, evaluation frameworks, and modern AI architectures Strong technical communication skills with the ability to translate complex AI concepts into actionable insights for founders and engineering teams Track record of selling technical products in competitive markets Comfortable with Python and familiarity with common LLM frameworks, tools, and integration patterns Passion for making powerful technology safe and societally beneficial The annual compensation range for this role is listed below. For sales roles, the range provided is the role’s On Target Earnings ("OTE") range, meaning that the range includes both the sales commissions/sales bonuses target and annual base salary for the role. Annual Salary: €190.000 — €215.000 EUR Logistics Minimum education: Bachelor’s degree or an equivalent combination of education, training, and/or experience Required field of study: A field relevant to the role as demonstrated through coursework, training, or professional experience Minimum years of experience: Years of experience required will correlate with the internal job level requirements for the position Location-based hybrid policy: Currently, we expect all staff to be in one of our offices at least 25% of the time. However, some roles may require more time in our offices. Visa sponsorship: We do sponsor visas! However, we aren't able to successfully sponsor visas for every role and every candidate. But if we make you an offer, we will make every reasonable effort to get you a visa, and we retain an immigration lawyer to help with this. We encourage you to apply even if you do not believe you meet every single qualification. Not all strong candidates will meet every single qualification as listed. Research shows that people who identify as being from underrepresented groups are more prone to experiencing imposter syndrome and doubting the strength of their candidacy, so we urge you not to exclude yourself prematurely and to submit an application if you're interested in this work. We think AI systems like the ones we're building have enormous social and ethical implications. We think this makes representation even more important, and we strive to include a range of diverse perspectives on our team. Your safety matters to us. To protect yourself from potential scams, remember that Anthropic recruiters only contact you from @anthropic.com email addresses. In some cases, we may partner with vetted recruiting agencies who will identify themselves as working on behalf of Anthropic. Be cautious of emails from other domains. Legitimate Anthropic recruiters will never ask for money, fees, or banking information before your first day. If you're ever unsure about a communication, don't click any links—visit anthropic.com/careers directly for confirmed position openings. How we're different We believe that the highest-impact AI research will be big science. At Anthropic we work as a single cohesive team on just a few large-scale research efforts. And we value impact — advancing our long-term goals of steerable, trustworthy AI — rather than work on smaller and more specific puzzles. We view AI research as an empirical science, which has as much in common with physics and biology as with traditional efforts in computer science. We're an extremely collaborative group, and we host frequent research discussions to ensure that we are pursuing the highest-impact work at any given time. As such, we greatly value communication skills. The easiest way to understand our research directions is to read our recent research. This research continues many of the directions our team worked on prior to Anthropic, including: GPT-3, Circuit-Based Interpretability, Multimodal Neurons, Scaling Laws, AI & Compute, Concrete Problems in AI Safety, and Learning from Human Preferences. Come work with us! Anthropic is a public benefit corporation headquartered in San Francisco. We offer competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues. Guidance on Candidates' AI Usage: Learn about our policy for using AI in our application process.

8d ago

Prompt Engineer

O

OpenAI· Remote

Craft and evaluate prompts for production LLM applications. Deep knowledge of prompt engineering, fine-tuning, and OpenAI APIs required. Remote, $90 - $130 per hour.

$90k - $130k

Remote Contract OpenAIPrompt EngineeringFine-Tuning

8d ago

Get new AI jobs in your inbox

A weekly digest of the newest LLM, RAG, and AI agent engineering roles.

© 2026 AI Job Board. All rights reserved.