[Remote] Principal AI/ML Researcher / Engineer in Reasoning, Planning, and Decision-Making Systems

Remote | Full-time | Hiring now

Note: This is a remote position open to candidates in the USA. Airbnb is a global platform for unique stays and experiences, and it is seeking a Principal AI/ML Researcher/Engineer to advance its AI capabilities in reasoning, planning, and decision-making systems. The role involves designing and operationalizing intelligent decision-making frameworks, collaborating across disciplines, and leading the development of multi-agent systems to improve decision quality and adaptive personalization for guests and hosts.

Responsibilities

Drive foundational and applied research in reasoning engines, planning architectures, and decision-making frameworks at scale, incorporating generative AI into the ranking, recommendation, and personalization stack at both the single-model and multi-agent (system) level. The objective is to grow the business (new-user growth, abandoned users, long-tail users) in existing and new business areas while supporting multimodal NL-to-conversational interfaces
Advance techniques in LLM/LRM post-training, reinforcement learning–based decisioning, and knowledge-integrated agents
Design methods for plan induction, value estimation, and contingency modeling within intelligent agents
Explore and validate protocols for distributed reasoning and joint planning among cooperative agents in multi-agent systems
Architect reasoning, planning, and decision-making (RPD) systems that integrate post-trained LLMs/LRMs, graph-structured memory (e.g., KGs), and RL-driven controllers
Design recursive task planners, search-based or policy-based reasoners, and belief-state trackers that can interoperate with large model substrates
Ensure modularity and extensibility through multi-agent frameworks, agentic substrates, and declarative planning pipelines
Define communication protocols, coordination strategies, and cross-agent knowledge alignment mechanisms to foster emergent cooperative intelligence
Build and evolve stateful, dynamic models that combine supervised learning with online/offline reinforcement, simulation-based rollouts, and symbol grounding
Implement hybrid pipelines that couple learned embeddings, prompted generative models, and graph-theoretic inference
Optimize systems for adaptive exploration, planning horizon control, and policy robustness
Develop frameworks for distributed value propagation, multi-agent credit assignment, and global planning from local agents
Set direction for planning/reasoning infrastructure within the AI/ML platform strategy
Serve as the technical conscience and architectural leader across high-stakes AI initiatives involving autonomous agents or high-fidelity decision pipelines
Mentor teams in systems thinking, causal modeling, symbolic-connectionist integrations, and long-term planning under uncertainty
Lead development of multi-agent reasoning systems, defining principles for inter-agent knowledge exchange, goal delegation, and cooperative decision resolution
Work across disciplines—product, infra, and design—to translate ambiguous product intent into multi-stage reasoning pipelines
Partner with researchers, ontologists, and ML engineers to encode world knowledge, goals, and values into usable inference artifacts
Contribute to a company-wide understanding of what it means to make intelligent choices, not just predictions
Collaborate with internal teams on distributed agent coordination, shared memory protocols, and policy harmonization across decision surfaces
Productionize real-time reasoning loops with low-latency inference, caching, retrieval-augmented generation, and streaming updates to symbolic memory
Deploy post-training hooks for inserting logic, constraints, and domain priors into existing large models
Create advanced monitoring, attribution, and evaluation pipelines for agent behavior and decision quality
Operationalize multi-agent orchestration, ensuring reliable and fault-tolerant communication and decision propagation

Skills

Master's degree or equivalent in Computer Science, AI, Cognitive Science, or related fields
Recent published work or patents in AI, Cognitive Science, or related fields
15+ years in AI/ML, including post-training architectures and production-scale reasoning systems
Advanced coding proficiency in Java, Python, C++, or similar, with experience in ML/RL frameworks (e.g., PyTorch, Ray, JAX, RLlib) at scale
Proven experience integrating LLMs/LRMs with Knowledge Graphs or structured world models
Deep understanding of Reinforcement Learning and its application to decisioning and planning
Fluency in hybrid model architectures: connectionist-symbolic fusion, retrieval-based agents, or goal-directed transformers
Experience working on multi-agent coordination, distributed RL, or cooperative inference systems
Ph.D. in AI, Machine Learning, Robotics, Cognitive Systems, or related areas
Published work or patents in multi-agent reasoning, plan synthesis, knowledge-augmented learning, or generative control
Experience in cognitive architectures, neuro-symbolic systems, or agent-based simulation environments
Demonstrated ability to lead cross-functional research-to-production transitions
Experience with memory architectures, task graphs, or semantic program induction
Prior work on distributed intelligence platforms with explicit agent interaction models and collective decision-making logic

Benefits

This role may also be eligible for bonus, equity, benefits, and Employee Travel Credits.

Company Overview

Airbnb is an online community marketplace for people to list, discover, and book accommodations through mobile phones or the Internet. It was founded in 2008 and is headquartered in San Francisco, California, USA, with a workforce of 5,001 to 10,000 employees. Its website is https://www.airbnb.com.

Company H1B Sponsorship

Airbnb has a track record of offering H1B sponsorships, with 27 in 2026, 234 in 2025, 176 in 2024, 160 in 2023, 270 in 2022, 250 in 2021, and 274 in 2020. Please note that this does not guarantee sponsorship for this specific role.

