[Remote] Research Scientist, Data

Work from home Full-time role Hiring

Note: This is a remote position open to candidates in the USA. Pika is pioneering the next generation of creative infrastructure built around real-time, multimodal generation and intelligent agentic platforms. The company is looking for a staff- or lead-level Research Engineer, Data to architect and scale the data engineering systems that support model training for advanced multimodal foundation models.

Responsibilities

  • Take ownership of large-scale data pipeline architecture and implementation to support model training and research workflows for text, image, audio, and video datasets
  • Partner with research and engineering teams to curate, clean, and manage diverse, sensory-rich datasets for pre-training and mid-training of multimodal models
  • Develop strategies and tools for scalable data ingestion, labeling, filtering, augmentation, and storage
  • Ensure data quality, reliability, and compliance, including managing privacy and ethical considerations throughout the data lifecycle
  • Optimize data processing, transformation, and delivery for large-scale distributed training pipelines
  • Prototype and productionize new methods for dataset creation, management, and continuous improvement in response to researcher needs
  • Contribute to the integration of research-driven data advancements into production-ready systems
  • Stay informed on emerging data engineering and ML data management developments, bringing best practices to our systems

Skills

  • 5+ years of experience building and scaling data pipelines for machine learning applications at a staff or lead engineer level, ideally in research or model training environments
  • Strong background in data engineering and ML data curation for LLMs, VLMs, or other large-scale multimodal models
  • Expertise in distributed data systems (e.g., Spark, Hadoop, Ray, or similar) and efficient large dataset processing/ETL workflows
  • Proven ability to build robust, scalable, and production-grade data infrastructure for ML pipelines
  • Experience developing tools for data labeling, filtering, deduplication, quality assurance, and dataset management
  • Strong programming skills (Python, SQL, PySpark, or similar) and familiarity with cloud data platforms (AWS, GCP, Azure)
  • Knowledge of privacy, compliance, ethics, and best practices in data collection and management
  • Excellent cross-functional collaboration, problem-solving, and communication skills
  • Passion for enabling cutting-edge generative AI and creative technology through data excellence

Benefits

  • Competitive salary and substantial equity in a high-growth startup
  • Full health benefits, 401(k) matching, and more
  • Collaborative, mission-driven team environment with major growth opportunities
  • Flexible on-site/remote hybrid (HQ in Palo Alto, CA)

Company Overview

  • Pika is an AI platform that lets users create videos from text prompts, with text-to-video, image-to-video, and editing tools. It was founded in 2023 and is headquartered in Palo Alto, California, USA, with a workforce of 2-10 employees. Its website is https://pika.art.

Company H1B Sponsorship

  • Pika has a track record of offering H1B sponsorships, with 9 in 2025. Please note that this does not guarantee sponsorship for this specific role.