Data Scientist - Remote
Join the transformative team at City of Hope, where we're changing lives and making a real difference in the fight against cancer, diabetes, and other life-threatening illnesses. City of Hope’s growing national system includes its Los Angeles campus, a network of clinical care locations across Southern California, a new cancer center in Orange County, California, and treatment facilities in Atlanta, Chicago and Phoenix. Our dedicated and compassionate employees are driven by a common mission: To deliver the cures of tomorrow to the people who need them today. • * This is a Fully Remote Opportunity within the United States** As a successful candidate, you will: The Data Scientist will design, develop, and deploy advanced machine learning and artificial intelligence solutions to support cancer research and clinical operations. This role integrates data science, AI engineering, and ML systems development, including deep learning architectures and large language model (LLM) applications such as retrieval-augmented generation (RAG), agent-based systems, and domain-specific model adaptation. The position also involves development of multimodal AI models that integrate structured clinical data, unstructured text, medical imaging, and genomic data. The role requires advanced theoretical and applied knowledge in machine learning, distributed systems, and modern AI frameworks. • Collaborate with administrative leaders, clinicians and IT specialists • Query large datasets from multiple systems • Design, develop, validate, and deploy machine learning and deep learning models, including transformer-based architectures, multimodal neural networks, and large language model (LLM) systems • Architect, deploy, and monitor production-grade AI systems, including LLM-based RAG pipelines, agent frameworks, and multimodal inference systems in cloud or distributed environments. • Produce technical documentation, visualizations and presentations • Lead technical design and implementation of complex AI initiatives involving cross-functional research, clinical, and engineering teams. • Completes own work independently • Design multimodal AI solutions integrating text, imaging, genomic, and structured clinical data using deep learning and transformer-based models. • Performs other related duties as assigned or requested. Your qualifications should include: • Master’s degree in Computer Science, Data Science, Machine Learning, Artificial Intelligence, Statistics, Applied Mathematics, Bioinformatics, Engineering, or a closely related quantitative field ; or Bachelor’s degree with 3+ years of experience. Preferred: • Familiarity with a health care environment and EHR data • Familiarity with multimodal data such as genomics and medical images • Background in deep neural networks and Natural Language Processing (NLP) • Familiarity with Cloud services platforms like Microsoft Azure • Experience with Generative AI and Large Language Models (LLMs), including RAG, fine-tuning, embedding models, prompt engineering, and integration into agentic data science workflows. • Proficiency with version control systems (Git) and collaborative development workflows • Awareness of the most recent trends and developments in the Data Science / Machine Learning fields Skills: • Experience with relational databases and SQL queries • Proficient in Python and related data and ML packages • Proficient in standard data science workflows, including training, evaluating/testing, and deploying machine learning and deep learning models • Familiarity with LLM frameworks and packages such as LangChain, LangGraph, for building advanced AI solution • Ability to work independently and use creative approaches to problem-solving • Capable of turning questions into testable hypotheses, extracting data from complex databases, creating and evaluating statistical models • Experience in data visualization and ability to communicate results to both technical and non-technical audiences • Experience in authoring and contributing to peer-reviewed scientific publications • Experience in completing end to end projects with minimal to no supervision • Experience in being a part of big projects with many internal and external teams involved • Experience managing collaborative data science projects We currently have 4 Openings City of Hope employees pay is based on the following criteria: work experience, qualifications, and work location. City of Hope is an equal opportunity employer. To learn more about our Comprehensive Benefits, please CLICK HERE.
City of Hope is seeking a Data Scientist for a fully remote position in the United States. The role involves designing, developing, and deploying advanced machine learning and AI solutions to support cancer research and clinical operations.
Responsibilities
- Collaborate with administrative leaders, clinicians, and IT specialists.
- Query large datasets from multiple systems.
- Design, develop, validate, and deploy machine learning and deep learning models.
- Architect, deploy, and monitor production-grade AI systems.
- Produce technical documentation, visualizations, and presentations.
- Lead technical design and implementation of complex AI initiatives.
- Complete work independently.
- Design multimodal AI solutions integrating various data types.
- Perform other related duties as assigned.
Requirements
- Master’s degree in Computer Science, Data Science, Machine Learning, Artificial Intelligence, Statistics, Applied Mathematics, Bioinformatics, Engineering, or a related field; or Bachelor’s degree with 3+ years of experience.
- Experience with relational databases and SQL queries.
- Proficient in Python and related data and ML packages.
- Proficient in standard data science workflows.
- Familiarity with LLM frameworks and packages.
- Ability to work independently and creatively solve problems.
- Experience in data visualization and communicating results.
Nice to Have
- Familiarity with a healthcare environment and EHR data.
- Experience with Generative AI and Large Language Models (LLMs).
- Proficiency with version control systems (Git).
- Awareness of recent trends in Data Science and Machine Learning.