Machine Learning Scientist

Calico Life Sciences
Research
South San Francisco, California, United States
grnh.se/4abe14c42us

Description

Who We Are:

Calico (Calico Life Sciences LLC) is an Alphabet-founded research and development company whose mission is to harness advanced technologies and model systems to increase our understanding of the biology that controls human aging. Calico will use that knowledge to devise interventions that enable people to lead longer and healthier lives. Calico’s highly innovative technology labs, its commitment to curiosity-driven discovery science, and, with academic and industry partners, its vibrant drug-development pipeline, together create an inspiring and exciting place to catalyze and enable medical breakthroughs.

Position Description:

Calico is seeking a Machine Learning Scientist to join a team investigating aging and disease using genetics and functional genomics in humans and model systems. Genomics offers rich, informative profiles of organism state throughout life. We collect these data to build causal models of organism state that reveal the critical underlying cellular processes and highlight promising intervention points. The ideal candidate is passionate about unraveling how genome sequence determines function and understanding the adverse transformations that accompany aging.

Predictive modeling with cutting-edge machine learning tools is at the core of this work, as exemplified by recent publications that use deep learning to model regulatory activity as a function of DNA sequence.

• Avsec, Ž. et al. Effective gene expression prediction from sequence by integrating long-range interactions. Nat Methods 18, 1196–1203 (2021).
• Yuan, H. & Kelley, D. R. scBasset: sequence-based modeling of single-cell ATAC-seq using convolutional neural networks. Nat Methods 19, 1088–1096 (2022).
• Linder, J., Srivastava, D., Yuan, H., Agarwal, V. & Kelley, D. R. Predicting RNA-seq coverage from DNA sequence as a unifying model of gene regulation. bioRxiv (2023).

This work benefits from world-class computing infrastructure and continued collaboration with top machine learning researchers within Alphabet.

Position Responsibilities:

• Develop novel computational methods for biological sequence analysis, single cell genomics, and statistical genetics
• Interact closely with experimental biologists to analyze large-scale genomics data to reach biological insights that inform Calico’s pursuit
• Communicate research via publications, presentations, web interfaces, and open software


Qualifications

• PhD in computational biology, bioinformatics, computer science, or a related discipline
• Strong computational fundamentals including machine learning, algorithms, data structures, and statistics
• Strong molecular biology and genetics knowledge and interest; familiarity with biological data resources
• Fluent coding in at least one common bioinformatics programming language, preferably Python
• Experience analyzing genomics experiments (e.g., RNA-seq, ATAC-seq, ChIP-seq, scRNA-seq, scATAC-seq)
• Experience applying machine learning to biological problems
• Intellectual curiosity, attention to detail, consistent follow-through, a proactive approach to collaborations, and a demonstrated ability to work in a team environment
• Must be willing to work onsite at least three days per week


Start date

To be determined

How to Apply

Please apply directly: grnh.se/4abe14c42us