How can we turn the vast and rapidly growing collections of publicly-available biological data into reusable engines for discovery? And how do we build the software infrastructure that makes that reuse possible at scale — not just once, but sustainably, across the research community?
The Davis (seandavi.github.io) and Krishnan (www.thekrishnanlab.org) labs work at the intersection of these questions. We develop ML/AI methods and open-source software and data infrastructure that make massive public biological data collections reusable and turn them into platforms for biological discovery. A central theme in our groups is the development of transferable learned representations from large-scale public omics data and metadata and the software infrastructure needed to build, maintain, and distribute them at community scale, including through active contributions to the Bioconductor ecosystem.
THE POSITION
============
We are seeking a postdoctoral researcher who is excited to engage deeply with the scientific questions that animate our groups and help push them forward in creative and impactful ways. This is not a narrowly defined project-based position. Instead, we are looking for someone who is interested in contributing to the broader intellectual life of the labs while developing their own research directions.
Postdocs in our groups are encouraged to propose new ideas, shape research directions, and participate actively in the scientific discussions that drive the lab’s work. We also see postdocs as important mentors and role models for graduate students in the Krishnan lab, helping elevate the scientific thinking and rigor of the group. Through the Davis lab, the postdoc will also join a multi-site collaborative group of Bioconductor developers to help drive informatics tooling for cancer genomics.
WHAT WE’RE LOOKING FOR
=======================
We are particularly interested in candidates who:
- Think deeply about scientific questions and enjoy identifying important problems
- Demonstrate intellectual independence and initiative
- Value careful reasoning, rigorous analysis, and clear scientific communication
- See software as a scientific contribution that enhances research
- Enjoy collaborative science and exchanging ideas across projects
- Are interested in mentoring and supporting graduate students
- Can translate ideas into finished scientific work, including the potential for published software as well as papers
The most successful postdocs in our groups tend to be those who enjoy proposing new ideas, questioning assumptions, helping push the scientific conversation forward, and who find satisfaction in turning their ideas into useful tools.
WHAT WE PROVIDE
================
Our labs aim to create an environment where postdoctoral researchers can do their best work and grow into independent scientific leaders in academia or industry. Postdocs are treated as emerging colleagues and intellectual partners, with opportunities to develop and lead research directions aligned with the lab’s broader themes.
Working across both groups, a postdoc can engage with the full stack of modern computational biology: from data engineering and large-scale infrastructure to method development and biological interpretation. They will have access to established open-source software ecosystems with real user communities, as well as the methods expertise and collaborative environment of an active computational biology group. We emphasize thoughtful scientific discussion, intellectual generosity, and rigorous analysis — and we believe that ML methods development and software infrastructure are equally important scientific contributions.
Minimum Qualifications:
● PhD (or equivalent) in Computational Biology / Bioinformatics, Biomedical Informatics, Computer Science, Statistics / Biostatistics, Applied Mathematics, or a closely related discipline
● Demonstrated research productivity, evidenced by ≥1 first-author peer-reviewed publication or preprint in a relevant area (computational biology, ML, genomics, etc.)
● Strong computational background, including:
- 2 years of experience in programming (e.g., Python, R, Julia, or similar);
- Evidence of working with large-scale datasets (e.g., genomics, transcriptomics, or other high-dimensional data);
Preferred Qualifications:
● ≥2 first-author publications or a strong publication record in relevant areas
● Experience developing or applying advanced ML/AI methods (e.g., deep learning, generative models, representation learning)
● Experience working with large public biological datasets/repositories (e.g., GEO, SRA, UK Biobank, GTEx, etc.)
● Demonstrated experience in method and tool development (e.g., new algorithms, tools, or computational frameworks)
● Evidence of interdisciplinary research, bridging computational and biological domains
● Prior experience mentoring junior researchers (e.g., listed supervision, co-authorship patterns)
● Evidence of collaborative research, including multi-author or cross-group projects
● Experience with machine learning, AI, statistical modeling, or data analysis, demonstrated through publications, projects, or dissertation work
● Track record of completing research projects, as evidenced by publications, preprints, or clearly defined completed work on CV and Google Scholar
● Contributions to open-source software, reproducible pipelines, or publicly available tools (e.g., GitHub repositories)
Find more details at the CU Careers portal (cu.taleo.net/careersection/2/jobdetail.ftl?job=39540&lang=en) and apply there with:
- Detailed CV with a link to your Google Scholar page and GitHub profile
- 2 page research statement describing past research and future directions
- 1 page description of one representative paper explaining the scientific question, your individual contribution, what results and insights came out of your work, and why they matter
- Names and contact information of three references (contacted only for finalists)