A Senior Bioinformatics Programmer position is available in Dr. Bing Zhang’s laboratory to work on bioinformatics data processing and analysis. Our research is well funded by the National Cancer Institute (NCI), the Cancer Prevention and Research Institute of Texas (CPRIT), and the McNair Medical Institute at The Robert and Janice McNair Foundation. We are involved in several large projects that generate multidimensional omics data at the DNA, mRNA, protein, and clinical phenotype levels from human tumors, patient-derived xenografts (PDXs), and cancer cell lines to drive precision oncology discovery and practice. The successful candidate will play a critical role in acquiring, processing, and analyzing data from local and external sources to enable efficient data integration and data mining.
The position offers an excellent opportunity to conduct research in a supportive and stimulating environment, and to collaborate with bioinformaticians, biostatisticians, computer scientists, biologists, and clinicians.
ESSENTIAL FUNCTIONS AND DUTIES
- Designs, codes, tests, debugs, and maintains scripts and pipelines for retrieving data from various biological and biomedical databases based on project requirements
- Designs, codes, tests, debugs, and maintains scripts and pipelines for processing data to the proper format or structure for downstream analysis or uploading to databases
- Prepares detailed documentation for implemented scripts and pipelines
- Performs basic differential expression, clustering, and gene set enrichment analysis
- Performs other job-related duties as assigned
QUALIFICATIONS
- Bachelor's degree in Computer Science, Bioinformatics, or a related field
- Two years of experience in bioinformatics data processing, or a Master's degree with relevant project experience
- Proficiency in scripting languages such as Python, Perl, Bash, and R
- Familiarity with regular expressions
- Experience with the Unix/Linux environment, including basic commands and shell scripting
- Strong communication skills
- Familiarity with common bioinformatics resources such as the NCBI databases, Ensembl, KEGG, GO, and the UCSC Genome Browser
- Basic understanding of next-generation sequencing, transcriptome profiling, and proteome profiling technologies and of the data derived from them
- Good understanding of statistical techniques for differential expression, clustering, and gene set enrichment analysis
- Experience with version control systems, preferably Git