The recent emergence of high-throughput sequencing technologies has revolutionized genomics by providing a new wealth of data for biologists to learn from. Projects such as the cancer genome atlas project (TCGA) has sequenced hundreds of tumors from various cancer types, and projects such as the Genome10K will sequence the genomes of thousands of different species. However, our ability to interpret these datasets is currently limited due to their size and complexity.
In my lab, we work on developing algorithms, data structures, and computational tools to investigate the biological meaning behind these large datasets. There are several projects available, tailored to the student’s strengths and preferences. I am looking for extremely motivated computer science (or closely related) students with capacity for independent research, and exceptional analytical and programming skills. Students should have the objective to turn their project into a published research paper. No prior knowledge of biology is required, but could be helpful.