Introduction - Willing To Contribute (GSOC'26) #1179
Replies: 2 comments 2 replies
-
|
Hi Vinit, Thank you for your help. I think you have found ways to contribute without my help. |
Beta Was this translation helpful? Give feedback.
-
|
Hi everyone, I’m a Master’s student in Data Science at City, University of London, and I’m planning to apply to GSoC 2026 for the project “Building a machine-learning taxon classifier for genomic classification in malaria mosquitoes.” From the project description, I understand that the goal is to classify mosquito samples directly from raw FASTQ sequencing data without relying on full variant-calling pipelines. I’m currently trying to understand the best way to represent raw DNA sequences as input features for machine learning models. I wanted to ask:
I’d really appreciate any guidance on aligning the approach with existing workflows. Thank you! |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @jonbrenas @tristanpwdennis,
My name is Vinit Jain, and I’m a 3rd year student currently focusing on Python and machine learning, with a growing interest in bioinformatics applications.
I’ve been exploring the Building a machine-learning taxon classifier for genomic classification in malaria mosquitoes project and found the problem of FASTQ-based classification particularly interesting—especially the idea of avoiding full variant calling pipelines and working directly with raw sequencing reads.
I’ve started going through the malariagen-data-python repository to better understand the existing API and data workflows, and I’m currently identifying areas where I can make a meaningful first contribution. I’m also beginning to explore approaches such as k-mer based representations and lightweight classification models as a potential direction for the project.
I plan to submit a PR soon and would really appreciate any guidance on areas where contributions would be most valuable for new contributors.
Looking forward to learning and contributing!
Thanks,
Vinit Jain
Github: @vinitjain2005
Beta Was this translation helpful? Give feedback.
All reactions