Open Source Contributor
email: jiefu[dot]zhang1226[at]gmail[dot]com
Education: Bachelor of Physics, Nagoya University, Japan, 2023 - 2027
Ongoing project:
Enhance and develop GeneROOT infrastructure
This project builds on the GeneROOT RNTuple foundation established in GSoC
2025. The work expands the benchmark suite to run reproducibly across heavy
bioinformatics datasets, and introduces cross-format comparisons against
SAM, BAM, and CRAM. On the performance side, it optimizes RNTuple’s
indexing and search by replacing the linear scan in region queries with a
logarithmic binary search, eliminating redundant index structures, and
exposing configurable index granularity. It also evaluates modern genomic
compression schemes (Crumble, QVZ, CALQ, P-block, etc.) for quality-score
data and integrates the most effective candidates. Finally, it brings
RAMtools to feature parity with SAMtools by adding Stat, IdxStats,
FlagStat, View, Split, Merge, and Sort.
Project Proposal: URL
Mentors: Martin Vassilev, Vassil Vassilev, Aaron Jomy