๐ Home
๐ญ About
๐บ Programs
Overview
๐งช Open Source Research Experience
๐งช Summer of Reproducibility
๐ชบ Open Source Incubator Fellowship
๐ Open Source Education
๐ Resources
๐ Blog
๐ช Events
machine learning
RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics
Hi everyone! My name is Zeyu, and I will be working on a project for a retrieval-enhanced generative framework for spatial transcriptomics during Google Summer of Code 2025. My project is called RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics and is supervised by Ziheng Duan.
Zeyu Zou
Last updated on Jun 19, 2025
EnvGym โ An AI System for Reproducible Custom Computing Environments
Hello, My name is Yiming Cheng. I am a Pre-doc researcher in Computer Science at University of Chicago. I’m excited to be working with the Summer of Reproducibility and the Chameleon Cloud community as a project leader.
Yiming Cheng
Last updated on Jul 11, 2025
osre25
,
reproducibility
,
EnvGym
Smart Environments โ An AI System for Reproducible Custom Computing Environments
Hi everyone, I’m Sam! I’m excited to be working with the Argonne National Laboratory and SoR this summer on Smart Environments. Have you ever encountered a great opensource project and wanted to run it or use it locally, only to find that it’s such a headache to set up all the dependencies?
Sam Huang
,
Paul Marshall
Last updated on Jun 16, 2025
Applying MLOps to overcome reproducibility barriers in machine learning research
Topics: machine learning, MLOps, reproducibility Skills: Python, machine learning, GitOps, systems, Linux, data, Docker Difficulty: Hard Size: Large (350 hours) Mentors: Fraida Fund and Mohamed Saeed Project Idea Description
Fraida Fund
Smart Environments โ An AI System for Reproducible Custom Computing Environments
Overview The complexity of environment setup and the expertise required to configure specialized software stacks can often hinder efforts to reproduce important scientific achievements in HPC and systems studies. Researchers often struggle with incomplete or ambiguous artifact descriptions that make assumptions about “common knowledge” that is actually specific domain expertise.
Paul Marshall
,
Yiming Cheng
Smart Environments โ An AI System for Reproducible Custom Computing Environments
Overview The complexity of environment setup and the expertise required to configure specialized software stacks can often hinder efforts to reproduce important scientific achievements in HPC and systems studies. Researchers often struggle with incomplete or ambiguous artifact descriptions that make assumptions about “common knowledge” that is actually specific domain expertise.
Paul Marshall
Disentangled Generation and Editing of Pathology Images
Topics: computational pathology, image generation, disentangled representations, latent space manipulation, deep learning Skills: Programming Languages: Proficient in Python, with experience in machine learning libraries such as PyTorch or TensorFlow. Generative Models: Familiarity with Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), and contrastive learning methods.
Xi Li
RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics
Topics: bioinformatics, spatial transcriptomics, gene expression generation, retrieval-augmented generation, large models Skills: Programming Languages: Proficient in Python, and familiarity with machine learning libraries such as PyTorch. Data Analysis: Experience with spatial transcriptomics datasets and statistical modeling.
Ziheng Duan
ML-Powered Problem Detection in Chameleon
Hello! My name is Syed Mohammad Qasim, a PhD candidate at the Department of Electrical and Computer Engineering, Boston University. This summer I worked on the project ML-Powered Problem Detection in Chameleon as part of the Summer of Reproducibility (SoR) program with the mentorship of Ayse Coskun and Michael Sherman.
Syed Mohammad Qasim
Last updated on Dec 6, 2024
SoR
Final Blog: BenchmarkST: Cross-Platform, Multi-Species Spatial Transcriptomics Gene Imputation Benchmarking
Hello! I’m Qianru! I have been contributing to the BenchmarkST: Cross-Platform, Multi-Species Spatial Transcriptomics Gene Imputation Benchmarking project under the mentorship of Ziheng Duan. My project aims to provide a standardized, easily accessible evaluation framework for gene imputation in spatial transcriptomics.
Qianru Zhang
Last updated on Jan 25, 2025
osre24
,
reproducibility
»
Cite
×