machine learning

RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics

Hi everyone! My name is Zeyu, and I will be working on a project for a retrieval-enhanced generative framework for spatial transcriptomics during Google Summer of Code 2025. My project is called RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics and is supervised by Ziheng Duan.

Last updated on Jun 19, 2025

RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics

EnvGym – An AI System for Reproducible Custom Computing Environments

Hello, My name is Yiming Cheng. I am a Pre-doc researcher in Computer Science at University of Chicago. I’m excited to be working with the Summer of Reproducibility and the Chameleon Cloud community as a project leader.

Last updated on Jul 11, 2025 osre25, reproducibility, EnvGym

EnvGym – An AI System for Reproducible Custom Computing Environments

Smart Environments – An AI System for Reproducible Custom Computing Environments

Hi everyone, I’m Sam! I’m excited to be working with the Argonne National Laboratory and SoR this summer on Smart Environments. Have you ever encountered a great opensource project and wanted to run it or use it locally, only to find that it’s such a headache to set up all the dependencies?

Sam Huang, Paul Marshall

Last updated on Jun 16, 2025

Smart Environments – An AI System for Reproducible Custom Computing Environments

Applying MLOps to overcome reproducibility barriers in machine learning research

Topics: machine learning, MLOps, reproducibility Skills: Python, machine learning, GitOps, systems, Linux, data, Docker Difficulty: Hard Size: Large (350 hours) Mentors: Fraida Fund and Mohamed Saeed Project Idea Description

Applying MLOps to overcome reproducibility barriers in machine learning research

Smart Environments – An AI System for Reproducible Custom Computing Environments

Overview The complexity of environment setup and the expertise required to configure specialized software stacks can often hinder efforts to reproduce important scientific achievements in HPC and systems studies. Researchers often struggle with incomplete or ambiguous artifact descriptions that make assumptions about “common knowledge” that is actually specific domain expertise.

Paul Marshall, Yiming Cheng

Smart Environments – An AI System for Reproducible Custom Computing Environments

Smart Environments – An AI System for Reproducible Custom Computing Environments

Overview The complexity of environment setup and the expertise required to configure specialized software stacks can often hinder efforts to reproduce important scientific achievements in HPC and systems studies. Researchers often struggle with incomplete or ambiguous artifact descriptions that make assumptions about “common knowledge” that is actually specific domain expertise.

Smart Environments – An AI System for Reproducible Custom Computing Environments

Disentangled Generation and Editing of Pathology Images

Topics: computational pathology, image generation, disentangled representations, latent space manipulation, deep learning Skills: Programming Languages: Proficient in Python, with experience in machine learning libraries such as PyTorch or TensorFlow. Generative Models: Familiarity with Variational Autoencoders (VAEs), Generative Adversarial Networks (GANs), and contrastive learning methods.

Disentangled Generation and Editing of Pathology Images

RAG-ST: Retrieval-Augmented Generation for Spatial Transcriptomics

Topics: bioinformatics, spatial transcriptomics, gene expression generation, retrieval-augmented generation, large models Skills: Programming Languages: Proficient in Python, and familiarity with machine learning libraries such as PyTorch. Data Analysis: Experience with spatial transcriptomics datasets and statistical modeling.

ML-Powered Problem Detection in Chameleon

Hello! My name is Syed Mohammad Qasim, a PhD candidate at the Department of Electrical and Computer Engineering, Boston University. This summer I worked on the project ML-Powered Problem Detection in Chameleon as part of the Summer of Reproducibility (SoR) program with the mentorship of Ayse Coskun and Michael Sherman.

Syed Mohammad Qasim

Last updated on Dec 6, 2024 SoR

Final Blog: BenchmarkST: Cross-Platform, Multi-Species Spatial Transcriptomics Gene Imputation Benchmarking

Hello! I’m Qianru! I have been contributing to the BenchmarkST: Cross-Platform, Multi-Species Spatial Transcriptomics Gene Imputation Benchmarking project under the mentorship of Ziheng Duan. My project aims to provide a standardized, easily accessible evaluation framework for gene imputation in spatial transcriptomics.

Last updated on Jan 25, 2025 osre24, reproducibility