HDF5 Data Format Introduction

HDF5 (Hierarchical Data Format version 5) is a file format designed for efficiently storing and organizing large, complex datasets. It uses a hierarchical structure of **groups** (like directories) and **datasets** (like files) to store data, supporting multidimensional arrays, metadata, and a wide variety of data types. Key advantages include **compression**, **cross-platform compatibility**, and the ability to handle large datasets that don’t fit in memory. It’s widely used in fields like scientific computing, machine learning, and bioinformatics due to its efficiency and flexibility.
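
As a minimal sketch of the group/dataset hierarchy described above, using the h5py library (the file name, group, and dataset names are illustrative):

```python
import h5py
import numpy as np

# Create a file with a group (like a directory), a compressed
# dataset (like a file), and metadata stored as attributes
with h5py.File("example.h5", "w") as f:
    grp = f.create_group("experiment_1")
    dset = grp.create_dataset("signals",
                              data=np.random.rand(1000, 256),
                              compression="gzip")
    dset.attrs["units"] = "mV"

# Read it back; slicing reads only the requested part from disk,
# which is how HDF5 handles datasets larger than memory
with h5py.File("example.h5", "r") as f:
    head = f["experiment_1/signals"][:10]
    print(f["experiment_1/signals"].attrs["units"], head.shape)
```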
Read more

NCBI Data Submit with FTP/ASCP

ASCP (Aspera Secure Copy Protocol) is a fast, reliable protocol for transferring large files, particularly over long distances or under network latency and packet loss. It uses a technology called FASP (Fast, Adaptive, and Secure Protocol) to maximize available bandwidth, making transfers faster than traditional methods like FTP.
For uploading data to NCBI, ASCP is particularly useful because it efficiently handles large datasets, such as genomic sequences or omics data. Its ability to resume interrupted transfers ensures that if a connection fails during an upload, the transfer continues from where it left off, saving time and bandwidth. ASCP also provides strong encryption, ensuring data security during the upload process.
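
As an illustrative sketch (not from the original post), an upload can be scripted from Python with subprocess. The key file, local directory, and uploads subfolder below are placeholders that NCBI assigns per submission account; check NCBI's current submission instructions for the exact command:

```python
import subprocess

# Flags as commonly shown in NCBI's posted ascp command:
#   -i   Aspera key file supplied by NCBI
#   -QT  fair-share transfer policy, encryption disabled in transit
#   -l   cap the transfer rate (here 100 Mbps)
#   -k1  resume interrupted transfers instead of restarting
#   -d   create the destination directory if needed
cmd = [
    "ascp", "-i", "/path/to/aspera.openssh",
    "-QT", "-l100m", "-k1", "-d",
    "/path/to/local_data_dir",
    "subasp@upload.ncbi.nlm.nih.gov:uploads/your_assigned_folder/",
]
subprocess.run(cmd, check=True)  # raises if the transfer fails
```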
Read more

Softmax

Softmax is a mathematical function commonly used in machine learning, particularly in the context of classification problems. It transforms a vector of raw scores, often called logits, from a model into a vector of probabilities that sum to one. The probabilities generated by the softmax function represent the likelihood of each class being the correct classification. $$\sigma(\mathbf{z})_i = \frac{e^{z_i}}{\sum_{j=1}^K e^{z_j}}$$
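
A minimal NumPy implementation of the formula above (not from the original post); subtracting the maximum logit leaves the result unchanged but avoids overflow in the exponential:

```python
import numpy as np

def softmax(z):
    # exp(z - max(z)) equals exp(z) up to a common factor,
    # which cancels in the normalization
    e = np.exp(z - np.max(z))
    return e / e.sum()

logits = np.array([2.0, 1.0, 0.1])
probs = softmax(logits)
print(probs)        # ~[0.659 0.242 0.099]
print(probs.sum())  # 1.0
```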
Read more

Support Vector Machine

Support Vector Machine (SVM) is a supervised learning algorithm used for classification and regression. It finds the hyperplane that separates the data into different classes with the largest possible margin. SVM works well with high-dimensional data and can use different kernel functions to transform data for better separation when it is not linearly separable. $$f(x) = \operatorname{sign}(\mathbf{w}^T x + b)$$
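
A minimal sketch using scikit-learn's SVC on a synthetic dataset that is not linearly separable (the dataset and hyperparameters are illustrative):

```python
from sklearn.datasets import make_moons
from sklearn.svm import SVC

# Two interleaving half-moons: not separable by a straight line
X, y = make_moons(n_samples=200, noise=0.15, random_state=0)

# The RBF kernel implicitly maps points into a higher-dimensional
# space where a separating hyperplane exists
clf = SVC(kernel="rbf", C=1.0).fit(X, y)
print(clf.score(X, y))               # training accuracy
print(len(clf.support_vectors_))     # points defining the margin
```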
Read more

Random Forest

Random Forest is an ensemble machine learning algorithm that builds multiple decision trees during training and merges their outputs to improve accuracy and reduce overfitting. It is commonly used for both classification and regression tasks. By averaging the predictions of several decision trees, Random Forest reduces the variance and increases model robustness, making it less prone to errors from noisy data. $$\text{Entropy}_{\text{after}} = \frac{|S_l|}{|S|}\text{Entropy}(S_l) + \frac{|S_r|}{|S|}\text{Entropy}(S_r)$$
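
A minimal scikit-learn sketch (dataset and hyperparameters are illustrative); `criterion="entropy"` makes each tree choose splits by the weighted-entropy criterion above:

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Each tree is trained on a bootstrap sample with random feature
# subsets; the forest predicts by majority vote over the trees
rf = RandomForestClassifier(n_estimators=100, criterion="entropy",
                            random_state=0).fit(X_tr, y_tr)
print(rf.score(X_te, y_te))  # held-out accuracy
```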
Read more

DLVO theory: Atom Interaction

DLVO theory is named after Derjaguin, Landau, Verwey, and Overbeek, who developed it in the 1940s. It describes the forces between charged surfaces interacting through a liquid medium. The theory combines two main types of forces: attractive van der Waals forces and repulsive electrostatic double-layer forces.
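
In schematic form, the net interaction energy at surface separation $D$ is the sum of the two contributions (the exact expressions depend on the geometry and electrolyte conditions): $$V_{\text{total}}(D) = V_{\text{vdW}}(D) + V_{\text{EDL}}(D)$$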
Read more

Kernel Density Estimation (KDE)

Kernel Density Estimation (KDE) is a non-parametric method to estimate the probability density function (PDF) of a random variable based on a finite set of data points. Unlike parametric methods, which assume that the underlying data follows a specific distribution (like normal, exponential, etc.), KDE makes no such assumptions and can model more complex data distributions. $$ \hat{f}(x) = \frac{1}{n \cdot h} \sum_{i=1}^{n} K\left(\frac{x - x_i}{h}\right) $$
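
A direct NumPy translation of the estimator above with a Gaussian kernel (a minimal sketch, not from the original post; the bandwidth h is chosen arbitrarily here):

```python
import numpy as np

def kde(x_grid, samples, h):
    # f_hat(x) = 1/(n*h) * sum_i K((x - x_i)/h), K = standard normal pdf
    u = (x_grid[:, None] - samples[None, :]) / h
    K = np.exp(-0.5 * u**2) / np.sqrt(2 * np.pi)
    return K.sum(axis=1) / (len(samples) * h)

rng = np.random.default_rng(0)
samples = np.concatenate([rng.normal(-2, 0.5, 200),
                          rng.normal(1, 1.0, 300)])   # bimodal sample
x = np.linspace(-5, 5, 201)
density = kde(x, samples, h=0.3)
print(density.sum() * (x[1] - x[0]))  # ~1.0: integrates like a PDF
```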
Read more

Understanding the Taylor Series and Its Applications in Machine Learning

The Taylor Series is a mathematical tool that approximates complex functions with polynomials, playing a crucial role in machine learning optimization. It enhances gradient descent by incorporating second-order information, leading to faster and more stable convergence. Additionally, it aids in linearizing non-linear models and informs regularization techniques. This post explores the significance of the Taylor Series in improving model training efficiency and understanding model behavior. $$\cos(x) = \sum_{n=0}^{\infty} \frac{(-1)^n}{(2n)!} x^{2n}$$
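
As a quick numeric check (a minimal sketch, not from the original post), truncating the series above after a few terms already matches `math.cos` closely near x = 1:

```python
import math

def cos_taylor(x, n_terms):
    # Partial sum of sum_{n=0}^{inf} (-1)^n x^(2n) / (2n)!
    return sum((-1)**n * x**(2 * n) / math.factorial(2 * n)
               for n in range(n_terms))

x = 1.0
for n_terms in (2, 4, 6):
    approx = cos_taylor(x, n_terms)
    print(n_terms, approx, abs(approx - math.cos(x)))  # error shrinks fast
```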
Read more

FoldX

The FoldX Suite builds on the strong foundation of advanced protein design features already implemented in the successful FoldX3, and exploits the power of fragment libraries by integrating in silico digested backbone protein fragments of different lengths. This fragment-based strategy enables new, powerful capabilities: loop reconstruction, implemented in LoopX, and peptide docking, implemented in PepX. The Suite also features improved usability thanks to a new Boost-based command-line interface.
Read more

Juicer: a One-Click System for Analyzing Loop-Resolution Hi-C Experiments

Hi-C experiments explore the 3D structure of the genome, generating terabases of data to create high-resolution contact maps. Here, we introduce Juicer, an open-source tool for analyzing terabase-scale Hi-C datasets. Juicer allows users without a computational background to transform raw sequence data into normalized contact maps with one click. Juicer produces a .hic file containing compressed contact matrices at many resolutions, facilitating visualization and analysis at multiple scales. Structural features, such as loops and domains, are automatically annotated.
Read more

NextDenovo: an efficient error correction and accurate assembly tool for noisy long reads

NextDenovo is a string graph-based de novo assembler for long reads (CLR, HiFi and ONT). It uses a "correct-then-assemble" strategy similar to canu (there is no correction step for PacBio HiFi reads), but requires significantly fewer computing resources and less storage. After assembly, the per-base accuracy is about 98–99.8%; to further improve single-base accuracy, try NextPolish.
Read more

IgCaller

IgCaller is a Python program designed to fully characterize immunoglobulin gene rearrangements and oncogenic translocations in lymphoid neoplasms. It was originally developed for WGS data but has been extended to work with WES and high-coverage, capture-based NGS data.
Read more