SeqKit: A Cross-Platform and Ultrafast Toolkit for FASTA/Q File Manipulation© Della-3
IMGT®: the international ImMunoGeneTics information system®© DALLE 3

IMGT®: the international ImMunoGeneTics information system®

MGT®, the international ImMunoGeneTics information system®, is a comprehensive resource for immunogenetics and immunoinformatics. It was created and has been developed since 1989 by Professor Marie-Paule Lefranc at the Université de Montpellier, France. IMGT® is recognized as the global reference in immunogenetics and immunoinformatics due to its high-quality, integrated knowledge and its standardized nomenclature.
Read more
Immunoglobulin BLAST (Igblast), a Blast Tool Specific for Antibodies© Karobben

scVDJ-Seq Pipeline (CellRanger)

The cellranger vdj pipeline can be used to analyze sequencing data produced from Chromium Next GEM Single Cell 5' V(D)J libraries. It takes FASTQ files for V(D)J libraries and performs sequence assembly and paired clonotype calling. The pipeline uses the Chromium Cell Barcodes (also called 10x Barcodes) and UMIs to assemble V(D)J transcripts per cell. Clonotypes and CDR3 sequences are output as a .vloupe file which can be loaded into Loupe V(D)J Browser. Visit the What is Cell Ranger page to learn more about Cell Ranger for Immune Profiling. (10X Genomics)
Read more
Phylogenetic Tree© Karobben

Phylogenetic Tree

A phylogenetic tree is a branching diagram that shows the evolutionary relationships among different species or entities, based on their physical or genetic characteristics. It illustrates how species have diverged from common ancestors over time. These trees are constructed using morphological or genetic data and are used in biology, epidemiology, and conservation to understand the evolutionary history and relationships of organisms.
Read more

CD-HIT was originally a protein clustering program. The main advantage of this program is its ultra-fast speed. It can be hundreds of times faster than other clustering programs, for example, BLASTCLUST. Therefore it can handle very large databases, like NR. The 1st version of this program, CD-HI, was published and released in 2001. The 2nd version, called CD-HIT, was published in 2002 with significant improvements. Since 2004, CD-HIT has been hosted at bioinformatics.org as an open source project. Current CD-HIT package can perform various jobs like clustering a protein database, clustering a DNA/RNA database, comparing two databases (protein or DNA/RNA), generating protein families, and many others.
Read more
Extract Information from PDB and Visualize with Pyvista© Karobben

Extract Information from PDB and Visualize with Pyvista

In summary, extracting information from PDB files and visualizing it with PyVista enables a deep and interactive exploration of biomolecular structures, which is vital for various scientific and medical research purposes. This approach harnesses the power of computational tools to augment our understanding of complex biological systems.
Read more
Visualize the Protein Mesh with pyvista© Karobben

Visualize the Protein Mesh with pyvista

The main idea of this work is to use PyMOL for calculating the surface information/mesh structure, and then employ Python to read that information for visualization and other complex/advanced calculations. In other words, it seamlessly bridges PyMOL and Python.
Read more
Understanding PacBio Sequencing: A Deep Dive for RNA-Seq Enthusiasts© Dalle3

Understanding PacBio Sequencing: A Deep Dive for RNA-Seq Enthusiasts

The blog post delves into the realm of PacBio sequencing, elucidating its significance in the world of next-generation sequencing. Contrasting PacBio with other sequencing technologies, such as Illumina's short-read and Oxford Nanopore's long-read sequencing, the article highlights the unique advantages and challenges posed by each. The comprehensive PacBio data analysis pipeline is elucidated step by step, from raw data collection to final report generation. A special section is dedicated to comparing tools used in the PacBio pipeline, offering insights into their strengths and limitations.
Read more
ATAC-seq: A Powerful Tool for Mapping Gene Regulation© Dalle3

ATAC-seq: A Powerful Tool for Mapping Gene Regulation

Gene regulation plays a crucial role in various biological processes, and understanding its mechanisms is essential for advancing our knowledge in life sciences. The Advent of ATAC-seq, a powerful tool for mapping open chromatin regions, has revolutionized the study of gene regulation by providing insight into the regulatory elements that control gene expression. This review aims to provide an overview of the current state of ATAC-seq applications in various fields, including stem cell biology, cancer research, neurobiology, immunology, plant biology, microbiology, drug discovery, personalized medicine, and synthetic biology. We discuss the advantages and limitations of ATAC-seq and highlight its potential for identifying new therapeutic targets and developing personalized therapies. Overall, ATAC-seq has proven to be a valuable tool for unlocking gene regulation and has the potential to lead to significant breakthroughs in many areas of life science research.
Read more
Pseudotime Analysis with Monocle: A Beginner's Guide© Dalle3

Pseudotime Analysis with Monocle: A Beginner's Guide

Pseudotime analysis provides a transformative lens into cellular dynamics, offering an avenue to chart the developmental journey of individual cells. This primer introduces the novice to the realm of pseudotime and its significance in the intricate landscape of cell differentiation and gene expression. Utilizing Monocle, a pioneering tool in this domain, the article elucidates how cellular trajectories are constructed from single-cell RNA-sequencing data. The comparison of Monocle with its contemporaries highlights its robustness in handling complex trajectories and its unparalleled flexibility. As the biological world delves deeper into cellular intricacies, tools like Monocle stand as indispensable allies in unearthing the secrets of cellular progression. This article serves as a beacon for those navigating the vast ocean of single-cell analysis.
Read more
Integrating scRNA-Seq and scATAC-Seq Data: A Primer© DALLE3

Integrating scRNA-Seq and scATAC-Seq Data: A Primer

Single-cell sequencing technologies, notably scRNA-Seq and scATAC-Seq, offer unparalleled insights into gene expression and chromatin accessibility at the cellular level. However, integrating these distinct datasets presents a challenge due to their inherent differences. This article delves into the process of transforming scATAC-Seq data from genomic regions to gene-centric information and subsequently integrating it with scRNA-Seq data using shared latent spaces. By leveraging tools and methods that identify underlying patterns across datasets, researchers can achieve a comprehensive view of cellular states, bridging gene expression with chromatin dynamics.
Read more
scRNA-Seq: makers explore© Karobben

A Beginner's Guide to scRNA-Seq Data Integration

Single-cell RNA sequencing (scRNA-seq) offers unparalleled insights into cellular heterogeneity. However, integrating datasets from diverse sources poses challenges, especially for newcomers. This guide provides a concise walkthrough of scRNA-seq data integration using the Seurat package, coupled with essential tips for beginners. From preprocessing to downstream analysis, we cover the key steps to ensure effective data harmonization, aiming to empower researchers to derive meaningful insights from integrated datasets.
Read more

Understanding and Tackling Batch Effects in Single-Cell RNA-Seq Analysis

In single-cell RNA sequencing (scRNA-seq) analysis, batch effects—non-biological variations from different sample processing—are pervasive challenges. Without correction, they can obscure genuine biological signals. This article elucidates the importance of batch effect removal and presents a comparative overview of three popular correction methods within Seurat: Harmony, fastMNN, and SCTransform. Choosing an apt method ensures accurate and unbiased biological insights, highlighting the significance of vigilant batch correction in scRNA-seq studies.
Read more