Antibody 12/23 rule

How Does Antibody Fragments Jointed Together?

We all know that antibodies are composed of V, D, and J segments, which originate from different locations on the chromosome. But how are they connected together after post-transcriptional modification? The 12/23 rule is the fundamental mechanism that ensures proper recombination of these segments.

The V region, or V domain, of an immunoglobulin heavy or light chain is encoded by more than one gene segment. For the light chain, the V domain is encoded by two separate DNA segments. The first segment encodes the first 95–101 amino acids of the light chain and is termed a V gene segment because it encodes most of the V domain. The second segment encodes the remainder of the V domain (up to 13 amino acids) and is termed a joining or J gene segment[1].

What Is 12/23 Rule

Video tutorial: Daniel Levy; 2013. VDJ Gene Recombination

The 12/23 rule is a principle in V(D)J recombination, a process crucial for generating the diversity of antibodies and T-cell receptors. It states that recombination can only occur between gene segments flanked by recombination signal sequences (RSS) with spacers of 12 base pairs (bp) and 23 bp. This ensures proper alignment and prevents inappropriate recombination events, maintaining the integrity and functionality of the immune system’s response.

12/23 rule illustration
© Charles A. Janeway

This image illustrates the 12/23 rule in the context of V(D)J recombination.

  1. Recombination Signal Sequences (RSS):

    • Heptamer: A conserved sequence of 7 base pairs.
    • Nonamer: A conserved sequence of 9 base pairs.
    • Spacer: The region between the heptamer and nonamer, either 12 or 23 base pairs long.
  2. V(D)J Recombination Process:

    • Segments: V (Variable), D (Diversity), and J (Joining) segments.
    • Rule Application: The 12/23 rule ensures that only segments with RSS of different spacers (12 and 23 bp) recombine, facilitating the correct assembly of these segments.

The image visually represents how the rule guides the alignment and recombination of V, D, and J gene segments, which is essential for generating the diversity of antibodies and T-cell receptors.

How It Worked

The heptamer and nonamer is the recombination signal sequences (RSSs). The (RAG1-RAG2)2 endonuclease complex (RAG) specifically recognizes and cleaves a pair of RSSs[2]

© Ru H[2:1]

How Does It Applied

IgDetective

IgDetective, published by Vikram Sirupurapu and Yana Safonova[3], is a tool for detecting and naming antibodies based on the 12/23 rule. This tool leverages the stringent application of the 12/23 rule among mammals, making it suitable primarily for mammalian species.

  • Test: not suitable for birds like chicken

williamdlees-Digger

This is another open source tool developed by William D. Lees[4], aimed at annotating the positions of V/D/J genes on newly assembled genomes. According to the results shown in the documentation, it has very high accuracy.

Prerequisite:

  • a set of known core coding region allele sequences
  • Position-weighted matrices

Some technic details worthy to know:

  • The search for V-sequences uses the parameters gapopen 5, gapextend 5, penalty -1, word_size 11.
  • D and J searches, word_size is reduced to 7 to reflect the shorter sequences
  • D sequences the evalue is set to 100 rather than the default of 10 to widen the search.
    How it find the heptamers and nonamers:
Type Functionality criteria
V RSS nonamer and heptamer pass PWM thresholds, and match canonical consensus, if defined.
L-PART1 and L-PART2 pass PWM thresholds and splice to form a coding sequence with no STOP-CODONs that is in-frame with the V sequence.
V-REGION is in-frame and has first and second cysteines at the correct positions in the IMGT alignment.
No STOP-CODONs are present in the V-REGION before the second cysteine.
D RSS nonamers and heptamers match canonical consensus, if defined.
J RSS nonamer and heptamer pass PWM thresholds, and match canonical consensus, if defined.
The J-motif is found at the expected position relative to the end of the J sequence.
The donor splice is found at the expected position, given the length of the matched sequence.

Limitation: It only supports IMGT well-annotated species because it relies on germline annotation from the IMGT database.
Merits: Easy used and to be understood (write by python).

Digger Results
© William D. Leeszs
extract_refs -L IGH "Gallus gallus"
fix_macaque_gaps Gallus_gallus_IGHV_gapped.fasta \
Gallus_gallus_IGHV_gapped_fixed.fasta IGH
cat Gallus_gallus_IGHV.fasta Gallus_gallus_IGHD.fasta Gallus_gallus_IGHJ.fasta \
> Gallus_gallus_IGHVDJ.fasta

parse_imgt_annotations --save_sequence IMGT000014.fasta \
"https://www.imgt.org/ligmdb/view.action?format=IMGT&id=IMGT000014" \
IMGT000014_genes.csv IGH

cat IMGT000014.fasta > Mmul_051212.fasta


mkdir motifs
cd motifs
parse_imgt_annotations \
"https://www.imgt.org/ligmdb/view?format=IMGT&id=IMGT000014" \
IMGT000014_genes.csv IGH
calc_motifs IGH IMGT000014_genes.csv
cd ..

makeblastdb -in Gallus_gallus_IGHV.fasta -dbtype nucl
makeblastdb -in Gallus_gallus_IGHD.fasta -dbtype nucl
makeblastdb -in Gallus_gallus_IGHJ.fasta -dbtype nucl

blastn -db Gallus_gallus_IGHV.fasta -query Mmul_051212.fasta -out mmul_IGHV.out \
-outfmt 7 -gapopen 5 -gapextend 5 -penalty -1 -word_size 11
blastn -db Gallus_gallus_IGHD.fasta -query Mmul_051212.fasta -out mmul_IGHD.out \
-outfmt 7 -gapopen 5 -gapextend 5 -penalty -1 -word_size 4 -evalue 100
blastn -db Gallus_gallus_IGHJ.fasta -query Mmul_051212.fasta -out mmul_IGHJ.out \
-outfmt 7 -gapopen 5 -gapextend 5 -penalty -1 -word_size 4



blastresults_to_csv mmul_IGHV.out mmul_ighvdj_
blastresults_to_csv mmul_IGHD.out mmul_ighvdj_ -a
blastresults_to_csv mmul_IGHJ.out mmul_ighvdj_ -a

find_alignments Gallus_gallus_IGHVDJ.fasta \
Mmul_051212.fasta \
"mmul_ighvdj_nw_*.csv" \
-ref imgt,Gallus_gallus_IGHVDJ.fasta \
-align Gallus_gallus_IGHV_gapped_fixed.fasta \
-motif_dir motifs \
Mmul_051212.csv

digger ../../../Duck/data/GCA_015476345.1_ZJU1.0_genomic.fna \
-v_ref Homo_sapiens_IGHV.fasta \
-d_ref Homo_sapiens_IGHD.fasta \
-j_ref Homo_sapiens_IGHJ.fasta \
-v_ref_gapped Homo_sapiens_IGHV_gapped.fasta \
-ref imgt,Homo_sapiens_IGHVDJ.fasta \
-species Chicken \
-locus IGH \
IMGT000035.csv


How It Really Looks Like in Human Genome

I randomly checked a few sequences from the homo genome and found that those regions from different V, D, and J genes are very similar. It could because that the 12/23 rule is not very stringent but flexible. But it could also caused by they are prevented to be recombined.


  1. Janeway C, Travers P, Walport M, et al. Immunobiology: the immune system in health and disease[M]. New York: Garland Pub., 2001. ↩︎

  2. Ru H, Zhang P, Wu H. Structural gymnastics of RAG-mediated DNA cleavage in V (D) J recombination[J]. Current opinion in structural biology, 2018, 53: 178-186. ↩︎ ↩︎

  3. Sirupurapu V, Safonova Y, Pevzner P A. Gene prediction in the immunoglobulin loci[J]. Genome research, 2022, 32(6): 1152-1169. ↩︎

  4. Lees W D, Saha S, Yaari G, et al. Digger: directed annotation of immunoglobulin and T cell receptor V, D, and J gene sequences and assemblies[J]. Bioinformatics, 2024, 40(3): btae144. ↩︎

Author

Karobben

Posted on

2024-05-18

Updated on

2024-05-29

Licensed under

Comments