NCERT Grounding
NCERT Class 12 Biology, Chapter 9 (Biotechnology: Principles and Processes), Section 9.3.2 is titled "Cutting of DNA at Specific Locations". The text establishes the central workflow: purified DNA is incubated with a restriction enzyme under its optimal conditions; gel electrophoresis verifies digestion; the vector undergoes the same treatment; and finally, the gene of interest and the linearised vector are mixed with ligase to yield recombinant DNA. The earlier section 9.2.1 (Restriction Enzymes) provides the mechanistic grounding — palindromic recognition, staggered cuts, and sticky ends — which this process section invokes without repeating in full.
"The cut piece of DNA was then linked with the plasmid DNA… The linking… became possible with the enzyme DNA ligase, which acts on cut DNA molecules and joins their ends."
NCERT Class 12 Biology, Chapter 9
Recognition Sequences and Palindromes
Every restriction endonuclease recognises a specific, short DNA sequence and cleaves within or adjacent to it. These recognition sequences are almost always palindromic — a property that has a precise meaning in the context of double-stranded DNA distinct from the everyday language sense.
A DNA palindrome reads the same on both strands when each strand is read in its own 5'→3' direction. The canonical NCERT example is the EcoRI recognition sequence:
| Strand | Sequence (5'→3') |
|---|---|
| Top strand | 5'— G A A T T C —3' |
| Bottom strand (written 5'→3') | 5'— G A A T T C —3' |
The bottom strand, written conventionally as 3'—CTTAAG—5', reads as GAATTC when reversed to the 5'→3' orientation. This symmetry is not coincidental: most restriction enzymes function as homodimers, with each identical subunit contacting one strand. The palindromic sequence places an identical binding interface on each strand, allowing the two subunits to bind symmetrically. Recognition sequences are typically four to eight base pairs long. Six-base cutters (such as EcoRI, BamHI, HindIII) produce fragments averaging 4096 bp in a random-sequence genome; four-base cutters (such as MboI) cut roughly every 256 bp and yield far smaller fragments.
The naming convention is examinable: the first letter is the genus initial, the next two are the species initials, and Roman numerals indicate the order of isolation from that strain. EcoRI derives from Escherichia coli strain RY 13, the 'R' denotes the strain, and 'I' denotes the first enzyme isolated from it. HindII derives from Haemophilus influenzae strain Rd, and was the first restriction endonuclease whose sequence-specificity was characterised (1968, Kelly and Smith).
Figure 1. EcoRI binds the palindrome 5'-GAATTC-3'. The two cuts (red lines) are staggered: the top strand is cleaved between G and A, while the bottom strand is cleaved between A and G, four nucleotides offset. This offset generates 5' overhangs of four bases (sticky ends).
Staggered Cuts vs Flush (Blunt) Cuts
Restriction enzymes fall into two mechanistic categories based on where within the recognition site each strand is cleaved.
Staggered (Sticky-End) Cuts
5' or 3' overhang
single-stranded tail
- Cut offset from the centre of palindrome
- Leaves 4–6 nt single-stranded overhang
- 5' overhang: EcoRI, BamHI, HindIII
- 3' overhang: KpnI, PstI, SacI
- Overhangs are complementary to themselves (self-complementary sticky ends) — anneal with H-bonds
- Ligation efficiency ~100× higher than blunt ends
Flush (Blunt-End) Cuts
No overhang
both strands cut at same position
- Cut at exact centre of palindrome
- Leaves no single-stranded tail
- Examples: SmaI (CCC↓GGG), HaeIII (GG↓CC), EcoRV
- Can join any blunt end to any other blunt end — less specific
- Requires T4 DNA ligase (not E. coli ligase) and higher enzyme concentration
- Used when no compatible sticky-end site is available
The stickiness of cohesive ends arises from transient Watson-Crick base-pairing between complementary overhangs. These hydrogen bonds hold two fragments in close proximity long enough for DNA ligase to seal the nick. The same complementary-end principle is the reason why the insert and vector must be digested with the same restriction enzyme: only then do their overhangs share the complementary sequence that permits annealing. If a partial compatibility is needed — for instance, joining a BamHI-cut fragment to a BglII-cut vector — it is possible because BamHI (5'-GGATCC) and BglII (5'-AGATCT) both produce the same 5'-GATC-3' overhang, making them compatible sticky ends. The resulting junction, however, is no longer a BamHI or BglII site and cannot be re-cleaved by either enzyme — an important consideration in subcloning strategies.
DNA Ligase and the Ligation Reaction
Mechanism of ligase action
DNA ligase catalyses the formation of a 3'-5' phosphodiester bond between two adjacent nucleotides at a nick in a double-stranded DNA molecule. The reaction requires (a) a free 3'-OH at one end, (b) a 5'-phosphate at the adjacent end, and (c) the energy cofactor. The mechanism proceeds in three steps:
DNA Ligase Reaction — Three Steps
-
Step 1
Enzyme adenylation
Ligase reacts with NAD+ (E. coli) or ATP (T4, eukaryotic) to form a covalent enzyme-AMP intermediate; AMP is transferred to the active-site lysine (Lys-AMP).
-
Step 2
Nick adenylation
The AMP group is transferred to the 5'-phosphate at the nick, forming a 5'-adenylated (5'-AppN) intermediate — a high-energy intermediate that activates the 5'-phosphate.
-
Step 3
Phosphodiester bond formation
The 3'-OH attacks the adenylated 5'-phosphate in an in-line nucleophilic substitution; the phosphodiester bond forms and AMP is released, sealing the nick.
E. coli ligase vs T4 DNA ligase
| Property | E. coli DNA Ligase | T4 DNA Ligase |
|---|---|---|
| Energy cofactor | NAD+ | ATP |
| Nick-sealing efficiency | High | High |
| Blunt-end ligation | Very poor | Efficient (with PEG) |
| Optimal temperature (cohesive ends) | 37 °C | 4–16 °C |
| Use in recombinant DNA | Rarely used | Standard (NCERT reference) |
| Source | E. coli chromosome | Bacteriophage T4 |
NCERT references "the enzyme DNA ligase" and "T4 ligase" specifically in the context of joining recombinant molecules. T4 DNA ligase is the workhorse of molecular cloning because it tolerates a wider range of substrates and can join blunt ends when required. The low temperature (4–16 °C) used for sticky-end ligation is a kinetic compromise: it stabilises the transient base-pairing of the overhangs while still permitting sufficient ligase activity.
Ligation efficiency advantage
Sticky-end ligation is approximately 100-fold more efficient than blunt-end ligation under standard conditions. This is why cloning strategies are designed around shared restriction enzyme sites rather than blunt-end joining whenever possible.
Optimising ligation efficiency
Several practical variables govern ligation success in the laboratory, and understanding them clarifies why the procedure is designed as it is. The insert-to-vector molar ratio is typically maintained at 3:1 to 10:1; too little insert drives self-ligation or empty vector; too much insert produces concatemers. Total DNA concentration influences whether ligation proceeds intermolecularly (giving circular ligation products) or intramolecularly. At low concentrations, intramolecular self-ligation of linearised vector is favoured — this is why alkaline phosphatase pre-treatment of the vector is used to suppress this pathway (see below). PEG (polyethylene glycol) is often added to blunt-end ligations because its macromolecular crowding effect increases the effective concentration of DNA ends.
Alkaline Phosphatase and Self-Ligation Prevention
One of the most important troubleshooting steps in cloning is the prevention of vector self-ligation. When a vector is linearised by a single restriction enzyme, both ends carry compatible sticky overhangs — they can reanneal and be sealed by ligase to regenerate the original empty circular vector. These self-ligated vectors transform host cells at high efficiency but carry no insert; the resulting colonies are background noise that must be screened out.
Alkaline phosphatase eliminates this background by removing the 5'-phosphate group from linearised vector termini. Calf intestinal alkaline phosphatase (CIP) or bacterial alkaline phosphatase (BAP) catalyses hydrolysis of the 5'-phosphomonoester bond, converting a 5'-phosphate terminus to a 5'-hydroxyl terminus. Because DNA ligase requires a 5'-phosphate as one of its substrates, dephosphorylated vector ends cannot be joined — the vector cannot re-circularise and remain linear in solution.
The insert DNA, however, is not phosphatase-treated and therefore retains its 5'-phosphates. In the ligation reaction, the insert's 5'-phosphates provide the phosphodiester bond at each junction. Ligase seals two nicks: one nick at each end of the insert, on the strand that has the vector 5'-OH adjacent to the insert 5'-phosphate (actually, these nicks cannot be sealed in vitro; they are sealed by intracellular ligase after transformation). The practical outcome is that transformants carrying inserts are strongly selected over empty-vector background. Quantitatively, dephosphorylated vector backgrounds can be reduced from ~50% self-ligated colonies to less than 1–2%.
Figure 2. Left: without phosphatase treatment, the linearised vector's 5'-phosphate ends (red circles) can re-ligate to reform empty circular vector. Right: after CIP/BAP treatment, the 5'-phosphate is replaced by a 5'-OH (open circles). Ligase cannot seal a nick with two OH groups; only when an insert provides a 5'-phosphate is ligation possible, strongly selecting for recombinants.
Complete vs Partial Digests
The degree to which restriction enzyme digestion proceeds to completion has major consequences for the resulting fragment pattern and its downstream use.
Complete Digest
Conditions: excess enzyme, optimal temperature and buffer, sufficient incubation time (1–2 hr)
Result: reproducible, defined fragment set; predictable by restriction map
Use: routine subcloning, diagnostic digests, RFLP analysis
Risk: star activity (enzyme cuts non-canonical sites at high enzyme concentration, glycerol, or non-optimal pH) can generate spurious fragments
Gel electrophoresis confirms complete digestPartial Digest
Conditions: limiting enzyme or shortened incubation
Result: a nested set of overlapping fragments spanning adjacent recognition sites
Use: genomic library construction — adjacent clones overlap, enabling assembly of contiguous sequences
Risk: difficult to control reproducibly; large inserts may be unstable in standard vectors
Library construction, chromosome walkingIn genomic library construction, partial digestion is deliberately employed because any single restriction enzyme will produce gaps in the genome at sites where recognition sequences are rare or absent. If adjacent fragments overlap (because some sites between them were not cut), any gene of interest will be contained within at least one clone in the library, regardless of where restriction sites fall relative to the gene. The degree of overlap is controlled by adjusting enzyme concentration. Typically, partial digests use 1/100 to 1/1000 of the enzyme amount needed for a complete digest.
Worked Examples
A student treats a plasmid with EcoRI and separates the digest by agarose gel electrophoresis. Two bands appear. What does this tell you about the number of EcoRI recognition sites in the plasmid?
Answer: Exactly 2 EcoRI recognition sites are present. A circular plasmid with one EcoRI site would linearise to give a single band. With two sites, the plasmid is cut into two linear fragments of (possibly) different sizes, producing two bands. Each additional site adds one more fragment: n cuts in a circular molecule produce n fragments.
A researcher wants to clone a gene into the BamHI site of pBR322 and selects transformants. She finds colonies growing on ampicillin plates but not all carry the insert. Explain why and suggest an improvement.
Answer: The BamHI site in pBR322 lies within the tetracycline resistance gene (tetR). Successful recombinants have the insert disrupting tetR, so they grow on ampicillin but not tetracycline (insertional inactivation). Colonies without insert (self-ligated vector) grow on both antibiotics. To identify recombinants, replica-plate ampicillin-selected colonies onto tetracycline-containing medium; colonies that fail to grow on tetracycline are recombinants. An improvement for reducing background is to treat the BamHI-linearised vector with alkaline phosphatase before ligation, which dephosphorylates vector ends and prevents self-ligation, raising the fraction of insert-containing transformants.
Identify which of the following sequences is a palindrome that could be recognised by a type II restriction enzyme: (A) 5'-GAATTC-3' / 3'-CTTAAG-5' (B) 5'-CTCAGT-3' / 3'-GAGTCA-5' (C) 5'-GTATTC-3' / 3'-CATAAG-5'
Answer: Option A is the palindrome. To check: read the bottom strand in its 5'→3' direction (reverse the 3'→5' sequence CTTAAG): GAATTC — identical to the top strand. Options B and C fail this test: the 5'→3' reading of their bottom strands is not identical to their top strands. This is the EcoRI recognition site and corresponds directly to a NEET 2022 question pattern (Q.146).
Common Confusion and NEET Traps
Exonuclease
- Removes nucleotides from the ends (5' or 3') of a DNA strand
- Does NOT make internal cuts
- Used in proofreading (DNA polymerase 3'→5' exonuclease), nick translation, Bal 31 trimming
- Cannot produce fragments from intact linear DNA by itself
Endonuclease
- Cuts within (endo) the DNA strand at internal positions
- Restriction endonucleases are a specialised class recognising specific sequences
- DNase I is a non-specific endonuclease (used in DNase footprinting)
- Restriction enzymes = restriction endonucleases (not exonucleases)