NCERT grounding
Section 9.2.2 of NCERT Class 12 Biology opens the discussion of cloning vectors by observing that plasmids and bacteriophages can replicate inside bacterial cells independently of chromosomal DNA, and that bacteriophages, because of their high copy number per cell, amplify any linked DNA enormously. NCERT then enumerates three non-negotiable features — origin of replication, selectable marker, and cloning sites — and uses pBR322 as the canonical worked example, explaining how insertional inactivation of the tetracycline resistance gene signals a successful recombinant clone. The Ti plasmid of Agrobacterium tumefaciens and disarmed retroviruses are introduced as vectors for plant and animal cells, respectively.
"Vectors used at present are engineered in such a way that they help easy linking of foreign DNA and selection of recombinants from non-recombinants."
NCERT Class 12 Biology — Chapter 9, Section 9.2.2
What is a cloning vector?
A cloning vector is a DNA molecule capable of autonomous replication inside a host cell that serves as a carrier to transport a foreign DNA fragment into the host and maintain it stably through many rounds of cell division. The term "vector" is borrowed from epidemiology: just as a mosquito vector carries a parasite into a human host, a plasmid or phage vector delivers an alien DNA sequence into a bacterium, yeast, or plant cell.
The critical requirement is that the vector must replicate not only its own sequence but also whatever foreign DNA has been ligated into it. This depends on a specific origin of replication (ori) sequence recognised by the host's replication machinery. Without a functional ori, the inserted DNA is diluted out of the host population within a few generations and is lost.
Vectors also serve a second purpose: they provide a framework for selecting and identifying the rare cells that have successfully taken up a recombinant DNA molecule. In a typical transformation experiment, fewer than 1 in 10,000 bacterial cells picks up a plasmid, and of those transformants only a fraction carry the insert. Selectable markers and reporter systems on the vector allow the experimenter to find these needles in a haystack.
Three essential features of a cloning vector
NCERT specifies three features that every cloning vector must possess. Absence of any one makes the vector functionally useless for gene cloning.
Origin of Replication (ori)
Function: The sequence at which host-cell DNA polymerase initiates replication of the plasmid or phage genome.
Copy number: Different ori sequences support different copy numbers per cell. pUC vectors carry the pMB1 ori and reach 500–700 copies per cell. pBR322 carries the ColE1-derived ori and is maintained at 15–20 copies per cell under normal conditions, rising to several hundred when amplified by chloramphenicol treatment.
NEET point: The ori sequence also controls how many copies of the linked foreign DNA will be produced. High-copy-number ori = more product. NEET 2020 directly tested this (Answer: ori site).
Selectable Marker
Function: A gene whose expression distinguishes transformed cells (those that took up the vector) from untransformed cells (those that did not).
Classic markers: Antibiotic-resistance genes — ampR (ampicillin), tetR (tetracycline), kanR (kanamycin), camR (chloramphenicol). Normal E. coli cells carry none of these, so plating on antibiotic medium kills all non-transformants.
Reporter markers: lacZ (blue-white screening), green fluorescent protein (GFP), luciferase. These distinguish recombinant from non-recombinant transformants by visible phenotype rather than growth/death.
Cloning Sites (MCS)
Function: One or a few unique restriction enzyme recognition sequences where foreign DNA can be inserted without disrupting vector replication.
Multiple Cloning Site (MCS): Modern vectors such as pUC19 cluster 10–13 unique restriction sites in a short MCS region within lacZ, enabling flexible choice of enzyme for directional cloning.
NCERT warning: If a vector has more than one site for the same restriction enzyme, cutting that enzyme will produce multiple fragments, complicating cloning. This is a direct NEET trap (NEET 2022 tested it).
Plasmid vectors in detail
pBR322 — the prototype vector
pBR322 was one of the first purpose-built cloning vectors, constructed in 1977 by Bolivar and Rodriguez (the "BR" in the name). It is 4,361 base pairs in length, circular, double-stranded, and carries the following elements:
| Element | Location / Notes | Function |
|---|---|---|
| ori | ColE1-derived; ~15–20 copies/cell | Autonomous replication in E. coli |
| ampR | Encodes beta-lactamase; contains PstI and PvuI sites | Primary selectable marker; insertion at PstI disrupts ampicillin resistance |
| tetR | Contains BamHI, SalI, HindIII, SphI, EcoRV sites | Secondary marker; insertion disrupts tetracycline resistance — classic insertional inactivation screen |
| rop gene | Encodes ROM protein | Controls plasmid copy number by stabilising the RNA I / RNA II duplex that regulates replication |
The standard cloning strategy with pBR322 exploits the tetR gene. Foreign DNA is ligated at the BamHI site within tetR, inactivating it. Transformants are first plated on ampicillin medium (all transformants grow; non-transformants die). Surviving colonies are then replica-plated onto tetracycline medium. Colonies that grow on ampicillin but fail on tetracycline carry recombinant plasmid. Colonies that grow on both media carry the non-recombinant (self-ligated) vector.
Figure 1. Schematic map of pBR322. The ampR gene (teal arc) and tetR gene (amber arc) are the two selectable markers. Foreign DNA inserted at the BamHI site within tetR inactivates tetracycline resistance; transformants are identified by growth on ampicillin but failure on tetracycline. The rop gene (purple) encodes ROM protein, which controls copy number.
pUC19 and blue-white screening
pUC19 (2,686 bp) improved on pBR322 in two major ways: it uses a high-copy-number ori (500–700 copies/cell) and incorporates a blue-white screening system that distinguishes recombinants from non-recombinants on a single plate, eliminating the labour-intensive replica-plating step required with pBR322.
The cloning sites of pUC19 are clustered in a Multiple Cloning Site (MCS) embedded within the lacZ gene that encodes the alpha-fragment of beta-galactosidase. When the host E. coli strain (lacZ deletion mutant) is transformed with pUC19 and plated on medium containing IPTG (inducer) and X-gal (chromogenic substrate), functional beta-galactosidase produces a blue colour. Insertion of foreign DNA into the MCS disrupts the lacZ reading frame; no functional enzyme is produced, and colonies remain white. White colonies are recombinant; blue colonies carry self-ligated (non-recombinant) vector.
Bacteriophage vectors
Lambda (λ) phage
Bacteriophage lambda is a double-stranded DNA phage with a genome of ~48.5 kb. Its life cycle has two pathways: the lytic cycle (phage replicates, lyses the cell, releases progeny) and the lysogenic cycle (phage DNA integrates into the bacterial chromosome as a prophage). As a cloning vector, lambda operates through the lytic cycle to produce large amounts of recombinant DNA.
The lambda genome has a central non-essential "stuffer" region (genes governing lysogeny) that can be deleted and replaced by foreign DNA without affecting the phage's ability to replicate and package. This stuffer can be replaced by inserts of up to ~20 kb. Two types of lambda vectors exist: insertion vectors (one restriction site, small inserts up to 10 kb) and replacement vectors (two flanking restriction sites, stuffer removed, inserts 9–23 kb). Replacement lambda vectors are used for constructing genomic DNA libraries from organisms with large genomes.
Lambda phage vectors offer a critical practical advantage over plasmids: they can be packaged into phage heads in vitro and introduced into E. coli by infection, which is orders of magnitude more efficient than chemical transformation used for plasmids. This matters when constructing comprehensive genomic libraries where every possible fragment must be represented.
M13 filamentous phage
M13 is a filamentous, single-stranded DNA phage that infects male (F+) E. coli cells through the F pilus. Unlike lambda, M13 does not lyse its host but instead causes a chronic, slowed infection, continuously secreting phage particles. The M13 genome is circular, single-stranded, ~6.4 kb.
M13 vectors produce single-stranded DNA, which is ideal for two applications: Sanger DNA sequencing (single-stranded templates work more efficiently than double-stranded) and site-directed mutagenesis (a synthetic mismatched oligonucleotide anneals to the single-stranded template and primes synthesis of a mutant strand). M13mp series vectors incorporate a lacZ MCS for blue-white selection, mirroring pUC vectors.
High-capacity vectors: cosmids, BAC, YAC
As genome projects required cloning of increasingly large DNA fragments, plasmid and phage lambda vectors proved insufficient. Three generations of high-capacity vectors were developed, each exploiting a different replication strategy.
BAC — Bacterial Artificial Chromosome
100–300 kb
insert capacity
- Based on F-plasmid (fertility plasmid) ori of E. coli
- Single-copy maintenance — very low chimaera rate
- Propagated in standard E. coli; easy manipulation
- Workhorse of the Human Genome Project sequencing phase
- Selectable marker: camR (chloramphenicol)
- Transformed by electroporation
YAC — Yeast Artificial Chromosome
200–2000 kb
insert capacity
- Carries yeast centromere (CEN), two telomeres (TEL), ARS (autonomously replicating sequence)
- Propagated in Saccharomyces cerevisiae as a genuine chromosome
- Highest capacity — essential for cloning very large genes
- Higher chimaera rate than BACs (two inserts can ligate)
- Selectable markers: TRP1 and URA3 (auxotrophic complementation)
- Used in early HGP physical mapping phase
Cosmids
Cosmids are hybrid vectors that combine elements of a plasmid with the cos sites (cohesive ends) of phage lambda. The cos sites (a ~200 bp sequence) are the only lambda sequences retained; everything else is replaced by a plasmid backbone containing an ori, a selectable marker (usually ampR), and a cloning site. Because phage packaging machinery requires only cos sites spaced ~40–50 kb apart, foreign DNA (30–45 kb) inserted between the cos sites is packaged into phage heads in vitro. This gives cosmids the transformation efficiency of phage infection combined with the simplicity of plasmid propagation.
Cosmid insert capacity (kb)
Larger than plasmid (~10 kb) and lambda replacement vectors (~23 kb), yet propagated as a plasmid. The packaging efficiency of phage infection is retained through the cos sites.
Figure 2. Comparative insert capacities of common cloning vectors. The scale is illustrative (YAC capacity truncated). YAC's insert capacity exceeds that of all others, followed by BAC, cosmid, lambda phage, and simple plasmids. For NEET, remembering that YAC holds the largest insert and BAC is the E. coli equivalent is sufficient.
Vectors for plant and animal cells
Ti plasmid of Agrobacterium tumefaciens
Agrobacterium tumefaciens is a gram-negative soil bacterium that causes crown gall disease in many dicotyledonous plants. The pathogenicity depends on a large (~200 kb) extrachromosomal plasmid called the Ti plasmid — Ti standing for Tumour Inducing. A defined segment of this plasmid, the T-DNA (transferred DNA, 15–30 kb), is excised from the Ti plasmid, transferred through the bacterial cell wall, transported across the plant cell wall and membrane, and integrated semi-randomly into the plant nuclear genome. Once integrated, T-DNA genes are expressed under plant promoters to produce opines (amino acid derivatives used by the bacterium as a carbon and nitrogen source) and plant growth hormones (auxin, cytokinin) that drive tumour formation.
Biotechnologists recognised that the T-DNA delivery mechanism is independent of the T-DNA sequence itself: the bacterium's vir (virulence) genes encoded elsewhere on the Ti plasmid are responsible for excision and transfer. This means the T-DNA region can be replaced with any gene of interest without affecting delivery. The standard strategy produces a disarmed Ti plasmid by removing the opine synthesis genes and the oncogenes (tumour-inducing genes) from the T-DNA, replacing them with the foreign gene of interest along with a plant selectable marker (usually nptII, conferring kanamycin resistance in plant cells). The disarmed Ti plasmid is then introduced into Agrobacterium, and the modified bacterium is used to infect plant leaf discs. Transformed plant cells are selected on kanamycin medium and regenerated into complete transgenic plants.
The Ti plasmid system works efficiently for dicotyledonous plants (tobacco, tomato, potato, Arabidopsis) but is less effective for many monocots (wheat, rice, maize), for which alternative methods such as biolistics (gene gun) are used instead.
Retroviral vectors for animal cells
Retroviruses naturally insert their RNA genome (as cDNA) into the host cell's chromosomal DNA via their integrase enzyme. This integration is stable and heritable. In gene delivery, disarmed retroviruses — in which the genes encoding viral proteins have been removed and replaced by the therapeutic gene — serve as efficient delivery vehicles for animal cells. The viral structural proteins needed for packaging are supplied separately in trans by a packaging cell line. The resulting recombinant retroviral particles infect target cells and integrate the foreign gene stably into the genome, but cannot replicate further because structural genes are absent.
The limitation of retroviruses is that they can only infect actively dividing cells and insert randomly, occasionally activating proto-oncogenes. Lentiviral vectors (derived from HIV) overcome the dividing-cell restriction. Adenoviral and adeno-associated virus (AAV) vectors, which do not integrate, are preferred for in vivo gene therapy because they avoid insertional mutagenesis.
Worked examples
A scientist inserts a foreign gene at the BamHI site of pBR322 and transforms E. coli. She plates the cells on (i) LB + ampicillin and (ii) LB + tetracycline. Which colonies carry recombinant plasmid?
Solution: BamHI cuts within the tetR gene of pBR322. Inserting foreign DNA at this site causes insertional inactivation of tetR, so the recombinant plasmid confers ampicillin resistance but not tetracycline resistance. Colonies that grow on ampicillin medium but fail to grow on tetracycline medium carry the recombinant plasmid. Colonies that grow on both media carry self-ligated (non-recombinant) vector with an intact tetR gene.
A cloning vector has two BamHI recognition sites and one EcoRI site. A researcher uses BamHI to clone a foreign gene. What problem will arise?
Solution: Two BamHI sites on the vector mean BamHI digestion will produce two vector fragments rather than a linear vector ready to accept an insert. The foreign DNA insert can ligate with either or both fragments in multiple orientations, producing heterogeneous, unintended constructs. NCERT explicitly states that a vector must have very few, preferably single recognition sites for the restriction enzyme used, because multiple sites complicate gene cloning. The researcher should use EcoRI instead if the foreign gene lacks EcoRI sites.
A pUC19 transformation is plated on LB + ampicillin + IPTG + X-gal. Explain the colour of (a) non-transformant colonies, (b) non-recombinant transformants, (c) recombinant transformants.
Solution: (a) Non-transformants: no pUC19 taken up, so no ampicillin resistance; these cells die and form no colonies. (b) Non-recombinant transformants: carry intact lacZ within the MCS; IPTG induces lacZ expression, beta-galactosidase cleaves X-gal to produce an insoluble blue indigo dye — colonies are blue. (c) Recombinant transformants: foreign DNA interrupts the lacZ reading frame; no functional beta-galactosidase; X-gal is not cleaved — colonies are white. Pick white colonies for recombinant clones.
Distinguish between a cosmid and a BAC, and state the insert capacity of each.
Solution: A cosmid is a plasmid containing phage lambda cos sites; it is packaged in vitro into phage heads, infects E. coli, then circularises and replicates as a plasmid. Insert capacity: 30–45 kb. A BAC (Bacterial Artificial Chromosome) is based on the F-plasmid origin and is propagated as a single-copy plasmid in E. coli. Insert capacity: 100–300 kb. BACs are preferred for genome sequencing because their low copy number reduces the frequency of chimaeric clones seen in cosmids.
Common confusion and NEET traps
Blue colonies
Non-recombinant
Intact lacZ gene
- Plasmid re-ligated without insert (self-ligation)
- lacZ intact — beta-galactosidase produced
- X-gal cleaved to blue indigo dye
- Ampicillin resistant (transformant)
- Discard — no insert of interest
White colonies
Recombinant
Disrupted lacZ gene
- Foreign DNA inserted into MCS within lacZ
- lacZ reading frame disrupted — no beta-galactosidase
- X-gal not cleaved — no blue colour
- Ampicillin resistant (transformant)
- Pick — these colonies carry the insert