Principles of biotechnology — what the discipline rests on
The European Federation of Biotechnology defines biotechnology as the integration of natural science and organisms, cells, parts thereof, and molecular analogues for products and services. Stripped of jargon: it is the deliberate use of living systems — or their molecular machinery — to make things humans want. The traditional examples are familiar: curd, bread, wine, all microbe-mediated. Modern biotechnology narrows the term to processes that use genetically modified organisms at industrial scale, plus allied techniques like in-vitro fertilisation, DNA vaccines, and gene therapy.
Two core techniques made the modern discipline possible. Genetic engineering alters the chemistry of DNA or RNA and introduces it into a host so the host's phenotype changes. Bioprocess engineering maintains sterile, contamination-free conditions so the engineered cell can grow in industrial quantities to manufacture the desired product. Without sterile bioprocessing, the cleverest engineered cell is useless — a contaminating microbe will outgrow it.
Once you grant those two techniques, three research areas underpin every recombinant operation: identification of DNA with desirable genes, introduction of that DNA into the host, and maintenance of the introduced DNA in the host across generations so the protein keeps being made. Each of these maps to a tool or a process you will meet in this chapter.
The three pillars of any rDNA project. Every step that follows — choosing a restriction enzyme, picking a vector, transforming a host — addresses one of these three research areas. NEET often tests them in the abstract, as the "basic principles of genetic engineering."
1 · Identify the gene
Locate the DNA with the desirable trait — isolate it, characterise it, sequence it.
2 · Introduce into host
Link to a vector with origin of replication, transform a competent cell.
3 · Maintain & transfer
Ensure the alien DNA replicates with the host genome and passes to progeny.
An alien piece of DNA dropped into a cell is doomed unless it can replicate. Replication requires an origin of replication (ori) — a specific sequence that initiates copying. So the alien DNA must be linked to an ori, either by integrating into the host chromosome or by being ligated into a plasmid that already has one. This linking is what we call cloning — producing many identical copies of a template DNA.
An alien DNA cannot replicate on its own. It survives only by becoming part of something that already knows how to copy itself.
The central principle of recombinant DNA
The first artificial recombinant DNA — built by Cohen and Boyer in 1972 — was exactly this: an antibiotic-resistance gene cut out of one piece of DNA and stitched into a native plasmid of Salmonella typhimurium using restriction enzymes (the "molecular scissors") and DNA ligase. When transferred into E. coli, the new construct replicated and produced antibiotic-resistant bacteria — the first demonstration that genes could be moved, deliberately, across species boundaries.
Tools of rDNA — restriction enzymes
Restriction enzymes were discovered in 1963 in E. coli, where a pair of enzymes restrict bacteriophage growth — one methylates host DNA to mark it, the other cuts unprotected (phage) DNA. The cutter, when it was characterised in 1968 and named Hind II, turned out to recognise a specific 6-base-pair sequence and to cut DNA wherever that sequence appeared. That property — sequence-specific cutting — turned a defensive bacterial enzyme into the foundational tool of genetic engineering. Today more than 900 restriction enzymes have been isolated from over 230 bacterial strains.
Restriction enzymes are endonucleases — they make cuts within the DNA, unlike exonucleases which chew nucleotides from the ends. The naming convention is set: the first letter is the genus, the next two letters the species, an additional letter for the strain, and a Roman numeral for the order of isolation. EcoRI is from Escherichia coli strain RY13, isolation number I.
Three NEET-favourite restriction enzymes. All three recognise 6-bp palindromes and produce 4-base 5' sticky ends. Memorise their recognition sequences — direct PYQ targets in 2020 and 2022.
EcoRI
G↓AATTC
Escherichia coli RY13
Cuts between G and A, leaving 4-base 5'-AATT overhangs (sticky ends).
PYQ 2020, 2022Hind III
A↓AGCTT
Haemophilus influenzae Rd
From the same family as Hind II — the first restriction enzyme ever characterised. Cuts between A and A.
PYQ 2016BamH I
G↓GATCC
Bacillus amyloliquefaciens H
Recognition site sits inside the tetR gene of pBR322 — used for insertional inactivation cloning.
NCERT exampleEach enzyme inspects the DNA, finds its specific palindromic recognition sequence, binds, and cuts both strands of the double helix at fixed positions in the sugar-phosphate backbone. A palindrome in DNA is a sequence that reads identically on the two strands when both are read 5' to 3'. For EcoRI, that sequence is:
5'— G A A T T C —3'
3'— C T T A A G —5'EcoRI palindrome — read 5'→3' on either strand and you get GAATTC
Critically, restriction enzymes cut a little away from the centre of the palindrome — but between the same two bases on opposite strands. For EcoRI, the cut falls between G and A. The result is an asymmetric break: each fragment has a short, single-stranded overhang. These overhangs are the famous sticky ends — they hydrogen-bond with their complementary cut counterparts and let two fragments be re-joined by DNA ligase.
Because both source DNA and vector DNA are cut with the same enzyme, both end up with matching sticky ends. DNA ligase then seals the phosphodiester backbone on each strand, generating a single circular recombinant molecule. If you used different enzymes on source and vector, the overhangs would not match — and the recombinant simply could not be formed.
Cloning vectors — pBR322 and its relatives
A cloning vector is the DNA carrier that ferries the gene of interest into the host cell. The textbook vector is pBR322, an engineered E. coli plasmid. It is small, circular, double-stranded, and carries the four features every good cloning vector must have.
Four features of a good cloning vector. pBR322 has all four. NEET 2022 asked which feature is not desirable — the trap was "two or more recognition sites for the same enzyme," because multiple sites would fragment the vector itself during cloning.
ori — origin of replication
Replication start
controls copy number
Any DNA linked to ori replicates inside the host. The ori also controls how many copies per cell — a PYQ-favourite fact.
PYQ 2020Selectable markers — ampR, tetR
Antibiotic resistance
filters transformants
Genes for resistance to ampicillin, tetracycline, kanamycin, or chloramphenicol. Untransformed E. coli die on these antibiotics.
PYQ 2017, 2022Cloning sites — single, unique
Few RE sites
preferably one each
Single recognition sites for Hind III, EcoRI, BamH I, Sal I, Pvu II, Pst I, Cla I — each within a marker gene for screening.
PYQ 2022 — trap on multiple sitesSmall size
Compact circle
easy to manipulate
A small vector is easier to isolate, easier to transform, and replicates faster. pBR322 is about 4.3 kilobases.
pBR322's two markers — ampR (ampicillin resistance) and tetR (tetracycline resistance) — enable a beautiful screening trick called insertional inactivation. The Pst I cloning site sits inside the ampR gene; the BamH I site sits inside the tetR gene. If a foreign gene is ligated into the BamH I site, the tetR gene is disrupted — the recombinant plasmid keeps ampR but loses tetR. You then plate the bacteria on two media: ampicillin alone selects all transformants; replica-plating onto tetracycline kills the recombinants while non-recombinants survive. The colonies that grew on amp but died on tet are the recombinants — and you go back to the amp plate to pick them.
Insertional inactivation works but is laborious. A faster alternative uses blue-white screening: the foreign DNA is inserted into the coding sequence of β-galactosidase. Non-recombinants produce functional β-galactosidase, which cleaves a chromogenic substrate to give blue colonies. Recombinants have the gene disrupted — no enzyme, no colour — so they appear white. You pick the white colonies. One plate, two colours, no replica-plating.
Vectors for plant and animal cells are more elaborate. The Ti plasmid of Agrobacterium tumefaciens, a soil bacterium that naturally infects dicots and transfers its T-DNA into the plant genome to cause crown gall tumours, has been disarmed and engineered into a cloning vector for plants. Retroviruses, which integrate their RNA-derived DNA into animal genomes, have likewise been disarmed and turned into vectors for animal-cell gene therapy. BACs (bacterial artificial chromosomes) and YACs (yeast artificial chromosomes) carry much larger inserts than plasmids and are essential for genomic libraries — NEET 2023 confirmed all three (BAC, YAC, pBR322) as vectors while flagging a "probe" as the non-vector option.
Competent hosts — getting DNA across the membrane
DNA is a large, negatively charged, hydrophilic molecule. It cannot diffuse across the lipid bilayer of a bacterial cell. Bacteria must first be made competent — chemically primed to take up DNA from their environment. The standard recipe: treat E. coli with chilled calcium chloride. The divalent Ca²⁺ neutralises the negative charges on DNA and on the bacterial cell wall, opening transient pores. Plasmid DNA is then added on ice, and the mixture is given a brief heat shock at 42 °C for about 90 seconds, then returned to ice. The thermal jolt drives the plasmid through the now-permeable membrane.
Three other delivery routes exist for situations where chemical transformation will not work. In microinjection, a glass micropipette punctures the nuclear membrane of an animal cell and injects the recombinant DNA directly into the nucleus — the workhorse method for making transgenic mice and fertilising IVF embryos with engineered genes. In biolistics (also called the gene gun method), microparticles of gold or tungsten are coated with DNA and fired at high velocity into plant cells; the inert metals do not chemically alter the cell. This is the go-to method for plant transformation when Agrobacterium is uncooperative. The fourth route — disarmed pathogen vectors — uses Ti plasmids for plants or retroviruses for animals to do the work of delivery using machinery the pathogen evolved over millions of years.
The host organism itself is usually E. coli for cloning bacterial proteins, Agrobacterium tumefaciens for plants, and Saccharomyces cerevisiae (yeast) for eukaryotic proteins that need post-translational modification. Yeast, being eukaryotic, can glycosylate proteins correctly — a property bacteria lack and a reason recombinant erythropoietin and hepatitis B vaccine are made in yeast.
Process 1 — isolation of the genetic material
The first process step in any rDNA workflow is to extract pure DNA from the source cell. DNA sits behind a cell wall (in plants, fungi, bacteria), a cell membrane, and — in eukaryotes — a nuclear envelope, woven through with histones and contaminating RNA, proteins, polysaccharides, and lipids. Each barrier and each contaminant must be addressed.
Cell-wall removal uses enzyme cocktails matched to the source: lysozyme for bacteria (it digests peptidoglycan), cellulase for plant cells (it digests cellulose), chitinase for fungi (it digests chitin). Once the wall is breached, mechanical disruption or detergents release the cellular contents. The released mix contains DNA tangled with RNA, histones, and other proteins.
Protease digests away the histones and other proteins. Ribonuclease (RNase) breaks down the RNA. Polysaccharides and lipids are removed by appropriate solvent steps. What remains in the aqueous phase is purified DNA, dissolved in salt buffer.
The dramatic finale: chilled ethanol. DNA is insoluble in cold ethanol; adding two volumes of cold ethanol to the aqueous DNA solution causes the DNA to precipitate as fine, white, thread-like fibres. These threads can be lifted out on a glass rod — a step called spooling. The spooled DNA, dissolved in TE buffer, is now pure enough for restriction digestion.
Process 2 — cutting at specific locations & gel electrophoresis
Once the DNA is pure, both source DNA and vector DNA are incubated with the chosen restriction enzyme under its optimal conditions (specific buffer, pH, and temperature for that enzyme). The digestion produces fragments — many for genomic source DNA, two pieces for a vector cut at a single site.
To check whether the digestion has worked and to separate fragments by size, you use gel electrophoresis. DNA is negatively charged because of the phosphate groups in its backbone — at physiological pH every phosphate carries a single negative charge. Place the DNA in a gel between two electrodes, switch on the current, and the fragments migrate towards the anode (positive electrode). The matrix is usually agarose, a natural polysaccharide extracted from seaweed. Agarose forms a porous mesh; smaller DNA fragments thread through the pores faster, larger fragments are held back. Result: the smaller the fragment, the farther it moves.
DNA on the gel is invisible to the eye in visible light. To see the bands, the gel is stained with ethidium bromide (EtBr), a flat aromatic molecule that intercalates between DNA base pairs. EtBr fluoresces under ultraviolet light, producing bright orange bands wherever DNA has migrated. Photograph the gel under UV, and you have a permanent record of the restriction pattern.
The next step is elution — physically cutting the desired band out of the gel and extracting the DNA from that gel slice. The eluted, purified DNA fragment is now ready to be ligated into the cut vector using DNA ligase, producing the recombinant DNA molecule.
Process 3 — amplification by PCR
PCR — Polymerase Chain Reaction — is the technique that lets you make a billion copies of a single DNA fragment in a few hours, inside a single test tube. Kary Mullis won the 1993 Nobel for inventing it. The trick: cycle the temperature so that the DNA denatures, primers anneal, and a heat-stable polymerase extends — over and over, doubling the target each round.
You need four ingredients: the template DNA (the sample), two primers (short chemically-synthesised single-stranded oligonucleotides, each complementary to one of the flanking regions of the target), free deoxynucleotide triphosphates (dNTPs), and a thermostable DNA polymerase. The polymerase of choice is Taq DNA polymerase, isolated from the thermophilic bacterium Thermus aquaticus, which lives in hot springs. Taq survives the 94 °C denaturation step without unfolding — ordinary E. coli polymerase would denature within seconds.
The mathematics is simple and brutal: one cycle doubles the DNA. After n cycles you have 2n copies. After 30 cycles, that's just over a billion copies (2³⁰ ≈ 1.07 × 10⁹). After 32 cycles, around four billion. PCR is exponential amplification, which is why it has revolutionised molecular biology — every diagnostic test for HIV, COVID-19, tuberculosis, every prenatal genetic screen, every forensic match relies on it.
PCR has applications well beyond cloning. Gene amplification for sequencing or cloning, molecular diagnosis of pathogens (HIV, hepatitis, COVID), detection of gene mutations in cancers and inherited diseases, and DNA fingerprinting all use PCR. What PCR cannot do is protein work — NEET 2021 specifically tested this with "purification of isolated protein" as a non-application.
Process 4 — insertion of recombinant DNA into the host
With the recombinant DNA built — vector + insert, ligated and verified — it must be transferred into a recipient cell. Bacterial cells are made competent by the CaCl₂ + heat-shock method described earlier. Plant cells are bombarded with DNA-coated gold or tungsten microparticles (biolistics) or co-cultivated with disarmed Agrobacterium. Animal cells receive their cargo by microinjection or by infection with engineered retroviruses.
Just transferring DNA is not enough — you need to know which cells actually took it up. Out of millions of bacteria exposed to plasmid, only a tiny fraction become transformed. This is where the selectable marker on the vector earns its keep. If the vector carries the ampR gene, plate the bacteria on agar containing ampicillin. Untransformed cells die. The colonies that grow are guaranteed to carry the plasmid — and the gene of interest along with it.
The full sequence of recombinant DNA technology, as NEET 2023 tested it: isolation of source DNA → cutting with restriction endonuclease to obtain the desired fragment → amplification by PCR (often inserted here to multiply the gene before cloning) → ligation into a vector with DNA ligase → transformation into the host → selection of transformants on selective medium → culture at scale in a bioreactor → extraction and downstream processing of the product.
Process 5 — obtaining the foreign gene product
Once the cell is engineered, the goal is to extract the protein the foreign gene encodes — the recombinant protein. Small cultures suffice for the laboratory, but commercial-scale production of insulin, growth hormone, or vaccines requires far larger volumes. The solution is the bioreactor — a vessel typically holding 100 to 1000 litres of culture, providing precisely controlled conditions for the engineered cell to grow and synthesise its product.
A bioreactor is not just a big flask. It has an agitator system (the stirrer that mixes the culture), an oxygen delivery system (sparger or sterile air supply), a foam control system, temperature control, pH control, and sampling ports for periodic withdrawal of culture. Continuous-culture mode keeps cells in their exponential growth phase by adding fresh medium on one side while drawing off spent medium on the other — biomass stays at maximum, productivity stays high.
NEET 2019 tested this directly: which equipment is essentially required for growing microbes on a large scale, for industrial production of enzymes? The answer is the bioreactor — not a BOD incubator (which is for small-scale microbial work), not a sludge digester (sewage treatment), not an industrial oven. Bioreactors are where biotechnology stops being a laboratory hobby and becomes industry.
Process 6 — downstream processing
After the bioreactor has done its work, the product — the recombinant protein floating in a litre of culture broth contaminated with cell debris, lipids, salts, and host metabolites — has to be processed before it can be sold or injected into a patient. This is downstream processing — the collective name for everything that happens between the bioreactor and the marketable product.
Four steps make up downstream processing. Separation removes cells and large debris by centrifugation or filtration. Purification isolates the target protein from contaminating proteins by techniques such as ion-exchange chromatography, affinity chromatography, gel-filtration chromatography, or HPLC. Quality assurance (QA) and clinical trials verify that the product is biochemically pure, biologically active, and safe; for drugs this involves rigorous regulatory trials. Formulation packages the purified product with appropriate preservatives, stabilisers, and excipients in the final dosage form.
Downstream processing is product-specific. The exact sequence of unit operations differs between a small peptide (like insulin), a glycosylated antibody, and a live attenuated vaccine. NEET 2017 tested the definition directly: "separation and purification of expressed protein before marketing" is downstream processing.
Separation
Centrifugation, filtration, cell lysis. Removes cells and large debris from the broth.
Purification
Chromatography (ion-exchange, affinity, gel-filtration), HPLC. Isolates the recombinant protein.
QA & clinical trials
Identity, purity, potency, sterility tests. For drugs, full clinical trial sequence under regulatory oversight.
Formulation
Preservatives, stabilisers, buffers, vials or tablets. Final dosage form ready for distribution.
NEET PYQ Snapshot
Real NEET previous-year questions — solve before moving on.
Main steps in the formation of Recombinant DNA are given below. Arrange these steps in a correct sequence. A: Insertion of recombinant DNA into the host cell · B: Cutting of DNA at specific location by restriction enzyme · C: Isolation of desired DNA fragment · D: Amplification of gene of interest using PCR.
Answer: (2) B, C, D, AWhy: The correct order is cut (B) → isolate the fragment (C) → amplify by PCR (D) → insert into host (A). Note that "isolation of desired DNA fragment" comes after the cut, not before.
In the following palindromic base sequences of DNA, which one can be cut easily by a particular restriction enzyme?
Answer: (1) 5′GAATTC3′ ; 3′CTTAAG5′Why: Only option 1 is a true palindrome — read 5'→3' on either strand and you get GAATTC. This is the EcoRI recognition sequence. The other three differ between strands when read 5'→3' and are not palindromes.
Plasmid pBR322 has Pst I restriction enzyme site within gene ampR that confers ampicillin resistance. If this enzyme is used for inserting a gene for β-galactoside production and the recombinant plasmid is inserted in an E. coli strain:
Answer: (2) Cannot confer ampicillin resistanceWhy: Inserting the β-galactoside gene at the Pst I site disrupts the ampR gene by insertional inactivation. The recombinant E. coli loses ampicillin resistance. It still produces β-galactoside, but ampR is gone.
The specific palindromic sequence which is recognised by EcoRI is:
Answer: (4) 5'-GAATTC-3', 3'-CTTAAG-5'Why: EcoRI's palindrome is GAATTC. Option 2 looks similar but the orientation is wrong — the 5' to 3' sequence must be GAATTC on the top strand. Option 3 (GGATCC) is BamH I, not EcoRI.
The Taq polymerase enzyme is obtained from:
Answer: (4) Thermus aquaticusWhy: Taq is named for its source — Thermus aquaticus, a thermophilic bacterium from hot springs. Its thermostability lets it survive the 94 °C denaturation step of PCR every cycle without unfolding.
Expert FAQs
Questions NEET has asked from this chapter, answered straight.
What is the recognition sequence of EcoRI?
From which organism is Taq DNA polymerase obtained?
What are the three steps of PCR, and at what temperatures?
What is a selectable marker, and which is the most common one in pBR322?
Why is chilled ethanol added during DNA isolation?
What does origin of replication (ori) do in a cloning vector?
What stains DNA in gel electrophoresis, and why is UV needed?
What is a palindromic DNA sequence?
What is downstream processing?
What metals are used in the gene gun (biolistics) method?
Go Deeper
Drill into the subtopics that NEET asks most often.