Molecular biology
Intro
Kappa casein is the protein responsible for milk coagulation into cheese when it is cleaved by rennet (chymosin). If we could express this protein in yeast, we could essentially make vegan cheese.
A 2005 paper showed that the C-terminal part of the human kappa casein protein can be overexpressed in S. cerevisiae (baker’s yeast) and Pichia pastoris, and they were able to get it secreted in sufficient quantity that they could run the culture medium on a gel, and see a clear band for the protein. (“This macropeptide has various biological activities and is used as a functional food ingredient as well as a pharmaceutical compound.“ - valuable stuff, not just for vegan cheese!)
Our first task would be to replicate this result - ideally in the BioBricks format. Next would be to express and secrete the full-length protein. If we can get it to express and secrete in sufficient quantity, we should be able to show that you get a single band on the gel for the intact protein, but two bands after cleaving the protein with chymosin.
We can have some people working simultaneously on trying to express the other casein proteins (alpha-s1, alpha-s2, and beta). But since these are hydrophobic, we may not be able to express and secrete them without kappa-casein to hold them in suspension in a casein micelle. We can also attempt to modify the yeast lipid biosynthesis pathways to produce milk fats.
Essential reading
As recommended by Patrik:
- Rennet coagulation of milk
- Mechanisms of Coagulation: The principles, the science and what they mean to cheesemakers
- Production of human caseinomacropeptide in recombinant Saccharomyces cerevisiae and Pichia pastoris (on seafile)
Proteins
Patrik D says: Turns out casein proteins are not quite as insoluble as I had feared, especially if you keep the Ca concentration low enough. They've even been explored as chaperones to get other problematic proteins to fold better. There's also some papers on alpha- and beta-casein expression in E. coli and yeast, and on reconstitution of casein micelles from beta and kappa casein (beta casein is the major component in human milk).
Exon-intron structure of bovine milk proteins:
Organization of the casein genes in the human, mouse, rat, and cattle genome:
Alpha-s1 casein (CSN1S1)
Main component of cow's milk, along with beta casein.
Genetic variants & health effects
A recent review of milk protein nomenclature (Farrell et al., 2004) indicated 8 αs1-CN (A, B, C, D, E, F, G, H) [...] variants in the Bos genus. [1]
α-s1 Casein is the most prevalent form of casein in bovine milk. It has been reported to exhibit antioxidant and radical scavenging properties.1 It has also been reported to be involved in the transport of and casein from the endoplasmic reticulum to the Golgi apparatus. [Chanat, E., J. Cell Sci., 112, 3399-3412 (1999)] [2]
Sequences
Alpha-s2 casein (CSN1S2)
Genetic variants & health effects
A recent review of milk protein nomenclature (Farrell et al., 2004) indicated [...] 4 αs2-CN (A, B, C, D) [...] variants in the Bos genus. [3]
Proteolytic fragments of α-s2 Casein have been shown to exhibit antibacterial activity. Specifically the 39 amino acid casocidin-1 peptide fragment has been shown to inhibit E. coli and Staph. carnosis growth. [Zucht, H. D., FEBS Lett., 371, 185-188 (1995)] [4]
Sequences
Beta casein (CSN2)
Main component of cow's and human milk.
Genetic variants & health effects
A recent review of milk protein nomenclature (Farrell et al., 2004) indicated [...] 12 β-CN (A1, A2, A3, B, C, D, E, F, G, H1, H2, I) [...] variants in the Bos genus. [5]
β-Casein and its fragments have been implicated in a number of biological functions. The casoparan peptide has been reported to activate macrophage phagocytosis and peroxide release. Casohypotensin and casoparan may be involved in bradykinin regulation. Casohypotensin has also been shown to be a strong inhibitor of endo-oligopeptidase A, a thiol-activated protease capable of degrading bradykinin and neurotensin, and hydrolyzing enkephalin-containing peptides to produce enkephalins. β-Caseins are also a source of casomorphin peptides which exhibit opioid activity binding to opioid receptors. Casomorphins may be the hydrolysis product of dipeptidyl peptidase IV. [Miyamoto, Y., et al., Am. J. Physiol. Renal. Physiol., 252, 670-F677 (1987)] [6]
"Populations, which consume milk containing high levels of β-casein A2 variant, have a lower incidence of cardiovascular disease and type 1 diabetes. Furthermore, consumption of milk with the A2 variant may be associated with less severe symptoms of autism and schizophrenia." [7] "Human milk, goat milk, sheep milk and other species’ milk contain beta-casein which is ‘A2 like’, because they have a proline at the equivalent position in their beta-casein chains." [8].
Sequences
Kappa casein (CSN3)
κ-Casein's orientation on the surface of the casein micelle functions as an interface between the hydrophobic interior caseins and the aqueous environment. During clotting of milk, hydrolysis by chymosin or rennin releases the water soluble fragment, para-k-casein and the hydrophobic caseinomacropeptide. [9]
- Kappa-casein on wikipedia
- Bovine kappa-casein precursor gene CSN3
- kappa-casein genes in other mammals
Genetic variants & health effects
A recent review of milk protein nomenclature (Farrell et al., 2004) indicated [...] 11 κ-CN (A, B, C, E, F1, F2, G1, G2, H, I, J) [...] variants in the Bos genus. [10]
- Casoxins A, B and C have opioid antagonist activity. Casoxin C binds to the complement C3a receptors. Casoplatelin inhibits platelet aggregation. [Lawrence K., Creamer, L. K., et al., Journal of Dairy Science, 81, 3004-3012 (1998)] [11]
- The protein polymorphism at position 148-150 led to 2 different ACE inhibiting peptides. The peptide Val-Ser-Pro derived from CSN3*F1 showed the highest IC50 value (21.8) and is also known from maize (Miyoshi et al., 1991). The peptide Ala-Ser-Pro at the same position of the amino acid sequence but derived from the alleles CSN3*B and CSN3*C is not known for ACE inhibitory effects yet and showed an IC50 value of 242.3.
- The genetic variation at amino acid position 96–99 produced 2 further ACE inhibitory peptides. In CSN3*C, the peptide Ala-His-His-Pro (IC50 = 847.6) was identified and at the same position the peptide Ala-Cys-His-Pro (IC50 = 360.7) was identified in CSN3*G2. These 2 peptides are not known for ACE inhibitory effects yet.
- The dominant genetic variant of CSN3 in German cattle breeds is CSN3*A, whereas the B allele shows a high frequency in German Brown (0.479; Erhardt, 1989). Compared with CSN3*A, the B allele shows association with higher milk protein content and positive effects on coagulation properties and cheese yield (NgKwai-Hang and Grosclaude, 1992). In addition to these positive attributes, the present study shows another reason for preferring milk with the κ-casein allele B, because of the appearance of a new ACE inhibitory peptide at position 148–150 of the protein. [12]
- CSN3 B positively affected milk coagulation [13]
Caseinomacropeptide
Kim YJ, Oh YK, Kang W, Lee EY, Park S. Production of human caseinomacropeptide in recombinant Saccharomyces cerevisiae and Pichia pastoris. J Ind Microbiol Biotechnol. 2005 Sep;32(9):402-8.
“Caseinomacropeptide is a polypeptide of 64 amino acid residues (106-169) derived from the C-terminal part of the mammalian milk k-casein. This macropeptide has various biological activities and is used as a functional food ingredient as well as a pharmaceutical compound. The gene encoding the human caseinomacropeptide (hCMP) was synthesized and expressed with an alpha-factor secretion signal in the two yeast strains, Saccharomyces cerevisiae and Pichia pastoris. The complete polypeptide of the recombinant hCMP was produced and secreted in a culture medium by both the strains, but the highest production was observed in S. cerevisiae with a galactose-inducible promoter. In a fed-batch bioreactor culture, 2.5 g/l of the recombinant hCMP was obtained from the S. cerevisiae at 97 h.” “Escherichia coli XL1-Blue (Stratagene, USA) was used for cloning and propagating genes. Host strains for recombinant hCMP were S. cerevisiae 2805 (his-, ura-) [...] For S. cerevisiae, pYIGP (containing a constitutive GAP promoter) or pYEGa (containing a galactose-inducible GAL promoter) was used.”
Sequence
Bovine kappa-casein is 190 AA of which the first 21 AA form a secretion signal:
MMKSFFLVVT ILALTLPFLG AQEQNQEQPI RCEKDERFFS DKIAKYIPIQ YVLSRYPSYG LNYYQQKPVA LINNQFLPYP YYAKPAAVRS PAQILQWQVL SNTVPAKSCQ AQPTTMARHP HPHLSFMAIP PKKNQDKTEI PTINTIASGE PTSTPTTEAV ESTVATLEDS PEVIESPPEI NTVQVTSTAV
Post-translational modification
According to the Uniprot record, the post-translation modifications include:
- Glycosylation of six residues, all of them Threonine ((O-GalNAc)
- Phosphorylation of three residues, all of them Serine
- One disulphide bond between two cysteine residues
- One "Pyrrolidone carboxylic acid" modification to a glutamine (?)
We should investigate if any of this is important for micelle formation or micellar flocculation.
Quote from University of Illinois' Milk Composition & Synthesis Resource Library:
Phosphorylation is a ubiquitous biological process. It is involved in many regulatory mechanisms. It occurs after completion of the polypeptide chain. It can occur at Ser, Thr, Glu, Asp, His, Lys, and Tyr. Many enzymes will phosphorylate proteins with ATP, but they differ in specificity. These are called Kinases. Kinases are membrane-bound in the smooth endoplasmic reticulum and Golgi.
Caseins are phosphoproteins.
- k-Casein is not extensively phosphorylated- probably because of competition between the glycosyltransferases and the kinases for sites on the protein.
- Caseins are in multiphosphorylated forms - the same casein polypeptide chain may be phosphorylated at varying sites.
- Probably due to the tremendously high rate of protein synthesis which exceeds the phosphorylating capacity of the membrane-bound kinases.
Casein Kinase
Phosphorylation is important for proteins.
Two species of casein kinase from lactating bovine mammary gland have been identified; a Ca2+- and CM-independent casein kinase and a Ca2+- and CM-dependent casein kinase. The Ca2+- and CM-independent casein kinase phosphorylates previously dephosphorylated αs1-, β- or κ-casein while the Ca2+- and CM-dependent casein kinase prefers previously dephosphorylated β- or κ-casein as substrates. Two activities are indicated by their substrate specificity, sensitivity to Ca2+ and CM, pH maxima, and differential solubilization by anionic detergents. The presence of a regulated casein kinase in the lactating mammary gland suggests that casein phosphorylation may be a regulator of micelle formation or secretion.[14]
Fam20C appears to be the Golgi casein kinase that phosphorylates secretory pathway proteins within S-x-E motifs. Fam20C phosphorylates the caseins and several secreted proteins implicated in biomineralization
[Casein kinase-1 and casein kinase-2] are physiologically unrelated to casein because they are mainly cytosolic and nuclear proteins that would be unlikely to encounter casein in the secretory pathway and have therefore been renamed protein kinase CK1 and protein kinase CK2. A physiological casein kinase activity has been characterized from highly enriched Golgi fractions of lactating mammary gland, liver, brain, and kidney and named Golgi-enriched fraction casein kinase (GEF-CK). The GEF-CK specifically recognizes the consensus S-x-E/pS (where x is any amino acid and E/pS can be Glu or phosphoserine). [...] A peptide substrate reproducing one of the sites phosphorylated in β-casein, KKIEKFQSE-EQQQ (β28-40), is selectively phosphorylated by GEF-CK. [...] Protein immunoblotting of immunoprecipitates from conditioned medium of U2OS cells overexpressing V5-tagged αs1-casein with FLAG-tagged Fam20C revealed a mobility shift in αs1-casein that was absent when αs1-casein was coexpressed with the inactive Fam20C D478A mutant [...] Thus, Fam20C appears to phosphorylate casein in vitro and in mammalian cells. -- source
Woah. That changes things. The uniprot record for alpha-s1 shows that all phosphorylated residues except one follow the S-x-E/pS pattern. If this is true for the other casein proteins, then this Fam20C is likely what we want, and not casein kinase II. (Juul (talk))
Beta-casein expressed in E. coli has been successfully phosphorylated by co-expression of casein kinase II: Expression and characterization of phosphorylated recombinant human beta-casein in Escherichia coli. We do not know if CK2 will also work correctly for alpha-casein (for which phosphorylation is important in micelle formation) nor for kappa-casein (where phosphorylation is probably less important), but there is probably a good chance that it does. More research needed.
S. cerevisiae does have Casein Kinase II, but it is somewhat different from bovine CK2. Here's a comparison:
- Casein kinase II subunit alpha: yeast vs. cow: ~49% identity
- Casein kinase II subunit alpha': yeast vs. cow: ~54% identity
- Casein kinase II subunit beta: yeast vs. cow: ~32% identity
- Casein kinase II subunit beta': yeast vs. cow: ~38% identity
- Cow does not have a beta' so this last comparison is between yeast beta' and cow beta.
Is there any way we can gauge if the yeast CK2 is likely to work? If we co-express bovine CK2, how might that affect the yeast and the casein?
Genetic variants
Excellent review: Milk protein polymorphisms in cattle: Effect on animal breeding and human nutrition. [15]
"Genetic variants can result from SNP, as well as nucleotide (nt) deletions or insertions. A recent review of milk protein nomenclature (Farrell et al., 2004) indicated 8 αs1-CN (A, B, C, D, E, F, G, H), 4 αs2-CN (A, B, C, D), 12 β-CN (A1, A2, A3, B, C, D, E, F, G, H1, H2, I), 11 κ-CN (A, B, C, E, F1, F2, G1, G2, H, I, J), 11 β-LG (A, B, C, D, E, F, G, H, I, J, W), and 3 α-LA (A, B, C) variants in the Bos genus."
"Substantial variation in milk coagulation properties has been observed among dairy cows. [...] the cows were genotyped for major genetic variants in the αS1- (CSN1S1), β- (CSN2), and κ-casein (CSN3) genes, revealing distinct differences in variant frequencies among breeds. [...] CSN1S1 C, CSN2 B, and CSN3 B positively affected milk coagulation, whereas CSN2 A(2), in particular, had a negative effect" [16]
Host organism
Which organism should we use? We've mostly assumed we were going with S. cerevisiae but Pichia pastoris might be preferable. Here is a review comparing the two.
P. pastoris
Advantages over S. cerevisiae
- Likely to have fewer problems with efficient protein secretion
- It can grow to very high cell densities
- It can grow on methanol which is cheap
- This is kinda funny, since we can make methanol from methane (cow-farts) which means that our organism can eat the major greenhouse gas produced by cows.
"Pichia can grow to very high cell densities, and under ideal conditions can multiply to the point where the cell suspension is practically a paste. As the protein yield from expression in a microbe is roughly equal to the product of the protein produced per cell and the number of cells, this makes Pichia of great use when trying to produce large quantities of protein without expensive equipment" -- wikipedia quoting Cregg et al. "Expression in the yeast Pichia pastoris" 2009
Potential problems
- It is unclear if it is safe for human consumption (meaning more purification needed)
- Fewer resources available
S. cerevisiae
Advantages
- Familiar as a human consumable
- FDA GRAS status
- We can probably get away with less or no purification and still have a food-safe product
Potention problems
"Although S. cerevisiae has a well-developed eukaryotic secretory pathway, it is not the most efficient yeast for high-level export of proteins to the extracellular medium."
"...often transport to the extracellular medium is inefficient, especially for complex mammalian proteins of high molecular weight."
-- Overview of Protein Expression in Saccharomyces cerevisiae by Strausberg and Strausberg
Strain selection
The Johns Hopkins' team used YPH500
- List of commonly used lab strains
- A comparison of four strains (article access?)
Vectors
The Jons Hopkins iGEM team modified pRS400-series vectors, removing the MCS and replacing them with biobrick prefix and suffix.
Patrick says that galactose induction is the most common form of controlling heterologous expression. Common vectors are the pYES2 and pESC series vectors.
Secretion
Secretion of proteins will require a secretion signal (triggering processing through the ER, golgi and then exocytosis).
iGem Parts
- VitaYeast project, by Johns Hopkins’ 2011 iGEM team
- Characterized promoters and terminators
- Yeast parts in the BioBricks parts registry
Other notes
pESC contains a bidirectional promoter which may allow expression of both parts of the protein without cleavage being required. How?
Estimating amount of expressed protein
In the 2005 Kim et al. paper where they express caseinomacropeptide they describe using densitometry to estimate amount of expressed protein. They run the stained protein on an SDS-PAGE and use a gel imager to estimate amount of protein. There is an interesting article on how to use a normal scanner for densitometry here. We'd need at least of couple of calibration points which could be known amounts of a different protein. We'd have to know if the stain shows in equal amounts for the different proteins (per mole or per gram).
Clotting / Micellar flocculation
- Kinetics of milk coagulation: II. Kinetics of the secondary phase: micelle flocculation. (Carlson et al, 1987)
- Casein Interactions: Casting Light on the Black Boxes, the Structure in Dairy Products (Home, 1998, review)
- Studies of casein micelle structure: the past and the present (Qi, 2007, review)
If the model proposed by Home is correct, then micelle formation will require alpha, beta, kappa-casein and colloidal calcium. The model also implies that the phosphoserine residues of both alpha and kappa-casein play an important role in micelle formation.
Synthetic casein micelles
Some researchers have created synthetic casein micelles. We should study their efforts:
- Sub-structure of synthetic casein micelles (Knoop et al, 1979)
- A phosphate-induced sub-micelle-micelle equilibrium in reconstituted casein micelle systems (Slattery, 1979)
- Re-assembled casein micelles and casein nanoparticles as nano-vehicles for ω-3 polyunsaturated fatty acids (Zimet et al, 2011)
Note: None of these articles have been downloaded to seafile yet.
Health effects of milk/cheese components
Cow's Milk Protein Allergy
Cow's milk allergies seem to be caused primarily by casein, and especially the alpha-s1 and beta caseins that are the major component of cow's milk. Switching to breastfeeding seems to resolve the problems, although in some instances the mother may need to avoid dairy products as well. This seems to suggest that we may be able to avoid allergic reactions by replacing the bovine caseins with the human version, or by humanizing specific epitopes on the bovine casein.
Cow's Milk Protein Intolerance
Non immunologically reactions against cow's milk protein are defined as cow’s milk protein intolerance (CMPI).
Human or bovine?
There seems to be a distinct sociological "ick" reaction against the idea of making cheese made from human casein proteins, even among those perfectly willing to consider genetically engineered cheese. (Also check out the wikipedia sections on Alternative uses for breast milk and Extraordinary consumption.) On one hand, there may be significant health benefits associated with humanizing some or part of the casein proteins (probiotic effects, avoiding cow's milk allergies, etc.) On the other hand, we don't know how human caseins would behave in cheese making, and they may make the concept of an engineered cheese even less acceptable to the general public.
Suggested compromise:
- We do both the human and bovine kappa-casein at least. That should tell us whether the chymosin enzyme functions the same way on both, and whether the human kappa-casein can form micelles with bovine caseins.
- We do a little write-up on the Ethical, Legal, Societal Issues surrounding engineering with human genes and making a consumable from them (human burger, anyone?) That'll give us extra points for the iGEM competition as well.
Another suggested compromise:
- Narwhal casein!