Difference between revisions of "Molecular biology"

From Real Vegan Cheese
Jump to navigation Jump to search
Line 29: Line 29:


=== Genetic variants & health effects ===
=== Genetic variants & health effects ===
A recent review of milk protein nomenclature (Farrell et al., 2004) indicated 8 αs1-CN (A, B, C, D, E, F, G, H) [...] variants in the Bos genus. [http://www.ncbi.nlm.nih.gov/pubmed/19841193]


α-s1 Casein is the most prevalent form of casein in bovine milk. It has been reported to exhibit antioxidant and radical scavenging properties.1 It has also been reported to be involved in the transport of and casein from the endoplasmic reticulum to the Golgi apparatus. [Chanat, E., J. Cell Sci., 112, 3399-3412 (1999)] [http://www.sigmaaldrich.com/life-science/metabolomics/enzyme-explorer/enzyme-reagents/casein.html]
α-s1 Casein is the most prevalent form of casein in bovine milk. It has been reported to exhibit antioxidant and radical scavenging properties.1 It has also been reported to be involved in the transport of and casein from the endoplasmic reticulum to the Golgi apparatus. [Chanat, E., J. Cell Sci., 112, 3399-3412 (1999)] [http://www.sigmaaldrich.com/life-science/metabolomics/enzyme-explorer/enzyme-reagents/casein.html]

Revision as of 04:44, 23 April 2014

Intro

Kappa casein is the protein responsible for milk coagulation into cheese when it is cleaved by rennet (chymosin). If we could express this protein in yeast, we could essentially make vegan cheese.

A 2005 paper showed that the C-terminal part of the human kappa casein protein can be overexpressed in S. cerevisiae (baker’s yeast) and Pichia pastoris, and they were able to get it secreted in sufficient quantity that they could run the culture medium on a gel, and see a clear band for the protein. (“This macropeptide has various biological activities and is used as a functional food ingredient as well as a pharmaceutical compound.“ - valuable stuff, not just for vegan cheese!)

Our first task would be to replicate this result - ideally in the BioBricks format. Next would be to express and secrete the full-length protein. If we can get it to express and secrete in sufficient quantity, we should be able to show that you get a single band on the gel for the intact protein, but two bands after cleaving the protein with chymosin.

We can have some people working simultaneously on trying to express the other casein proteins (alpha-s1, alpha-s2, and beta). But since these are hydrophobic, we may not be able to express and secrete them without kappa-casein to hold them in suspension in a casein micelle. We can also attempt to modify the yeast lipid biosynthesis pathways to produce milk fats.

Essential reading

As recommended by Patrik:

Proteins

Patrik D says: Turns out casein proteins are not quite as insoluble as I had feared, especially if you keep the Ca concentration low enough. They've even been explored as chaperones to get other problematic proteins to fold better. There's also some papers on alpha- and beta-casein expression in E. coli and yeast, and on reconstitution of casein micelles from beta and kappa casein (beta casein is the major component in human milk).

Alpha-s1 casein (CSN1S1)

peptide diagram

Main component of cow's milk, along with beta casein.

Genetic variants & health effects

A recent review of milk protein nomenclature (Farrell et al., 2004) indicated 8 αs1-CN (A, B, C, D, E, F, G, H) [...] variants in the Bos genus. [1]

α-s1 Casein is the most prevalent form of casein in bovine milk. It has been reported to exhibit antioxidant and radical scavenging properties.1 It has also been reported to be involved in the transport of and casein from the endoplasmic reticulum to the Golgi apparatus. [Chanat, E., J. Cell Sci., 112, 3399-3412 (1999)] [2]

Sequences

Alpha-s2 casein (CSN1S2)

peptide diagram

Genetic variants & health effects

Proteolytic fragments of α-s2 Casein have been shown to exhibit antibacterial activity. Specifically the 39 amino acid casocidin-1 peptide fragment has been shown to inhibit E. coli and Staph. carnosis growth. [Zucht, H. D., FEBS Lett., 371, 185-188 (1995)] [3]

Sequences

Beta casein (CSN2)

peptide diagram

Main component of cow's and human milk.

Genetic variants & health effects

β-Casein and its fragments have been implicated in a number of biological functions. The casoparan peptide has been reported to activate macrophage phagocytosis and peroxide release. Casohypotensin and casoparan may be involved in bradykinin regulation. Casohypotensin has also been shown to be a strong inhibitor of endo-oligopeptidase A, a thiol-activated protease capable of degrading bradykinin and neurotensin, and hydrolyzing enkephalin-containing peptides to produce enkephalins. β-Caseins are also a source of casomorphin peptides which exhibit opioid activity binding to opioid receptors. Casomorphins may be the hydrolysis product of dipeptidyl peptidase IV. [Miyamoto, Y., et al., Am. J. Physiol. Renal. Physiol., 252, 670-F677 (1987)] [4]

"Populations, which consume milk containing high levels of β-casein A2 variant, have a lower incidence of cardiovascular disease and type 1 diabetes. Furthermore, consumption of milk with the A2 variant may be associated with less severe symptoms of autism and schizophrenia." [5] "Human milk, goat milk, sheep milk and other species’ milk contain beta-casein which is ‘A2 like’, because they have a proline at the equivalent position in their beta-casein chains." [6].

Sequences

Kappa casein (CSN3)

peptide diagram κ-Casein's orientation on the surface of the casein micelle functions as an interface between the hydrophobic interior caseins and the aqueous environment. During clotting of milk, hydrolysis by chymosin or rennin releases the water soluble fragment, para-k-casein and the hydrophobic caseinomacropeptide. [7]

Genetic variants & health effects

  • Casoxins A, B and C have opioid antagonist activity. Casoxin C binds to the complement C3a receptors. Casoplatelin inhibits platelet aggregation. [Lawrence K., Creamer, L. K., et al., Journal of Dairy Science, 81, 3004-3012 (1998)] [8]
  • The protein polymorphism at position 148-150 led to 2 different ACE inhibiting peptides. The peptide Val-Ser-Pro derived from CSN3*F1 showed the highest IC50 value (21.8) and is also known from maize (Miyoshi et al., 1991). The peptide Ala-Ser-Pro at the same position of the amino acid sequence but derived from the alleles CSN3*B and CSN3*C is not known for ACE inhibitory effects yet and showed an IC50 value of 242.3.
The genetic variation at amino acid position 96–99 produced 2 further ACE inhibitory peptides. In CSN3*C, the peptide Ala-His-His-Pro (IC50 = 847.6) was identified and at the same position the peptide Ala-Cys-His-Pro (IC50 = 360.7) was identified in CSN3*G2. These 2 peptides are not known for ACE inhibitory effects yet.
The dominant genetic variant of CSN3 in German cattle breeds is CSN3*A, whereas the B allele shows a high frequency in German Brown (0.479; Erhardt, 1989). Compared with CSN3*A, the B allele shows association with higher milk protein content and positive effects on coagulation properties and cheese yield (NgKwai-Hang and Grosclaude, 1992). In addition to these positive attributes, the present study shows another reason for preferring milk with the κ-casein allele B, because of the appearance of a new ACE inhibitory peptide at position 148–150 of the protein. [9]
  • CSN3 B positively affected milk coagulation [10]

Caseinomacropeptide

Kim YJ, Oh YK, Kang W, Lee EY, Park S. Production of human caseinomacropeptide in recombinant Saccharomyces cerevisiae and Pichia pastoris. J Ind Microbiol Biotechnol. 2005 Sep;32(9):402-8.

“Caseinomacropeptide is a polypeptide of 64 amino acid residues (106-169) derived from the C-terminal part of the mammalian milk k-casein. This macropeptide has various biological activities and is used as a functional food ingredient as well as a pharmaceutical compound. The gene encoding the human caseinomacropeptide (hCMP) was synthesized and expressed with an alpha-factor secretion signal in the two yeast strains, Saccharomyces cerevisiae and Pichia pastoris. The complete polypeptide of the recombinant hCMP was produced and secreted in a culture medium by both the strains, but the highest production was observed in S. cerevisiae with a galactose-inducible promoter. In a fed-batch bioreactor culture, 2.5 g/l of the recombinant hCMP was obtained from the S. cerevisiae at 97 h.” “Escherichia coli XL1-Blue (Stratagene, USA) was used for cloning and propagating genes. Host strains for recombinant hCMP were S. cerevisiae 2805 (his-, ura-) [...] For S. cerevisiae, pYIGP (containing a constitutive GAP promoter) or pYEGa (containing a galactose-inducible GAL promoter) was used.”

Sequence

Bovine kappa-casein is 190 AA of which the first 21 AA form a secretion signal:

MMKSFFLVVT ILALTLPFLG AQEQNQEQPI RCEKDERFFS DKIAKYIPIQ YVLSRYPSYG 
LNYYQQKPVA LINNQFLPYP YYAKPAAVRS PAQILQWQVL SNTVPAKSCQ AQPTTMARHP 
HPHLSFMAIP PKKNQDKTEI PTINTIASGE PTSTPTTEAV ESTVATLEDS PEVIESPPEI 
NTVQVTSTAV 


Post-translational modification

According to the Uniprot record, the post-translation modifications include:

  • Glycosylation of six residues, all of them Threonine ((O-GalNAc)
  • Phosphorylation of three residues, all of them Serine
  • One disulphide bond between two cysteine residues
  • One "Pyrrolidone carboxylic acid" modification to a glutamine (?)

We should investigate if any of this is important for micelle formation or micellar flocculation.

Quote from University of Illinois' Milk Composition & Synthesis Resource Library:

Phosphorylation is a ubiquitous biological process. It is involved in many regulatory mechanisms. It occurs after completion of the polypeptide chain. It can occur at Ser, Thr, Glu, Asp, His, Lys, and Tyr. Many enzymes will phosphorylate proteins with ATP, but they differ in specificity. These are called Kinases. Kinases are membrane-bound in the smooth endoplasmic reticulum and Golgi.

Caseins are phosphoproteins.

  • k-Casein is not extensively phosphorylated- probably because of competition between the glycosyltransferases and the kinases for sites on the protein.
  • Caseins are in multiphosphorylated forms - the same casein polypeptide chain may be phosphorylated at varying sites.
  • Probably due to the tremendously high rate of protein synthesis which exceeds the phosphorylating capacity of the membrane-bound kinases.

Casein Kinase

Phosphorylation is important for proteins.

Two species of casein kinase from lactating bovine mammary gland have been identified; a Ca2+- and CM-independent casein kinase and a Ca2+- and CM-dependent casein kinase. The Ca2+- and CM-independent casein kinase phosphorylates previously dephosphorylated αs1-, β- or κ-casein while the Ca2+- and CM-dependent casein kinase prefers previously dephosphorylated β- or κ-casein as substrates. Two activities are indicated by their substrate specificity, sensitivity to Ca2+ and CM, pH maxima, and differential solubilization by anionic detergents. The presence of a regulated casein kinase in the lactating mammary gland suggests that casein phosphorylation may be a regulator of micelle formation or secretion.[11]

Fam20C appears to be the Golgi casein kinase that phosphorylates secretory pathway proteins within S-x-E motifs. Fam20C phosphorylates the caseins and several secreted proteins implicated in biomineralization [12]

[Casein kinase-1 and casein kinase-2] are physiologically unrelated to casein because they are mainly cytosolic and nuclear proteins that would be unlikely to encounter casein in the secretory pathway and have therefore been renamed protein kinase CK1 and protein kinase CK2. A physiological casein kinase activity has been characterized from highly enriched Golgi fractions of lactating mammary gland, liver, brain, and kidney and named Golgi-enriched fraction casein kinase (GEF-CK). The GEF-CK specifically recognizes the consensus S-x-E/pS (where x is any amino acid and E/pS can be Glu or phosphoserine). [...] A peptide substrate reproducing one of the sites phosphorylated in β-casein, KKIEKFQSE-EQQQ (β28-40), is selectively phosphorylated by GEF-CK. [...] Protein immunoblotting of immunoprecipitates from conditioned medium of U2OS cells overexpressing V5-tagged αs1-casein with FLAG-tagged Fam20C revealed a mobility shift in αs1-casein that was absent when αs1-casein was coexpressed with the inactive Fam20C D478A mutant [...] Thus, Fam20C appears to phosphorylate casein in vitro and in mammalian cells.

Beta-casein expressed in E. coli has been successfully phosphorylated by co-expression of casein kinase II: Expression and characterization of phosphorylated recombinant human beta-casein in Escherichia coli. We do not know if CK2 will also work correctly for alpha-casein (for which phosphorylation is important in micelle formation) nor for kappa-casein (where phosphorylation is probably less important), but there is probably a good chance that it does. More research needed.

S. cerevisiae does have Casein Kinase II, but it is somewhat different from bovine CK2. Here's a comparison:

Cow does not have a beta' so this last comparison is between yeast beta' and cow beta.

Is there any way we can gauge if the yeast CK2 is likely to work? If we co-express bovine CK2, how might that affect the yeast and the casein?

Genetic variants

Excellent review: Milk protein polymorphisms in cattle: Effect on animal breeding and human nutrition. [13]

"Genetic variants can result from SNP, as well as nucleotide (nt) deletions or insertions. A recent review of milk protein nomenclature (Farrell et al., 2004) indicated 8 αs1-CN (A, B, C, D, E, F, G, H), 4 αs2-CN (A, B, C, D), 12 β-CN (A1, A2, A3, B, C, D, E, F, G, H1, H2, I), 11 κ-CN (A, B, C, E, F1, F2, G1, G2, H, I, J), 11 β-LG (A, B, C, D, E, F, G, H, I, J, W), and 3 α-LA (A, B, C) variants in the Bos genus."

"Substantial variation in milk coagulation properties has been observed among dairy cows. [...] the cows were genotyped for major genetic variants in the αS1- (CSN1S1), β- (CSN2), and κ-casein (CSN3) genes, revealing distinct differences in variant frequencies among breeds. [...] CSN1S1 C, CSN2 B, and CSN3 B positively affected milk coagulation, whereas CSN2 A(2), in particular, had a negative effect" [14]

Host organism

Which organism should we use? We've mostly assumed we were going with S. cerevisiae but Pichia pastoris might be preferable. Here is a review comparing the two.

P. pastoris

Advantages over S. cerevisiae

  • Likely to have fewer problems with efficient protein secretion
  • It can grow to very high cell densities
  • It can grow on methanol which is cheap
    • This is kinda funny, since we can make methanol from methane (cow-farts) which means that our organism can eat the major greenhouse gas produced by cows.

"Pichia can grow to very high cell densities, and under ideal conditions can multiply to the point where the cell suspension is practically a paste. As the protein yield from expression in a microbe is roughly equal to the product of the protein produced per cell and the number of cells, this makes Pichia of great use when trying to produce large quantities of protein without expensive equipment" -- wikipedia quoting Cregg et al. "Expression in the yeast Pichia pastoris" 2009

Potential problems

  • It is unclear if it is safe for human consumption (meaning more purification needed)
  • Fewer resources available

S. cerevisiae

Advantages

  • Familiar as a human consumable
  • FDA GRAS status
    • We can probably get away with less or no purification and still have a food-safe product

Potention problems

"Although S. cerevisiae has a well-developed eukaryotic secretory pathway, it is not the most efficient yeast for high-level export of proteins to the extracellular medium."

"...often transport to the extracellular medium is inefficient, especially for complex mammalian proteins of high molecular weight."

-- Overview of Protein Expression in Saccharomyces cerevisiae by Strausberg and Strausberg

Strain selection

The Johns Hopkins' team used YPH500

Vectors

The Jons Hopkins iGEM team modified pRS400-series vectors, removing the MCS and replacing them with biobrick prefix and suffix.

Patrick says that galactose induction is the most common form of controlling heterologous expression. Common vectors are the pYES2 and pESC series vectors.

Secretion

Secretion of proteins will require a secretion signal (triggering processing through the ER, golgi and then exocytosis).

iGem Parts

Other notes

pESC contains a bidirectional promoter which may allow expression of both parts of the protein without cleavage being required. How?

Estimating amount of expressed protein

In the 2005 Kim et al. paper where they express caseinomacropeptide they describe using densitometry to estimate amount of expressed protein. They run the stained protein on an SDS-PAGE and use a gel imager to estimate amount of protein. There is an interesting article on how to use a normal scanner for densitometry here. We'd need at least of couple of calibration points which could be known amounts of a different protein. We'd have to know if the stain shows in equal amounts for the different proteins (per mole or per gram).

Clotting / Micellar flocculation

If the model proposed by Home is correct, then micelle formation will require alpha, beta, kappa-casein and colloidal calcium. The model also implies that the phosphoserine residues of both alpha and kappa-casein play an important role in micelle formation.

Synthetic casein micelles

Some researchers have created synthetic casein micelles. We should study their efforts:

Note: None of these articles have been downloaded to seafile yet.

Health effects of milk/cheese components

Cow's Milk Protein Allergy

Cow's milk allergies seem to be caused primarily by casein, and especially the alpha-s1 and beta caseins that are the major component of cow's milk. Switching to breastfeeding seems to resolve the problems, although in some instances the mother may need to avoid dairy products as well. This seems to suggest that we may be able to avoid allergic reactions by replacing the bovine caseins with the human version, or by humanizing specific epitopes on the bovine casein.

Cow's Milk Protein Intolerance

Non immunologically reactions against cow's milk protein are defined as cow’s milk protein intolerance (CMPI).

Human or bovine?

There seems to be a distinct sociological "ick" reaction against the idea of making cheese made from human casein proteins, even among those perfectly willing to consider genetically engineered cheese. (Also check out the wikipedia sections on Alternative uses for breast milk and Extraordinary consumption.) On one hand, there may be significant health benefits associated with humanizing some or part of the casein proteins (probiotic effects, avoiding cow's milk allergies, etc.) On the other hand, we don't know how human caseins would behave in cheese making, and they may make the concept of an engineered cheese even less acceptable to the general public.

Suggested compromise:

  • We do both the human and bovine kappa-casein at least. That should tell us whether the chymosin enzyme functions the same way on both, and whether the human kappa-casein can form micelles with bovine caseins.
  • We do a little write-up on the Ethical, Legal, Societal Issues surrounding engineering with human genes and making a consumable from them (human burger, anyone?) That'll give us extra points for the iGEM competition as well.

Another suggested compromise:

  • Narwhal casein!