an autodidact meets a dilettante…

‘Rise above yourself and grasp the world’ Archimedes – attribution

Archive for the ‘genetics’ Category

a DNA dialogue 6: Okazaki fragments, as promised

leave a comment »

Canto: Okay, so first off, why are Okazaki fragments so called?

Jacinta: Well as anyone would guess, they’re named after someone Japanese, in this case two, the husband and wife team Reiji and Tsuneko Okazaki, who discovered these short, discontinuously synthesised stretches of DNA nucleotides in the 1960s.

Canto: Yes their story is intriguing – tragic but also inspiring. Reiji, the husband, was born in Hiroshima and died in 1975 from leukaemia, related to the 1945 A-bomb. He was only 44. Tsuneko Okazaki continued their research and went on to make many other contributions to genetics and molecular biology, as a professor, teacher, mentor and director of scientific institutes. Her achievements would surely make her a Nobel candidate, and she’s still alive, so maybe…

Jacinta: Now the key to Okazaki fragments is this lagging strand. Its directionality means that the DNA primase, followed by the DNA polymerase, must work ‘backwards’, away from the replication fork, to add nucleotides. This means that that they have to have periodic breaks – but I’m not sure exactly why – in creating this lagging strand. So the entire replication process is described as semi-discontinuous because of this fundamental difference between the continuously created leading strand and the stop-start ‘fragmentary’ (at least briefly) lagging strand.

Canto: But we need to know why this ‘backward’ movement has to be stop-start, and I’d also like to know more about this primase and polymerase, thank you.  

Jacinta: Well the Okazakis and their team discovered this semi-discontinuous replication process in studying the replication of good old Escherichia coli, the go-to research bacterium, and it was a surprise at the time. Now, I’m looking at the explanation for this necessarily discontinuous process in Wikipedia, and I confess I don’t really understand it, but I’ll give it a go. Apparently the Okazakis ‘suggested that there is no found mechanism that showed continuous replication in the 3′ to 5′ direction, only 5′ to 3′ using DNA polymerase, a replication enzyme’, to quote from Wikipedia. So they were rather cleverly hypothesising that there must be another mechanism for the 3′ to 5′ lagging strand, which must be discontinuous. 

Canto: And another way of saying that, is that the process must be fragmentary. And they used experiments to test this hypothesis? 

Jacinta: Correct, and I won’t go into the process of testing, as if I could. It involved pulse-labelling. Don’t ask, but it has something to do with radioactivity. Anyway, the test was successful, and was supported by the discovery shortly afterwards of polynucleotide ligase, the enzyme that stitches these fragments together. Now, you want to know more about primase, polymerase, and now ligase no doubt. So here’s a bit of the low-down. DNA primase is, to confuse you, an RNA polymerase, which synthesises RNA from a DNA template. It’s a catalyst in the synthesis of a short RNA segment, known as a primer. It’s extremely important in DNA replication, because no polymerase (and you know how polymerase keeps getting associated with primase) can make anything happen without an RNA (or DNA) primer.

Canto: But why? This is getting so complicated.

Jacinta: I assure you, we’ve barely scratched the surface….

Canto: Well, Socrates was right – there’s an essential wisdom in being aware of how ignorant you are. We’ll battle on in our small way.    

 

 

 

 

Written by stewart henderson

February 27, 2020 at 5:48 pm

a DNA dialogue 5: a first look at DNA replication

leave a comment »

schematic of ‘replisome’ structures involved in DNA replication

 

Jacinta: So let’s scratch some more of the surface of the subject of DNA and genetics. A useful datum to remember, the human genome consists of more than 3 billion DNA bases. We were talking last time about pyrimidines and purines, and base pairs. Let’s talk now about how DNA unzips.

Canto: Well the base pairs are connected by hydrogen bonds, and the two DNA strands, the backbones of the molecule, run in opposite, or anti-parallel, directions, from the 5′ (five prime) end to the 3′ (three prime) end. So, while one strand runs from 5′ to 3′ (the sense strand), the other runs 3′ to 5′ (the antisense strand). 

Jacinta: Right, so what we’re talking about here is DNA replication, which involves breaking those hydrogen bonds, among other things. 

Canto: Yes, so that backbone, or double backbone whatever, where the strands run anti-parallel, is a phosphate-sugar construction, and the sugar is deoxyribose, a five-carbon sugar. This sugar is oriented in one strand from 5′ to 3′, that’s to say the 5′ carbon connects to a phosphate group at one end, while the 3′ carbon connects to a phosphate group at the other end, while in the other strand the sugar is oriented in the opposite direction. 

Jacinta: Yes, and this is essential for replication. The protein called DNA polymerase should be introduced here, with thanks to Khan Academy. It adds nucleotides to the 3′ end to grow a DNA strand…

Canto: Yes, but I think that’s part of the zipping process rather than the unzipping… it’s all very complicated but we need to keep working on it…

Jacinta: Yes, according to Khan Academy, the first step in this replication is to unwind the tightly wound double helix, which occurs through the action of an enzyme called topoisomerase. We could probably do a heap of posts on each of these enzymes, and then some. Anyway, to over-simplify, topoisomerase acts on the DNA such that the hydrogen bonds between the nitrogenous bases can be broken by another enzyme called helicase.

Canto: And that’s when we get to add nucleotides. So we have the two split strands, one of which is a 3′ strand, now called the leading strand, the other a 5′ strand, called the lagging strand. Don’t ask.

Jacinta: The leading strand is the one you add nucleotides to, creating another strand going in the 5′ to 3′ direction. This apparently requires an RNA primer. Don’t ask. DNA primase provides this RNA primer, and once this has occurred, DNA polymerase can start adding nucleotides to the 3′ end, following the open zipper, so to speak.

Canto: The lagging strand is a bit more complex though, as you apparently can’t add nucleotides in that other direction, the 5′ direction, not with any polymerase no how. So, according to Khan, ‘biology’ adds primers (don’t ask) made up of several RNA nucleotides.

Jacinta: Again, according to Khan, the DNA primase, which works along the single strand, is responsible for adding these primers to the lagging strand so that the polymerase can work ‘backwards’ along that strand, adding nucleotides in the right, 3′, direction. So it’s called the lagging strand because it has to work through this more long, drawn-out process.

Canto: Yes, and apparently, this means that you have all these fragments of DNA, called Okazaki fragments. I’m not sure how that works…

Jacinta: Let’s devote our next post on this subject entirely to Okazaki fragments. That could clarify a lot. Or not.

Canto: Okay, let’s. Goody goody gumdrops. In any case, these fragments can be kind of sewn together using DNA ligase, presumably another miraculous enzyme. And the RNA becomes DNA. Don’t ask. I’m sure all will be revealed with further research and investigation.

References

Leading and lagging strands in DNA replication (Khan Academy video)

https://www.quora.com/What-is-DNA-unzipping

https://www.yourgenome.org/facts/what-is-dna-replication

Written by stewart henderson

February 26, 2020 at 10:59 pm

Epigenetics 8: some terms

leave a comment »

Histone, with DNA wrapping, rendered by the Protein Data Bank (PDB)
Histone, with DNA wrapping, rendered by the Protein Data Bank (PDB)

 

The gene is not more ‘basic’ than the organism, or closer to ‘the essence of life’, whatever that means. Organisms have DNA codes, and they maintain external forms and behaviours. Both are equal and fundamental components of being. DNA does not even build an organism directly, but must work through complex internal environments of embryological development, and external environments of surrounding conditions. We will not know the core and essence of humanity when we complete the human genome project. 

Stephen Jay Gould, ‘Magnolias from Moscow’, in Dinosaur in a Haystack, 1996

I remember ages ago promising that I’d start every blog piece with a quote, then I more or less immediately forgot about it. Anyway the above quote kind of refers to epigenetics, and anticipates, in a way, the disappointment that many have felt about the human genome project and its not-quite-revelatory nature. As we learn more about the complexities of epigenetics, more about the relationships between genotype and phenotype will be revealed, but the process will surely be very gradual, though relentless. But I can’t talk, knowing so little. In this post, I’ll look at a very few key terms to help orient myself in this vast field. Not all will be specifically related to epigenetics, but to the whole field of DNA and genetics. 

nucleosome: described as ‘the basic structural form of DNA packaging in eukaryotes’, it’s a segment of DNA wound round a histone ‘octamer’, a set of eight histones in a cubical structure. All of this is for fitting DNA into nuclei. Nucleosomes are believed to carry epigenetic info which modifies their core histones, and their positions in the genome are not random. Each nucleosome core particle consists of approximately 146 base pairs. 

chromatin: a complex of DNA and protein, which packages DNA protectively, condensing the whole into a tight structure. Histones are essential components of chromatin. Chromatin structure is affected by methylation and acetylation of particular proteins, which in turn affects gene expression. 

nucleotides: the basic building blocks of DNA and RNA, they consist of a nucleoside and a phosphate group. A nucleoside itself is a nitrogenous base (also known as a nucleobase) and a five-carbon sugar ribose (a ribose – these explanations always need more explaining – is a simple sugar, the natural form of which is D-ribose, and which comes in various structural forms). DNA and RNA are nucleic acid polymers made up of nucleotide monomers. 

nucleobase: a nitrogenous base (e.g. adenine, cytosine, thymine, guanine, and uracil which replaces thymine in RNA), the fundamental units of our genetic code. Also simply known as a base. 

base pairs: a base pair, in DNA, is one of the pairings adenine-thymine (A-T) or cytosine-guanine (C-G). They are pyrimidine-purine pairings. Adenine and guanine are purines, the other two pyrimidines. Due to their structure pyrimidines always pair with purines. 

CpG islands: regions of DNA with a high frequency of CpG (C-G) sites, i.e. sites where a cytosine nucleotide is followed by a guanine nucleotide in linear sequence in a particular direction. 

histones: highly alkaline proteins, the chief proteins of chromatin, and the means of ordering DNA into nucleosomes. There are four core histones, H2A, H2B, H3 and H4. These form an octamer structure, around which approximately 146 base pairs are wound. 

Obviously, I’m very much a beginner at comprehending all this stuff, but I note that the number of videos on epigenetics seems to increase almost daily, which is raising my skepticism more than anything. I try to be selective in checking out these videos and other info on the topic, as there’s always this human tendency to claim super-solutions to our problems, as in super-foods and super-fitness regimes and the like. I’m more interested in the how of things, which is always a more complicated matter. Other information sources tend to assume knowledge or to skate over obvious complexities in a facile manner, and then of course there’s the ‘problem’ of being a dilettante, who wants to learn more about areas of scientific and historical knowledge often far removed from each other, and time’s running out, and we keep forgetting…

So anyway, I’ll keep plodding along, because it’s all quite interesting.  

Written by stewart henderson

February 23, 2020 at 12:20 pm

epigenetics and imprinting 7: more problems, and ICRs

leave a comment »

This image has an empty alt attribute; its file name is screen-shot-2020-02-02-at-10.11.35-pm-1.pngthe only image I can find that I really understand

 

In the previous post in this series I wrote about the connection between two serious disorders, Angelman syndrome and Prader-Willi syndrome, their connection to a missing small section of chromosome 15, and how they’re related to parental inheritance. These syndromes can sometimes also be traced back to uniparental disomy, in which the section of chromosome 15 is intact, but both copies are inherited from the mother (resulting in PWS) or the father (resulting in AS).

So the key here is that this small section of chromosome 15 needs to be inherited in the correct way because of the imprinting that comes with it. To take it to the genetic level, UBE3A is a gene which is only expressed from the maternal copy of chromosome 15. If that gene is missing in the maternal copy, or if, due to uniparental disomy, both copies of the chromosome are inherited from the father, UBE3A protein won’t be produced and symptoms of Angelman syndrome will appear. Similarly, PWS will develop if a certain imprinted gene or genes aren’t inherited from the father. Other imprinting disorders have been found, for example, one that leads to Beckwith-Wiedemann syndrome, though the mechanism of action is different, in that both copies of a gene on chromosome 11 are switched on when only the paternal copy should be expressed. This results in abnormal growth (too much growth) in the foetus. It too has an ‘opposite’ syndrome, Silver-Russell syndrome, in which the relevant protein expression is reduced, resulting in retarded growth and dwarfism. 

But now to the question of exactly how genes are switched on and off, or expressed and repressed. DNA methylation, briefly explained in my first post on this topic, is essential to this. Methyl groups are carbon-hydrogen compounds which can be bound to a gene to switch it off, but here’s where I start to get confused. I’ll quote Carey and try to make sense of it:

… it may be surprising to learn that it is often not the gene body that is methylated. The part of the gene that codes for protein is epigenetically broadly the same when we compare the maternal and paternal copies of the chromosome. It’s the region of the chromosome that controls the expression of the gene that is differently methylated between the two genomes.

N Carey, The epigenetics revolution, 2011 p140

The idea, I now realise, is that there’s a section of the chromosome that controls the part of the gene that codes for the protein and it’s this region that’s differently methylated. Such regions are called imprinting control regions (ICRs). Sometimes this is straightforward, but it can get extremely complicated, with whole clusters of imprinted genes on a stretch of chromosome, being expressed from the maternally or paternally derived chromosomes, and not simply through methylation. An ICR may operate over a large region, creating ‘roadblocks’, keeping different sets of genes apart, and affecting thousands of base-pairs, not always in the same way. Repressed genes may come together in a ‘chromatin knot’, while other, activated genes from the same region form separate bundles.

Imprinting is a feature of brain cells – something which, as of the writing of Carey’s book (2011), is a bit of a mystery. Not so surprising is the number of expressed imprinted genes in the placenta, a place where competing paternal-maternal demands are played out. As to what is going on in the brain, Carey writes this:

Professor Gudrun Moore of University College London has made an intriguing suggestion. She has proposed that the high levels of imprinting in the brain represents a post-natal continuation of the war of the sexes. She has speculated that some brain imprints are an attempt by the paternal genome to promote behaviour in young offspring that will stimulate the mother to continue to drain her own resources, for example by prolonged breastfeeding.

N Carey, The epigenetics revolution, 2011. pp141-2

This sounds pretty amazing, but it’s a new epigenetic world we’re exploring. I’ll explore more of it next time.

References

The epigenetics revolution, by Nessa Carey, 2011

Epigenetics, video: SciShow

Written by stewart henderson

February 2, 2020 at 10:33 pm

a DNA dialogue 3: two anti-parallel strands

leave a comment »

but why the twist? – we don’t know yet

Jacinta: Ok so these two strands of DNA are described as anti-parallel. Is this just intended to confuse us?

Canto: Apparently not, in fact it’s quite essential. The useful q&a site Quora has good info on this, and understanding it in all its complexity should help us to understand DNA general – it’s one of a thousand useful entry points.

Jacinta: Yes, and I’ll try to explain. It became clear to us last time that the strands or ribbons twisted round in a double helix, called the backbone of the molecule, are made from phosphate and deoxyribose sugar, covalently bonded together. That means tightly bonded. Between the two strands, connecting them like ladder rungs, are nitrogenous bases (this is new to us). That’s adenine, thymine, guanine and cytosine, bonded together – A always to T, and C to G – with weak hydrogen bonds. We’ll have to look at why they must be paired in this way later.

Canto: It’s called Chargaff’s base pairing rule, which doesn’t tell us much.

Jacinta: And, according to a respondent from Quora, ‘the two strands of DNA are anti-parallel to each other. One of them is called leading strand, the other is lagging strand’. But I don’t quite get this. How are there two strands of DNA? I thought there was one strand with two sugar-phosphate backbones, and a rung made up of two – nitrogenous nucleobases? – weakly connected by hydrogen bonds.

Canto: I think the idea is there are two strands, with the attached bases, one next to another on the strand, and weakly attached to another base, or set of bases each attached to another phosphate-sugar backbone. As to why the whole thing twists, rather than just being a straight up-and-down ladder thing, I’ve no idea. Clearly we’re a couple of dopey beginners.

Jacinta: Well, many of the Quora respondents have been teaching molecular biology for years or are working in the field, and just skimming through, there’s a lot be learned. For now, being anti-parallel is essential for DNA replication – which makes it essential to DNA’s whole purpose if I can call it that. I’ll also just say that the sugars in the backbone have directionality, so that the way everything is structured, one strand has to go in the opposite direction for the replication to work. If for example the strands were facing in the same direction, then the base on one side would connect to a hydrophobic sugar (a good thing) but the base on the other side would be facing a hydrophilic phosphate (a bad thing). Each base needs to bond with a sugar – that’s to say a carbon atom, sugar being carbon-based – so one strand needs to be an inversion of the other. That’s part of the explanation.

Canto: Yes, I find many of the explanations are more like descriptions – they assume a lot of knowledge. For example one respondent says that the base pairs follow Chargaff’s rule and that means purines always pair up with pyrmidines. Not very helpful, unless maybe you’re rote-learning for a test. It certainly doesn’t explain anti-parallelism.

Jacinta: Well, although we don’t fully understand it yet, it’s a bit clearer. Anti-parallelism is an awkward term because it might imply, to the unwary, something very different from being parallel. The strands are actually parallel but facing in the opposite direction, and when you think about the structure, the reason for that becomes clearer. And imagining those backbone strands facing in the same direction immediately shows you the problem, I think.

Canto: Yes and for more insight into all that, we’ll need to look more closely at pyrmidines and purines and the molecular structure of the backbone, and those bases, and maybe this fellow Chargaff.

References

https://www.quora.com/Why-are-DNA-strands-anti-parallel

https://slideplayer.com/slide/13304243/

Written by stewart henderson

January 21, 2020 at 2:38 pm

epigenetics and imprinting 5: mouse experiments and chromosome 11

leave a comment »

something new, since Carey’s book was published – a healthy mouse, from entirely maternal DNA, with healthy offspring – and in 2018 a healthy bi-paternal mouse was created

 

So we were looking at how we – mammals amongst others – are engaged in a kind of battle for the best way to ensure our genetic survival into the future, beyond our insignificant little selves. This battle begins in the very early phase of life, as zygotes multiply to form a blastocyst. 

Remember from my last post on this topic, the male mammal is interested in the offspring above all else. He’s even happy to sacrifice the mother for the sake of the child – after all there’s plenty more fish in the sea (or mammals in the – you know what I mean). The female, on the other hand, is more interested in self-preservation than in this pregnancy. She wants more than one chance to pass on her genes.

So, by the blastocyst stage, cells have differentiated into those that will form the placenta and those that will form the embryo itself. Experiments on mice have helped to elucidate this male-female genetic struggle. Mouse zygotes were created which contained only paternal DNA and only maternal DNA. These different zygotes were implanted into the uterus of mice. As expected, the zygotes didn’t develop into living mice – it takes DNA from both sexes for that. The zygotes did develop though, but with serious abnormalities, which differed depending on whether they were ‘male’ or ‘female’. In those in which the chromosomes came from the mother, the placental tissues were particularly underdeveloped. For those with the male chromosomes, the embryo was in a bad way, but the placental tissues not so much.

In short, these and other experiments suggested that the male chromosomes favoured placental development while the female chromosomes favoured the embryo. Thus, the male chromosomes are ‘aiming’ to build up the placenta to drain as many nutrients as possible from the mother and feed them into the foetus. The female chromosomes have the opposite aim, resulting in a ‘fine balance’ in the best scenarios.

Further work in this area has identified particular chromosomes responsible for these developments, and some of the epigenetic factors involved. For example, mouse chromosome 11 is important for offspring development. When the offspring inherits a copy of chromosome 11 from each parent, the offspring will be of normal size. If both copies come from the mother it will abnormally small, while if both come from the father it will be abnormally large. These experiments were carried out on inbred mice with identical DNA. Nessa Carey summarises:

If you sequenced both copies of chromosome 11 in any of the three types of offspring, they would be exactly the same. They would contain the same millions of A, C, G and T base-pairs, in the same order. But the two copies of chromosome 11 do clearly behave differently at a functional level, as shown by the different sizes of the different types of mice. Therefore there must be epigenetic differences between the maternal and paternal copies of chromosome 11.

So this means that chromosome 11 is an imprinted chromosome – or at least certain sections of it. This is the same for other chromosomes, some of which aren’t imprinted at all. But how is it done? That’s the complex biochemical stuff, which I’ll try to elucidate in the next post on this topic.

Footnote: the photo above shows a bi-maternal mouse with healthy offspring, and further work in deleting imprinted genetic regions has allowed researchers to create healthy bi-paternal mice too. There’s a fascinating account of it here.

References:

Nessa Carey, The epigenetics revolution, 2011

https://www.the-scientist.com/news-opinion/first-mouse-embryos-made-from-two-fathers-64921

Written by stewart henderson

January 19, 2020 at 12:26 pm

A DNA dialogue 1: the human genome

leave a comment »

what genomics tells us

Canto: I’m often confused when I try to get my head around all the stuff about genes and DNA, and genomes and alleles and chromosomes, and XX and XY, and mitosis and meiosis, and dominant and recessive and so on. I’d like to get clear, if only I could.

Jacinta: That’s a big ask, and of course we’re both in the same boat. So let’s use the magical powers of the internet to find answers. For example, here’s something that confuses me. The Human Genome project, which ended around the year 2000, involved a mapping of the whole human genome, and that includes coding and non-coding genes, and I think it was found to contain 26,000 or so – what? Letters? Genes? Coding genes? Anyway there’s a number of questions there, but they’re not the questions that confuse me. I don’t get that we now, apparently, have worked out the genetic code for all humans, but each of us has different DNA. How, exactly, does our own individual DNA relate to the genome that determines the whole species? Presumably it’s some kind of subset?

Canto: Hmmm. This article from the Smithsonian tells us that the genetic difference between human individuals is very tiny, at around 0.1%. We humans differ from bonobos and chimps, two lineages of apes that separated much more recently, by about 1.2%….

Jacinta: Yes, yes, but how, with this tiny difference between us, are we able to use DNA forensically to identify individuals from a DNA sample?

Canto: Well, perhaps this Smithsonian article provides a clue. It says that the 1.2% difference between us and chimps reflects a particular way of counting. I won’t go into the details here but apparently another way of counting shows a 4-5% difference.

Jacinta: We probably do need to go into the details in the end, but clearly this tiny .1% difference between humans is enough for us to determine the DNA as coming from one individual rather than 7 to 8 billion others. Strangely enough, I can well believe that, given that we can detect gravitational waves and such – obviously using very different technology.

Canto: Yeah the magic of science. So the Human Genome Project was officially completed in April 2003. And here’s an interesting quote from Wikipedia:

The “genome” of any given individual is unique; mapping the “human genome” involved sequencing a small number of individuals and then assembling these together to get a complete sequence for each chromosome. Therefore, the finished human genome is a mosaic, not representing any one individual.

Of course it would have to be a mosaic, but how can it represent the whole human genome when it’s only drawn from a small number? And who were these individuals, how many, and where from?

Jacinta: The Wikipedia article does give more info on this. It tells us that the project isn’t really finished, as we’ve developed techniques and processes for faster and deeper analyses. As to your questions, when the ‘finished’ sequencing was announced, the mosaic was drawn from a small number of anonymous donors, all of European origin.

Canto: But we all originated from Africa anyway, so…

Jacinta: So maybe recent ‘origin’ isn’t so important. Anyway, that first sequencing is now known as the ‘reference genome’, but after that they did sequence the genomes of ‘multiple distinct ethnic groups’, so they’ve been busy. But here are some key findings, to finish off this first post. They found some 22,300 protein-coding genes, as well as a lot of what they used to call junk DNA – now known as non-coding DNA. That number is within the mammalian range for DNA, which no doubt surprised many. Another blow for human specialness? And they also found that there were many more segmental duplications than expected. That’s to say, sections of DNA that are almost identically repeated.We’ll have to explore the significance of this as we go along.

Canto: Yes, that’s enough for starters. Apparently our genome has over 3 billion nucleobase pairs, about which more later no doubt.

References

http://humanorigins.si.edu/evidence/genetics

https://en.wikipedia.org/wiki/Human_Genome_Project

Written by stewart henderson

January 13, 2020 at 11:48 pm