BIS002C "Biodiversity & the tree of life" Lecture2&3 more on phylogeny & trees

Well, tomorrow is going to be a bit crazy for me.  I teach four lectures tomorrow for “BIS002C – Biodiversity and the Tree of Life” and UC Davis.  Well, actually, I do each of two lectures twice.  This happens for two reasons.  First, there are two sections for the class and the way we do it, each faculty gives each lecture for which they are responsible twice.  Second, on Mondays, we do two lectures for each section.  In total there are four lectures per week and our schedule is as follows

Section A:  MWF 11-12 M 6-7
Section B: MWF 3-4 M 7-8

So tomorrow at 11 AM I give Lecture 2 for the class to Section A.  Then at 3 PM I give Lecture 2 to Section B.  Then at 6 PM I give Lecture 3 to Section A.  Then at 7 PM I give Lecture 3 to Section B.  Well, enough trying to make it seem like I am working hard.  Especially after PZ Myers gave me a little grief about this on twitter since, well, my teaching load is not actually that big compared to many.

Anyway – back to the class.  I am going to be posting about the class here as much as I can.  To give people an idea of the whole course this is the general highly simplified schema:

Lectures 1-5 Phylogeny (me)
Lectures 6-13 Microbes (Bacteria, Archaea, microbial euks) (me)
Lectures 14-21 Plants and relatives (Jim Doyle)
Lectures 22-24 Fungi  (Jim Doyle)
Lectures 25-36 Metazoa (Susan Keen)
Lectures 37-38 Wrap up, symbioses, etc (Susan Keen)

I do Lectures 1-13 and possibly 37-38.

For the first week or so I am introducing the students to various aspects of phylogeny and phylogenetic trees.  We do this in part because the rest of the class is oriented around using phylogenies and phylogenetic trees so it is important that the students really understand them.

To that end, tomorrow here is the plan:

Lecture 2 will focus on (a) the components of a phylogenetic tree and what they mean plus (b) taxa and groups in trees.  Among the topics we will cover are rooted trees, rotating trees (e.g., vertical vs. horizontal), rotating branches in trees, monophyletic groups/clades, non monophyletic groupings, outgroups vs. ingroups and more.  Oh in addition we will show the awesome Tree of Life movie that we did not get to on Friday. See below

Lecture 3 will then focus on characters and on tracing character evolution on trees.  Among the topics we will cover include traits vs states, homology, ancestral vs. derived, synapomorphies, and the many faces of homoplasy.  Am planning to start posting slides from the class after lecture hopefully starting soon.  But I keep refining them so not going to post before I am close to done ….

Any comments or suggestions welcome …

A complete summary (probably TMI) about my time at the human #microbiome meeting #HMP2010

Well, I am writing this on my flight heading back from St. Louis after going to the “Human Microbiome Research Conference”

I have already posted some summaries of the meetings Day 1 and Day 2  and will soon do one about Day 3.  And you can find out more about the meeting from multiple other sources such as here, here, here and here.  But what I thought I would do here is give some of the story behind the meeting – why I went and what it was like behind the scenes, at least for me.  Note – I know this is very long – but hopefully it will give people a feel both for some of the science, some of the behind the scenes, and some of (my) history relating to this meeting.

Some prelude to the meeting

I headed out to the meeting from Davis on the afternoon on the Monday the 30th.  I was originally planning on only going to the meeting for the last day, Thursday. that was because, well, the 31st was my birthday, and I wanted to spend it with my family.  normally, I reserve important family events and do not travel or schedule things that conflict with them.  So, when George Weinstock invited me to give a talk at the meeting, I told him that I could probably only come for the last day.  So that is when he scheduled my talk.

But then as the meeting approached I changed my mind.  You see, in the last few months I have gotten more and more interested in the human microbiome as a research area.  I had done some work on the human microbiome.  For example, my lab was involved in a collaboration with Michael Zasloff at Georgetown University in which they were conducting human ileal transplants and we looked at the microbial recolonization of the ileum after transplantation.   This project was headed by a stellar student, Amber Hartman, who just finished her PhD in my lab.  Her paper was published recently in PNAS (which I note, we paid for the Open Access option there and really wish more people would do similarly).

Anyway, that project had been my main foray into human microbiome studies.  However, I have led or been involved in dozens of studies looking at the genomics of symbioses (good and bad) between animals and microbes (e.g., see these Open Access papers on Wolbachiasharpshooter nutritional symbiontsdeep sea worm epibiontsa cellulolytic symbiont of shipworms).  In fact, I have been moving more and more towards studies of communities of microbes that live in or on animals and plants.  And I have been writing on and off about the microbiome in my blog including the following posts:

And I guess as the meeting approached I realized I really do want to do more human associated studies.


Day 1, August 30, 2010: Getting there

So, given the above desire to go to the meeting, I changed my flight, and then headed on out on Tuesday PM.  As I was posting to twitter along the way and during the meeting, I am going to use my tweets to give some feel for what I was thinking as things were happening:

Tour de Lance

The trip getting there was OK … I got into Phoenix, my connection city and bumped into, of all people, Lance Price at the airport.  He had a cold, and I avoided him after a brief hello, and then got the good news that I was upgraded to first class. Imagine my surprise when who was in the seat next to me it the man with the cold, Lance Price.  This was funny in a way because I had been pondering a recent interaction I had with Price via my blog.  A while back I had written a post pointing to an interesting paper by Price on the penis microbiome and circumcision.  However, I was not completely happy with the paper, since the press release that came with it was quite misleading.  The press release had made some unfortunate statements that gave the impression that the study had direct implications for the transmission of HIV.  The paper had nothing directly to do with HIV – it was mostly a comparison of the microbes found on the penis of circumcised and uncircumcised males. Sure, this could in theory be connected to HIV since other studies had started to make connections between HIV transmission and circumcision and it was possible that the transmission differences were due in some way to microbiome differences.  But they did not show is in any way and it was inappropriate really to have any suggestion that their study was directly connected to the HIV-circumcision story.  So I said is in the blog and amazingly to me, Price, who happened to be the lead author, got the press release taken down and changed in a VERY short period of time.  Anyway, you can read more about that on my post.

So there he was, in the flesh, with a cold of some kind, sitting next to me.  It was actually quite entertaining …. We had a great time talking about penises and vaginas and anthrax and the FBI and all kinds of other nasty things (this makes me want to say – “and everything was fine until the flight attendant came over and had some paper in his hand, held it up and said” … but people might not get that if they have not heard of Alice’s Restaurant by Arlo Guthrie).  I am certain some of the others in first class were wondering WTF was wrong with us.

  • Sat next to Lance Price on flight to #microbiome mtg – am still impressed w/ his response to blog of mine months ago http://ff.im/-pWTaU
  • Passengers near me & Lance Price on way to #microbiome mtg got earful: anthrax, penises, vaginas, wounds, microbes everywhere

Stranded Landed in St. Louis

We landed around 11:30 PM local time and I asked a flight attendant what time zone we were in she said Eastern.  This meant that it would, be 8:30 PM at home.  So while taxiing I called my wife in the hope I could talk to my kids as they went to sleep to say good night.  But alas, it turns out St Louis is on Central Time and the kids were asleep.  We got off the plane and Price and I headed downstairs … He had checked a bag and after discussing it I decided to not wait and share a cab with Dr. Sniffly.  So I took a cab to the hotel. The hotel the meeting was at (and thankfully where my room was) was the Park Chase Hotel. It was perfectly nice, and my room was enormous.  There were some issues, like the proliferation of pillows on the beds, and the strange key card system in the elevators that did not work well, and the 12$ beers in the lobby bar, and the fact that they kept the conference rooms at something like 60°F but it was still an OK place to have a conference.

No food for you

Anyway, I checked in, dropped my stuff in my room and went out to try and find a place to eat.  On the way out of the hotel, I bumped into Ashlee Earl and Bruce Birren who were working on something on the computer in one of the common areas.  I would have lingered longer, since I have known Ashlee for many years as used to collaborate with her when she worked on one of my favorite organisms Deinococcus radiodurans in John Battista’s lab (see our paper on gene expression in Drad here). But since it seemed I was interrupting some important work, I headed out to look for food since I had missed dinner (note to self, getting upgraded on US Air is certainly better than coach but nothing special – no food except chips).

I headed into the streets to a place directed by the hotel front desk people.  Alas, even though they promised the place would be serving food at 12:10 am, it was not.  So I grabbed some simple thing at a cafe and then went to my room to crash, on what was now, my birthday.

Day 2: Start of the meeting

On the morning of the meeting start, I got up a bit late and rushed downstairs to the conference area.

  • Well, I am late to the human microbiome conference this am – I hate meetings that start at 8 am, especially when not on west coast

I got my conference material and ID badge – I had missed the general introduction by George Weinstock and Jane Peterson and Larry Shapiro.  The current speaker was (I think) Jim Versalovic and I was standing in the way back trying to figure out whether to go get some coffee or get a seat.

The talk ended and then I saw that Bruce Birren was up.  Well, as Bruce does some good/interesting stuff and as I had seen him working away earlier that AM, I figured I should stay.  So I grabbed a seat in the way back, and got out my iPad to start tweeting.  I saw that some others were tweeting the meeting and had settled upon a hashtag of #HMP2010, so I started firing away.  I note I like to live tweet meetings for a variety of reasons, including that I think it is a good way to let people have a feel for a meeting as it is happening.  But I also do it to take notes and force myself to pay attention.

Alas, sometimes you do not always write things out perfectly.  Here are my tweets from his talk:

  • Well, I made it … To catch Bruce Birren from Broad starting his talk on 16s surveys and developing high throughput approaches 
  • The human microbiome meeting #HMP2010 is quite packed – microbes rule http://twitpic.com/2jynzt
  • Bruce Birren says that they developed a 454 rRNA sequencing pipeline that has been tested & is accurate across centers; #HMP2010
  • Birren used a mock community to test rRNA PCR methods – wonder if he has read our mock metagenomics paper http://bit.ly/cPaIKF #HMP2010
  • Bruce Birren testing diversity estimators from rRNA PCR – for simulation got many more OTUs back than they put in #HMP2010 #sequencingerror
  • Birren – given that they trust their PCR, now comparing many human samples – seems like they are way behind the field here #hmp2010
  • Birren presenting clustering of samples based on rRNA analysis – but could do much better referencing prior lit on this #HMP2010
  • Interesting question after Birren’s talk re: archiving samples & informed consent – key things to think about in this type of work #HMP2010

Anyway, more later about the mini firestorm that my tweets about his talk created.  I was moving on to other things at the time.  Like responding to my brother’s Michael’s Happy Birthday wishes, which he surprisingly sent through twitter, which he rarely uses.

Next up speaking was Ian Lipkin.  Lipkin is another one of the people at the meeting I first interacted with through my blog, though with him it was a bit more awkward than with Lance Price.  A few years ago now I had written a post about a paper from Lipkin on colony collapse disorder (CCD).  The post I wrote was not positive, let’s put it that way.  And one of the coauthors of the paper, wrote to me and basically told me I was a jackass and I was wrong about what I had written in the post.

Alas, they were right.  And so I did what I thought any good scientist would do if they published something that was wrong – I retracted the blog post and put out an apology (this will be a recurring theme here) and gave myself a new “genomic jerk award”.  Plus I wrote a new post discussing their paper.   And then I was stunned – Lipkin called me up to thank me for apologizing and retracting the post.  Since then I have interacted with Lipkin on and off over the years and have followed his work with some fascination as in a way he is an old school microbe hunter.

Anyway, here is what I wrote during his talk:

  • Ian Lipkin now up discussing microbe hunting in the 21st century #HMP2010 – mentions problems with Koch’s postulates at start
  • Ian Lipkin talking about emerging infectious diseases – says we have only scratched surface of sampling viral diversity #HMP2010
  • Lipkin showing world wide air traffic to scare us (rightly so) about how fast EIDs can spread #HMP2010
  • Ian Lipkin referring to Edgar Allen Poe and cryptography in trying to sort through Metagenomic data#HMP2010 #binningishard
  • Lipkin talking about nothing controversial: autism and MMR and microbes #HMP2010
  • Lipkin has looked at microbial diversity in biopsies of people w/ &; w/o autism – found microbes associated w/ autism #hmp2010
    • apoorva_nyc asked: @phylogenomics what sort of microbes? that’s all this field needs, more gut/autism theories
    • I wrote back: @apoorva_nyc he simply said that there are differences in the microbial diversity in patients with autism – not claiming causal
  • Really like how Lipkin caveated his autism work by saying “this is one study in one place” & then asked for others to test if real #HMP2010
  • Ok Ian Lipkin has won me over forever . He just quoted Yogi Berra and Neils Bohr in one sentence #HMP2010

Coffee with a Tiger

In the first break, I headed out to get coffee, as I was starting to drag.  I bumped into none other than Owen White, who also was going to get coffee.  Owen was one of the people who helped convince me to take a faculty job at TIGR in 1998.  He was the PI on the project to sequence the genome of Deinococcus radiodurans, the most radiation resistant organism known, and I was at the time working on the evolution of DNA repair and radiation resistance.  One of my first projects at TIGR involved helping Owen analyze the Deinococcus genome and write a paper on our analysis.  I note, look at the figures in that and other TIGR genome papers very very carefully and you will see some secrets Owen, I and others put in there.  For example, in the Deinococcus paper, search for “strangelove.”

I worked at TIGR for eight years before moving to Davis in 2005 and Owen and I worked together on many many projects.  Owen and I did not always see eye to eye on various things while I was at TIGR but I both respect his work enormously and still consider him a friend so it was great to see him from my point of view.

We went to the hotel cafe and got there just before all the others from the meeting since we snuck out just as questions were starting for the last talk.  And then we wandered back to the mixing area outside where the talks were and I bumped into an array of people I knew.  The meeting really was a “who’s who” of microbial genomics and human microbial studies including many of my old TIGR colleagues (most of whom were ow at either UMD or the J. Craig Venter Institute – JCVI).

More talks

Talks started up again I went back to my live tweeting.  I note, it was also interesting to see who else was tweeting the meeting and eventually to meet them in the flesh.  After the break there were four talks:

  • Roger LaskenJohannes Goll discussed metagenomic projects at the J. Craig Venter Institute.  They did a tag team talk with Roger focusing on whole genome amplification (which he was one of the pioneers of) and Johannes discussing the JCVI metagenomic analysis tools.  Here is what I wrote:
    • Roger Lasken from the J Craig Venter Institute now talking about whole genome amplification w/ MDA to get at uncultured organisms’ #hmp2010
    • Roger Lasken: phylogenetic analysis can help assess MDA samples – note work done by Jonathan Badger who used to work in my lab #HMP2010
    • Johannes Goll discussing Metarep – JCVI metagenomics analysis tool #HMP2010 – now available / published http://ff.im/-pYRF7
    • Lasken: shallow 454 sequencing with barcoding – 50-100 MDA reactions to verify 16S classification of HMP samples #HMP2010
    • Best place to find out about Human Microbiome Project?: probably The HMP DACC #HMP2010 #somecoolthingshttp://ff.im/-pYTFo
    • For software geeks: The MetaREP software is open source & uses CAKEPHP, Solr, MySQL mix to do rapid indexing and searching #HMP2010
  • Makedonka Mitreva from Wash. U. then discussed various microbiome projects going on there.
    • Makedonka Mitreva from Wash U Genome Center: shotgun metagenomics of 54 people x 5 body sites: oral, skin, gut, vagina, nasal #HMP2010
    • Mitreva – the big problem in analyzing human microbiome metagenomics data is computational time –#HMP2010 #weneedbettermethods
    • Ugh: Mitreva referred to “lower eukaryotes” – maybe she says this because the podium is high above the meeting hall? #HMP2010
    • Mitreva: current tools for analyzing metagenomic data mostly die (my word, not hers) when analyzing massive new data sets #HMP2010
  • Owen Whitethen gave an overview of the Human Microbiome Project Data Analysis and Coordination Center, which he runs. His talk was quite good – a good mix of detail about what they are doing and big picture commentary about the state and future of this type of activity.
    • Next up Owen White on the HMP Data Analysis and Coordination Center that he runs #HMP2010http://www.hmpdacc.org/
    • White: one of their goals is to have a nice, easy to use website to coordinate information about HMP #HMP2010see http://www.hmpdacc.org/
    • White: major goal is to provide ease of access to certain types of HMP data #HMP2010
    • Key issue: open data is important but ease of access and use of open data also critical #HMP2010
    • Best place to find out about past & ongoing microbial genome projects?: definitelyhttp://www.genomesonline.org/ #HMP2010
    • I almost said same thing: RT @mikethemadbiol#HMP2010 *I* care if we sequence another E. coli genome 😉 Though metadata ARE critical
    • Key issue: genome and metagenome data is not very useful without metadata about samples, methods, etc#HMP2010
    • White: some challenges in downloading data from short read archive from NCBI #HMP2010
    • White: DACC putting together virtual machines to run all sorts of microbiome/metagenomic analyses – can run locally or on cloud #HMP2010
    • White: his goal is to teach community to fish not give them food (i.e., to give community tools, rather than run things for them) #HMP2010
  • Eric Alm gave the final talk in the session.  His was not about human microbiome work but instead about population genomics of Vibrio strains.   
    • And now for something completely different but very
      relevant: ocean microbes and genome evolution from Eric Alm#HMP2010
    • Eric Alm discussing Fred Cohan’s ecotype model for species formation in bacteria and archaea #HMP2010
    • Eric Alm describing how microbial samples are collected from ocean water; me: informatics people need to understand these things #HMP2010
    • Alm: comparing 75 genomes of strains of Vibrio species; note all of these are in essence same OTU/species but lots of differences #HMP2010
    • A questioner asked about something relating to “higher primates” – come one everyone – stop the evolution bogusness #HMP2010
    • Alm, in response to my question, says that much of their evolution analyses done on incomplete genomes but he thinks it works fine #HMP2010

Lunch and PM talk burnout

And then it was time for the herd of scientific sheep to go to lunch.  The mechanisms for getting up to the 11th floor for lunch were awkward. Two-three elevators, no stairs, and nobody sure exactly where to go. But people made it on up to a freezing cold room on the top with a nice view of the park nearby the hotel.  I had a relatively quick lunch, went outside on the balcony to get some air, and then it was time for the PM sessions.

I was now a bit burned out. Too many talks I think.  And it is a bit hard to tweet all the talks and not just space out for a while.  But I soldiered through, and tweeted some more,

Much of the PM talks are a bit blurry to me now, but here is what I wrote on twitter:

  • Ruth FarrellRichard Sharp, both bioethicists from the Cleveland Clinic talked about ELSI issues in the human microbome project
    • Oops … Missed the first part of the talk on ELSI issues associated with the human microbiome project#HMP2010 #importantstuff
    • Issues relating to microbiome ethics: genetic discrimination, GMOS, release of probiotics to world, gene therapy #HMP2010
    • Richard Sharp discussing how patients seem willing to use GMO probiotics even if they don’t like GMO foods#HMP2010
    • Sharp: patients may be susceptible to aggressive marketing of probiotics #HMP2010
    • Sharp saying that we should not oversell the microbiome project – he should read my bloghttp://bit.ly/ahXB4n #HMP2010
  • Jo Handelsman discussed some of her work on looking for antibiotic resistance encoding genes in microbes in the environment
    • Jo Handelsman, one of my favorite scientists, now up talking about finding antibiotic resistance genes in the environment #HMP2010
    • Jo Handelsman refers to the “resistome” – collection of antibiotic resistance genes in a community;#badomics word? – probably #HMP2010
    •  I note, Jo Handelsman coined the word metagenomics many years ago #HMP2010 – this omics word seems OK to me
  • Vincent Denef discussed population genomics in the human microbiome
    • Vincent Denef from Jill Banfield’s lab discussing “how deep does the rabbit hole really go?” #HMP2010
    • Denef now explaining his talk title – lots of closely related organisms and may have to distinguish at very fine scale #HMP2010
    • Denef now giving homage to the red pill blue pill choice from the Matrix #HMP2010#Toldyoumetagenomicswascool #geekytoo
    • Denef discussing metagenomic studies of preterm infant gut -using similar methods as in acid mine drainage studies #HMP2010
    • Denef – looking at population shifts over time using metagenomics -getting to correlating pop. changes with specific allelic diffs #HMP2010
    •  Denef showing how proteomics can be used in environmental studies – Banfield lab does v. cool stuff in this area #HMP2010
    • A main worry re: metagenomic studies: like GWAS, lots of genetic variation but w/ few variables, will always find correlations #HMP2010

In the next break I bumped into Ian Lipkin and we agreed to go to dinner together.  Then the next session seemed to begin way too fast but it was good.

  • David Relman was up first.  I note, before I got accepted to graduate school I had an offer to work in Relman’s lab on rRNA PCR studies (I do not think we ever met, but I am not sure anymore).  I wonder how my career would have ended up if I had joined his lab then …           
    • David Relman is w/o a doubt the Philosopher of the Microbiome – always discussing deep questions #HMP2010
    • David Relman discussing single cell genomics and other approaches to studying single cells in communities#HMP2010
    • Relman: reagents in molec. biology freq. contaminated w/ DNA; this impacts many amplification studies when DNA levels low; #HMP2010   
  • Margaret-McFall-Ngai who I have interacted with on and off over the years and love her approach to studies of symbioses.
    • Margaret McFall-Ngai : vertebrates are a very old group & likely been associated w/ microbes for hundreds of millions of yes #HMP2010
    • McFall-Ngai: humans are not the pinnacle/crown of evolution; me: just YAV (yet another vertebrate) #HMP2010
    • RT @TheGenomeCenter McFall-Ngai: most human genes are old; most genes associated with human disease are also old#HMP2010 
    • McFall-Ngai’s talk is so far an homage to model organisms and how they will be relevant to human microbiome studies #HMP2010
    • McFall-Ngai encouraging everyone to study circadian rhythms of host-symbiont interactions #HMP2010
  • Mihai Pop who I used to work with when we were both at TIGR.  He discussed metagenomic assembly methods.
    • Mihai Pop says his talk on Metagenomic assembly will be very short b/c it is impossible #HMP2010
    • Alas Pop didn’t end his talk, says one can make (somewhat) useful assemblies from metagenomic data; utility depends on goals #HMP2010
    • Pop showing challenge in assembly by showing assembly of A Tale of Two Cities; repeats Ike “it was the” can confuse things #HMP2010
    • Pop: genome assembly is hard b/c of 1) repeats 2) low coverage 3) sequencing errors – gets even harder for metagenomics #HMP2010

A bloggable dinner?

After the last talk that PM I got a drink in the bar and was hanging out talking to people when I realized I was late to meet Ian and I headed over to the lobby and off we went.  We found a place and sat down outside – good to not be in a freezing cold room for a while.  Mike the Mad Biologist of ScienceBlogs wandered by and we invited him to join us (not sure if his real name is public knowledge so not posting it here).  We had a good dinner and Lipkin even got the restaurant to bring out a candle for my birthday after dinner.

Drinks and criticism and bedtime stories

Afterward, we went back to the hotel,  I dropped off my stuff and headed down to the bar to get a birthday drink, and bumped into a crowd of people from the Broad, JCVI and UMD coming from a dinner they had for the sequencing centers involved in the HMP.  I skipped out for a fun 30 minutes to tell bedtime stories to my kids over the phone (I really wonder what the hotel staff lurking nearby thought while listening to me tell a crazy invented story about submarines, polar bearskin, whales, and strange activities).

Anyway I returned to the bar, got some complaints from the Broad and JCVI people about something I tweeted about Bruce Birren’s talk and then went to my room. The comments about my tweet led me to relook at what I had written and especially in light of having been thinking about the Lipkin blog retraction.  So after fretting a bit, I decided I probably was a bit over the top in some of my comments on his talk.  So I wrote up a mini apology and clarification on my blog at night when I wrote up notes on day 1.  In summary what I said in that post I quote here, in part to make this a very very long post, but also b/c I think it is relevant:

As I have tweeted many meetings I guess I am used to various aspects of such activities but many out there clearly are not.  Live tweeting a meeting is a rough thing in many ways, at least for me.  I want to give people a feel for the meeting, as it happens.  I want them to know what I actually think about talks, at least within some reasonable limits.  But alas sometimes, hopefully not too often, I get things wrong.  And sometimes I post something obnoxious.  And sometimes I miss key points.  To me, this is analogous to the conversations people have about talks all the time.  Overall, I think mostly I do an OK job tweeting meetings.  But occasionally I write something that does not sit right with others or myself.  And alas, today has one such tweet (well, only one I know of right now).

It happened during Brice Birren’s talk.  Birren, from the Broad, was discussing a few different things including studies done by the Broad in which they have tried to compare and contrast and use rRNA PCR studies done at different centers associated with the HMP.  Much of what he was discussing was technical details of the control experiments they did to assess how variable the results were between centers.  I felt at the time that he was placing this work in enough of the context of other rRNA studies of the human microbiome.  And I tweeted this feeling.

  • phylogenomics Birren presenting clustering of samples based on rRNA analysis – but could do much better referencing prior lit on this #HMP2010
  • phylogenomics  Birren – given that they trust their PCR, now comparing many human samples – seems like they are way behind the field here #hmp2010

But in the end, after discussions at the bar later, I think I may have missed the point of his talk.   I thought at the time that he was discussing solely new findings and new analysis tools that they developed.  To me, I did not like that he did not spend much time discussing other analysis tools nor how their work compared to other studies of the human microbome.  And that gnawed at me.

But in retrospect, I think perhaps he was focusing more specifically on the comparison across the centers.   In that context, the way he laid out his talk and what other work he referenced makes more sense.  I think perhaps he could have still placed things in a broader context but my comments in retrospect were a bit over the top and unnecessary.  As I said above, I think it is useful to try and post what I am actually thinking at the time.  I try to filter this if I know what I am thinking is rude, biased, obnoxious, etc; but I do post critiques if they seem relevant.  But my filtering was a bit off here.  Alas, now Bruce (if he reads twitter) probably wants to dump a beer on my head and people from Broad think I am a putz (well, some may have thought that before). I guess I may have to change the tuning on my filter a bit … but I still will try and post what I think at the time.  It is a fine balance I do not always do precisely … off to sleep and in the AM – Day 2 plus time to make some in person apologies …

In the end, I accept that live tweeting is a bit risky in the way I do it, where I try to not just say what people are saying but try to say what i think about what they are saying.  Sometimes my comments are overreactions or rude or wrong.  But I always try to correct them or clarify when something was off.  From the emails and tweets out there I know a lot of people follow along when I am tweeting a meeting and they seem to like it … Just got to try and be careful to not be over the top.

Other notes on Day 1 on the meeting.

In addition to my comments about the Birren tweet, I thought it might be of interest to post here what else I said about Day 1.

Well, Day 1 of the Human Microbiome Project meeting is over.  And overall, I think it was quite good.  So now for a Day 1 wrap up.

First, a bit about the meeting.  The meeting is formally called the “Human Microbiome Research Conference” and more information about it can be found here. It is directly tied to the NIH “Human Microbiome Project“, also known as the HMP, which has been in operation for a few years now.  The HMP is one of a small number of NIH “Roadmap” initiatives (these are also known as NIH Common Fund Projects).  These are cross cutting projects that are funded outside of the normal NIH departments/centers.  The HMP started a few years ago and is focused on studies of the communities of microbes that live in and on humans.  The HMP has so far funded a wide array of projects including some big scale and some smaller scale.  This meeting is I think the first to try to cover the diversity of different projects funded by the HMP at once.

Day 1 covered a bit of introduction and then a diversity of HMP related and some not so related (but good) talks.  The talks were perhaps a bit too reviewy for my liking, but I think actually, all of them were interesting.  Some lessons I got from today include the following:

  1. The human microbiome is becoming a fascinating area of research with an ever growing set of data to look at
  2. The data for the HMP seems to be openly available, which is good. Reference genomes can be found here. Some rRNA data can be found here. Strains of microbes are available too.  Not sure where the metagenome data.
  3. Ease of access and use of data is more important than just access to data; the HMP DACC is doing a decent job with helping access data.
  4. As important as access is metadata about samples. Not sure how much of this is available right now nor how easy the metadata is to make sense out of.
  5. As with most (all?) large scale, top down projects, there are multiple areas where improvements could be made in communication and engagement with the broader community.  It seems like the HMP is working hard on this issue.
  6. There are some possible complicated issues around release of microbiome data and medical records from people
  7. There is still a big risk in overselling the potential benefits of microbiome research
  8. Correlations ≠ causation.  Sorry I had to put that here.  See #7 above.
  9. Analyzing and making sense of metagenomic data is still very very hard
  10. We desperately need more ecology driven studies of the microbiome
  11. To me, the HMP should really try to mimic the human genome project and focus on producing reference data (genomes, metagenomes, and rRNA) for everyone to use.  Trying to do complex scientific/clinical studies in this project seems inappropriate.  We all need the baseline to do the science.
  12. As with every meeting, the best stuff that happens is in between talks.
  13. Having the meeting room be something like 40 °C is probably not the best idea nor use of resources (hotel issue, not HMP issue).

Those are some of the lessons I am thinking about now, a few hours after the last talk.  But if you want to get a “real time” feel for the talks, the best way to do this, if you weren’t here, is to look at twitter posts about the meeting.  If you do not know, the common practice these days is to use a code within twitter specific for the meeting called a hashtag.  For this meeting the hashtag is #HMP2010 and you can find the tweets about the meeting easily by searching twitter for this code.  There were other twitter posts about the meeting, but may be somewhat hard to find b/c they did not use this code.  Such is life I suppose.

Anyway, if you want to see all the tweets from the first day of the meeting with this hash tag I have appended them at the bottom of this email.  This includes anyone who may have reposted (aka retweeted) these tweets to their twitter feed. 



Day 3 of trip, Day 2 of meeting.

Anyway, since I was up so late working on the blog post, I overslept and missed the first few talks,  I was really really bummed about this because one of the ones I missed was from Floyd Dewhirst who does some absolutely fascinating stuff.  I also missed a talk by Susan Haake.  Bummer.  But talks did start really early.

Fortunately, I got there in time for most of Jeff Gordon’s talk.  I simply am in awe of Jeff Gordon.  He is to me, the grand guru of the field of human microbiome research.  He has in particular specialized in making studies of germ free animals a key part of work on the microbiome.  I have always had very positive interactions with him, although I think he may still want to get back at me for including him in my April Fools joke about Craig Venter sequencing his own microbiome.

  • Jeff Gordon discussed his current work on using germ free animals to study function and ecology of microbial communities
    • RT @TheGenomeCenter Next talk Jeff Gordon from @wustl on the marriage of comparative metagenomics and gnotobiotics#HMP2010
    • RT @TheGenomeCenter Gordon: took fecal microbe community from healthy donor and graft it into gut of germ-free mice successfully #HMP2010
    • After brief delay now at #HMP2010 – Jeff Gordon up: creating artificial communities of many species & introducing to germ free animals
    • Gordon: using defined communities of microbes to test microbiota response to different diets#HMP2010 – track changes in abundance, txn, etc
    • Gordon : using RNAseq to track gene expression patterns in model microbial communities #HMP2010
    • Gordon: Goal .. use “personal” culture collections isolated from people to get a collection that has coevolved w/ each other #HMP2010
    • Gordon – much of OTU diversity in human gut microbes can be captured via culturing #HMP2010; though this ignores w/in species diversity
    •  Jeff Gordon has electrified the crowd w/ his discussion of model organisms & model communities#HMP2010 #modelorganisms
  • Liping Zhao discussed microbes associated with metabolic syndrome
    • Now Liping Zhao from Shanghai Center of Systems Biomedicine on re-eng. gut microbiota with TCM foods to prevent metab. syndrome #HMP2010
    • Zhao: multiple models could explain connection between microbes and obesity #HMP2010
    • Zhao: who is right? Grandma: you are what you eat; Geneticists: you are your genes #HMP201
    • Good to sea some old school TFLP Data here from Zhao mixed in with sequence data #HMP2010
    • Zhao: what is connection between diet, microbiome and longevity? Doing experiment in mice over four years #HMP2010
    • Zhao: diet / excercise differences -> lifespan differences + microbiota differences; though I note just correlations right now #HMP2010
    • Zhao emphasizes benefits of having reference genomes to sort through metagenomic data #HMP2010
    • Zhao wants to encourage marriage of metagenomics and metabolomics in gut studies #HMP2010

Then there was a brief break and then more talks.  Much of this has blurred together now so I think my tweets will be the best record of what happened from my point of view.

  • Joe Petrosino discussed metagenomic projects at Baylor
    • Now Joseph Petrosino from Baylor on improved metagenomic sequencing and analysis for healthy and diseased individuals #HMP2010
    •  Joe Petrosino from Baylor giving overview of role of Baylor in HMP #HMP2010
    • Petrosino – reminds people that not only is HMP producing data and science, but also spin off benefits akin to space program #HMP2010
    •  Petrosino: key ?s: what are biases, how do new sequencing platforms perform, what is gained/lost from assemblies #HMP2010
    • Petrosino has interesting results on sequencing mock community, I note we have study like this in PLoSonehttp://bit.ly/cPaIKF #HMP2010
    • Petrosino doing some interesting analysis of the risks/benefits of assembly in metagenomics #HMP2010
    • Note of caution: For statement “X manipulations of microbes help treat X ailments in X people X of the time”: X=”some” NOT “all” #HMP2010
    • Petrosino working on a viral metagenomic sequencing pipeline including isolation, concentration strategies#HMP2010
  • Phil Tarr discussed microbes associated with necrotizing enterocolitis
    • Next up – Philip Tarr on the infant microbiome and necrotizing einterocolitis #HMP2010
    • I have created a PaperLi newspaper on the human microbiome meeting tweets – seehttp://www.paper.li/tag/HMP2010 #HMP2010
    • Phil Tarr telling a very sad story of a premature infant dying of necrotizing enterocolitis – which is alas very common #HMP2010
    • Phil Tarr doing “massively parallel stool collection” for necrotizing enterocolitis study “all stools all the time” #HMP2010
    • Tarr: high throughput, barcoded, quick collection of all stools of premies in hospital, then if some get NEC will do sequencing #HMP2010
    • Tarr doing a great job of saying who did the analysis for which he presents results #HMP2010#willtrytodothisbetter
  • Vincent Young
    • Next up Vince Young on the role of gut microbiota in ulcerative colitis, IBD #HMP2010
    • Young: many studies suggesting possible connection between microbes and IBD; but emphasizes these are associations not causal proof #HMP2010
    • Young: complexities in IBD microbe studies incl. lack of time zero, diversity of host history/genetics, no info on microbe function #HMP2010
    • Young looking at “pouchitis” to develop in IBD patients who get inflammation of diverting intestinal pouch#HMP2010
    • Those interested in IBD/microbes might want to look at ileal transplant study from my lab/Georgetown http://bit.ly/XFRWf #HMP2010
  • J. Dennis Fortenberry
    • Dennis Fortenberry: urethral & coronal sulcus microbiome studies of adolescent males; offers caution for sexual nature of talk #HMP2010
    •  Fortenberry “for those of you who have not been thinking about male sexual anatomy this morning, here is a primer” #HMP2010
    • RT @TheGenomeCenter Fortenberry: multiple microbiomes of the penis – in areas where pass urine and areas used during sex#HMP2010
    • Fortenberry discussed scanner image of MF sex, how microbes could exchange, ended w/ “don’t how how they got them in scanner” #HMP2010
    • Fortenberry on difficulty studying microbe-sexual activity connection in teens “These are free-ranging adolescents” #HMP2010
    • Fortenberry: need to train teens “amazing the things teenage males will do with specimen cups that does no involve peeing in cup” #HMP2010
    • Fortenberry discussing ethical issues w/ genomic research on young people; especially on sexual behavior #HMP2010

Then lunch.  I sat again with some people from JCVI including Granger Sutton and Shibu Yooseph.  And I tried to get out of there quickly to get a bit of time to walk around, but it was hard to escape.

Then came the concurrent sessions which made me write this:

  • Oh no – #HMP2010 has concurrent sessions – where to go? Where to go?

Here are some of my notes from the concurrent session talks I went to:

  • Gail Rosen discussed metagenomic binning methods
    • Listening to Gail Rosen talk about naive bayesian classifiers for metagenomic data #HMP2010
    • @ianholmes yes i think kmer classification though got in late
      • @phylogenomics “naive Bayesian classifiers for metagenomics” = clustering using k-mer composition of reads?
    • Here is a link to Gail Rosens Naive Bayesian Classifier tool for metagenomics analysishttp://nbc.ece.drexel.edu/ #HMP2010
    • Here is a link to Gail Rosens Naive Bayesian Classifier tool for metagenomics analysishttp://nbc.ece.drexel.edu/ #HMP2010
  • Tom Schmidt discussed culturing microbes 
    • Tom Schmidt now up talking about culturing the human microbiome #HMP2010
    • Schmidt: Great Plate Count Anomaly misinterpreted to mean most microbes are unculturable – means most don’t grow in one condition #HMP2010
    • Schmidt – cultured organisms very valuable for working on functions in communities #HMP2010
    • Schmidt discussing nice trick for tracking oxygen in tissues – GFP only fluoresces win essence of oxygen#HMP2010
    • Schmidt: to study microbes in colon they developed methods to do colonoscopies on unprepped people so microbes not flushed out #HMP2010
    • Schmidt: in trying to culture from human microbiome have to switch methods and samples often to continue to recover new things #HMP2010
  • Anthony Fodor discussed CF infections
    • Anthony Fodor: how do infections in CF patients respond to antibiotics? Followed CF patients during exacerbation events #HMP2010
    • Fodor pronounces T-RFLP “Tee R Flip” while I and I think others usually say “Tee Riff Lip” – I think I like his more #HMP2010
    • Fodor – OTU diversity (i.e., number of bacterial species) is lower in severe CF infections #HMP2010 though he notes small sample size
  • Diane E. Hoffmann discussed federal regulation of probiotics
    • Listening to a Diane Hoffman discussing federal regulation of probiotics – there is lots of misleading stuff out there #HMP2010
    • Hoffman: for most probiotics we know very little about effectiveness, safety, mechanisms and other key parameters #HMP2010
    • Hoffman: whether/how to regulate probiotics depends on health claims, whether in food or not, etc.#HMP2010
    • Many probiotics sellers make structure/function claims like “x promotes intestinal health” – these are less regulated #HMP2010
    • Hoffman: FDA regulates health claims probiotics though FTC can also be involved if claims in media#HMP2010
    • Hoffman: some think probiotics overregulated (making research harder); others think underregulated (bogus stuff out there) #HMP2010

The worst part of the concurrent sessions was that the schedules somehow got switched in some of them and I missed talks by both Rob Knight and Jeroen Raes.  Completely bummed about this as both are great.  Rob Knight apparently discussed his push for an Open Microbiome Project which I think is brilliant.

After the concurrent sessions I had coffee with a post doc candidate from Wash. U and then wandered back over to the hotel lobby to see if I could find someone who wanted to go to dinner.  I saw Peer Bork and Jeroen Raes having a beer and sat down with them with a plan to go to dinner in a bit. Dusko Ehrlich eventually sat down with us too as did a few others.  Eventually, Peer and Dusko headed off I think to work on their talks for the next day and Jeroen and I wandered off for dinner.  We bumped into Gail Rosen and one of her students and we all went to a pub, had dinner, and eventually I headed back to the hotel to do my bedtime stories activity with my kids again.

Then I wrote up a post on notes from Day 2 of the meeting, which is always good to do while things are fresh in ones head.  Here is what I wrote

Well, Day 2 is over now for the Human Microbiome Meeting.  See my previous post for information about the meeting and about Day 1. I enjoyed Day 2, though some things are starting to wear me down a little bit.  In particular, the fact that the meeting starts at 8 AM, or 6 AM California time, has been really rough.  I missed the first 1.5 talks today because of that.  Not to organizers of meetings everywhere – don’t start meetings so early.  And don’t have too many talks in one day.  It is better, far better, to just have fewer talks, than it is to wear people out.  Also not to meeting organizers – do not change the order of talks in concurrent sessions at the last minute.  Very bad idea.  Anyway – onwards.

As with Day 1, I think the best way to get a feel for the meeting would be to look at Twitter posts with the hashtag #HMP2010.  Unlike what I did with my post for Day 1 I am going to put only my tweets below, not everyones.

Today was split up into one series of talks in the AM and then concurrent sessions in the PM.  What lessons did I learn today?  Well here are a few, with some overlap to those on Day 1 but that is OK by me.

  1. Again – correlations ≠ causation.  Those of you out there who do not get this should GTF out of science.
  2. For statement “X manipulations of microbes help treat X ailments in X people X of the time”: X=”some” NOT “all”
  3. Seems like we are really on the cusp of publications of 100s of clinical studies of microbes and their association with health status
  4. Some of these studies are even starting to get at causation
  5. It should be remembered that all of the methods used in microbiome studies are just methods; none are per se better than others; it is the science that should be judged not the tools themselves
  6. With some effort it seems one can culture many more organisms from a system than might be expected
  7. Cultures have many many uses
  8. But culture based studies do not really get at population genetic frequencies and relative abundance information very well
  9. Be wary of those who stick relentlessly with one idea or method
  10. Very strange how few pharma reps there were here (more on this in my next post)
  11. Please please please do not confuse data with knowledge.  Data can be very very useful and I completely support some projects that just are focused on generating data sets.  But knowledge comes from thinking about the data, and carefully analyzing it.
  12. Microbes, I think, run our lives much more than we would like to believe

Anyway – that is a brief update.  Back to preparing my talk for later this AM …

Finally, I got to work on my talk.  Wow that took a while.  I was up until about 3 am.  Finally I got to sleep a bit worried about missing my alarm ..

Day 4: My talk and heading home

Got up with plenty of time to spare, worked a bit on my talk, got my stuff together, grabbed a coffee, and went to set up my presentation. Peer Bork was already there getting his set up. The AV guy was not pleased that we all wanted to present off of our laptops. He said “We have a mac you can use”. I asked if they had keynote and they did not. Not very useful. Got myself set up and then paced around waiting for the lovely 8 am start. And then we were on.  Here are my tweets from just before my talk:

    • Attn Apple Keynote experts: anyone know how to hide the flashing volume bar when using “record slideshow” option?
    •  In relation to my upcoming talk at #HMP2010 I thought I would share this 2007 April Fools story about microbiomes http://ff.im/-q6t01
    • Next up at #HMP2010 Jonathan Eisen, aka, me: the Genomic Encyclopedia of Bacteria & Archaea & the dark matter of the genomic universe

I note, I tried to record my talk but that did not work for some reason.  I did post my slides on slideshare here.  And I think I can embed them in this post

Here are my notes on the rest of the talks in that AM session.

  • Peer Bork: Towards Microbial Markers for the Human Gut Microbiome
    • Now Peer Bork from EMBL on microbial markers for the human gut #microbiome #HMP2010
    • Peer Bork now discussing covariation of gene families within Metagenomic data sets #HMP2010
    • Bork: analyzing metahit data on metagenomics of the human gut #HMP2010
    • Bork: developed a metagenomics pipeline called SMASH which should be available soon #HMP2010
    • Bork: does some interesting comparisons of phylogenetic vs. Functional composition in Metagenomic samples; #HMP2010
    • Bork: tries to remove vagrants (e.g., organisms in food) from Metagenomic data sets by looking at habitat of reference genomes #HMP2010
    •  Bork: w/ small sample get three distinct clusters of co-ocurring gene families within human Metagenomic samples #HMP2010
    • Bork: then went to analyzing newer Metagenomic Illumina data w/ more samples – get three major clusters of coocurring genes again #HMP2010
    • Bork – also sees three distinct clusters of coocurring rRNA types in human samples #HMP2010 – he now calls these enterotypes
    • Bork – can find specific functional categories that correspond to the three distinct Metagenomic types#HMP2010
    • Bork – can find specific functional categories that correspond to the three distinct Metagenomic types#HMP2010
    • Bork: shows cool pathway analysis iterated along fine scale transecting in hypersaline samples, showing as pathways shift #HMP2010
    • Bork : we need better databases of pathway-gene mapping in diverse organisms and diverse phenotypes#HMP2010
  • Dusko Ehrlich discussed the metahit project 
    • Next: one of most entertaining people in field: S. Dusko Ehrlich discussing the MetaHit approach to relation between microbes & us #HMP2010
    • Ehrlich: discussing the BGI/MetaHit Nature metagenomics paper which I personally found fascinating#HMP2010
    • Ehrlich: in gut metagenomes, a high fraction of all genes are shared between people; same true for species#HMP2010
    • Ehrlich: much of the variation between people in metagenome is in relative abundance not in presence/absence of genes or species #HMP2010
    • Ehrlich: comparing metagenome of 177 Danish people; 67 lean, 110 obese, deep Illumina sequencing#HMP2010
    •  Ehrlich: Identifying some microbial genes associated with host body mass index #HMP2010
    • Ehrlich: the major differences between metagenome in obese vs. lean people are genes from Firmicutes#HMP2010
    • Ehrlich shown a cool graph that he calls the Metahit barcode – 4 species x genes in rows, samples in columns ordered by BMI #HMP2010
    • Here is link to the MetaHit human metagenome paperhttp://www.nature.com/nature/journal/v464/n7285/full/nature08821.html #HMP2010
    • Ehrlich also doing metagenome analysis of IBD, says there are correlations but cautions we do not know causal vs. consequence #HMP2010
    • Ehrlich now showing that his metahit barcode can potentially be used for strain level differences #HMP2010
    • Ehrlich: we are on brink of having diagnostic tests, possibly better treatments, possibly novel treatments for health states #HMP2010
    • Ehrlich ends by saying you should follow @metahit #HMP2010
    • Excellent suggestion from crowd after Ehrlichs talk: we should look at microbiome in non industrialized populations #HMP2010
  • Kanitsak Boonanantanasarn discussed oral cancers
    • Boonanantanasarn looking at microbes associated with oral cancers #HMP2010
    • Boonanantanasarn: see increase in archaea in samples from oral cancer vs. normal #HMP2010 #archaearule
  • Heidi Kong discussed atopic dermatitis
    • Now up Heidi Kong on the skin microbiome, atopic dermatitis and immunodeficiency #HMP2010
    • Kong: how to move from correlation to causation?; using longitudinal studies, selective alteration of microbes, mouse models #HMP2010
  • Yuzhen Ye discussed metagenomic informatics
    • Awesome: Yuzhen Ye just used a drawing by her four year old w/ cryptic writing to discuss difficulty in Metagenomic analysis #HMP2010
    • Ye now discussing her FragGeneScan Metagenomic gene finderhttp://omics.informatics.indiana.edu/FragGeneScan/ #HMP2010
    •  Ye: working on assembling genes from within metagenomics data (and thus not doing assemblies of everything) #HMP2010
    • Ye: has a metagenomic binner called abundancebin which not surprisingly using abundance information to bin #HMP2010
    • Uploaded my slides for my #HMP2010 talk tohttp://slidesha.re/cQQkws
    • Ye: describing SWIFT a fast protein similarity search method using reduced alphabet suffix arrays #HMP2010
  • Daniel Haft, who I used to work with a little bit when we were both at TIGR then discussed protein family analysis
  • Cindy M. Liu
    • Next and last: Cindy Liu talking about the sinus microbiota in health and disease #HMP2010
    • Liu discussing chronic rhinosinusitis – effects millions and costs lots (though kills very few) “We should help them” she says #HMP2010
    • Liu discussing need for ecological models of stable states in human microbiome studies #HMP2010
    • And #HMP2010 is done; overall I enjoyed it; will post thoughts on day 3 later

And then, just like that, the meeting was a wrap. I checked out, chatted a bit with a few people, and then off to the airport and home.  If you have gotten this far in this post, I am impressed …  I am going to hold off a few days before writing up a summary of what I learned from the meeting overall but I think the lessons learned from Day 1 and Day 2 are a good place to start.  Thanks for reading.

———————–
Also here are some other links of relevance to the meeting

Quick blog post: interesting piece on the evolution of ecology by Simon Levin

There is a very interesting piece in the Chronicle of Higher Education by Simon Levin on the “Evolution of Ecology.”

See The Evolution of Ecology – The Chronicle Review – The Chronicle of Higher Education

In it Simon, who I consider both a friend and colleague and who has been an inspiration to me for much of my work, discusses the history of the concept and the field of ecology. He repeats a key phrase he has used elsewhere:

Ecology, the unifying science in integrating knowledge of life on our planet, has become the essential science in learning how to preserve it.

I like this phrase and plan to use it a bit here and there, with attribution of course.
Levin also discusses how Darwin’s Voyage of the Beagle helped launch the field of ecology because it

defined a new and synthetic way of looking at nature—in which the patterns characteristic of particular regions found explanation in a unifying, dynamic framework

It was only after Voyage of the Beagle and Wallace’s work and others that the term “oekologie” came into being.
I particularly like the end where he connects ecology to study of other complex adaptive systems like economic ones and medical ones.
The article is really really really worth a read.

Lack of neutrality in bacteria and where pseudogenes go when they die

ResearchBlogging.org

Pseudogenes, which are in essence regions of the genome that used to be genes but no longer able to produce a functional unit, have long been considered to be models of the genetic equivalent of Switzerland’s neutrality. With this assumption of neutrality in hand, researchers have used studies of pseudogenes to better understand what happens to DNA when it is not visible to any form of natural selection. That is, pseudogenes have been thought to be neither harmful (as in, they are not under negative selection) or helpful (i.e., they are not under positive selection).

And from this assumption we have supposedly learned about mutation rates and patterns (because if they are neutral then the changes in pseudogenes should be reflective of mutational processes, not selection) as well as all sorts of other features of genome evolution.
Over the years, some have challenged the assumption of neutrality of pseudogenes (e.g., see here) like many have questioned whether Switzerland is really neutral. But overall, the feeling that pseudogenes were mostly neutral seems to have stuck. However, that may change a bit with a new paper from Chih-Horng Chu and Howard Ochman in PLoS Genetics (PLoS Genetics: The Extinction Dynamics of Bacterial Pseudogenes).
In their paper they report: (this is their authors summary)

Pseudogenes have traditionally been viewed as evolving in a strictly neutral manner. In bacteria, however, pseudogenes are deleted rapidly from genomes, suggesting that their presence is somehow deleterious. The distribution of pseudogenes among sequenced strains of Salmonella indicates that removal of many of these apparently functionless regions is attributable to their deleterious effects in cell fitness, suggesting that a sizeable fraction of pseudogenes are under selection.

Basically, what they did was the following
1. Compare Salmonella genomes. Identify putative pseudogenes and trace their evolution onto a phylogeny of the species.
Figure 1. Distribution of pseudogenes among Salmonellagenomes.
The phylogenetic tree was inferred from 2,898 single-copy genes shared by all fiveS. enterica subsp. enterica strains and the outgroup S. enterica subsp. arizonae.

doi:10.1371/journal.pgen.1001050.g001


2. Carry out a variety of analyses of the pseudogenes such as
  • looking at ratios of Ka/Ks (this is in essence a ratio of amino acid changes – aka non synonymous substitutions to “silent” synonymous changes which occur when the DNA sequence changes but the same amino acid is encoded).
  • examining the types and frequencies of gene inactivating mutations
3. Then they looked at the “ages” of pseudogenes – with age being estimated by the position in the tree in which the pseudogenes appear to have arise.
4. Finally the examined the age class distribution of pseudogenes as well as whether there were other differences between pseudogenes of different ages. And what they found was inconsistent with a neutral model. Instead, what they conclude is that something is making it advantageous to delete pseudogenes more rapidly than one might expect.
What explains this? After testing multiple possibilities the authors conclude that their is some negative selection against pseudogenes (or I guess positive selection for deletion of pseudogenes).
They conclude by suggesting this is likely to be pervasive across all bacteria and even in archaea. And furthermore make a connection to possible selection on intron size in eukaryotes. Anyway – the paper seems quite interesting and worth a read. Still pondering what it all means, so I would welcome comments.

Kuo, C., & Ochman, H. (2010). The Extinction Dynamics of Bacterial Pseudogenes PLoS Genetics, 6 (8) DOI: 10.1371/journal.pgen.1001050

Evolution diet for pets. Why not for people? Oh wait, of course that has been thought of before

Yup, that is right.  There are evolution based diets for pets.  Do you think people will use them if they don’t believe in evolution?  Do pets believe in evolution? Is an evolution based diet better than other diets? Actually, for the last question, it seems plausible that evolution informed diets could be of some use, but still seems funny to see it in pet food. 
Why isn’t there an evolution diet for people yet? Oh wait, there is
And there is all sorts of stuff out there about evolution and diet including
Just goes to show you, with google you can find that just about everything you can think of already exists. 

Summary of #iEVOBIO Day 2, #phylogenetics #informatics #opensource #biodiversity #evolution

This is a continuation of notes on iEVOBIO meeting.  Much of this comes from twitter.  Additional comments will be posted over the next few days.  See notes on Day 1 here. Note – thanks to the people who answered my query on twitter about how to remove spurious html code from pages – I wrote this post a few days ago but somehow the copying and pasting I did from twitter broke blogger with some weird html.  I ended up using Zubrag which was suggested by brendanwlocke.

Day 2. Beginning.

Since I was leaving that PM I could not borrow a hotel bike and bike over again.  So I walked from the hotel, along the river 2 or so miles, after getting coffee.  I got to the meeting a bit late and thus missed much of the opening Keynote, which was a bummer since it seemed very good.  But I find if I do not get some exercise every day at a meeting I go crazy so it was worth it.  Here are some notes on Day 2.

Day 2. Part 1. Keynote

Alas, I missed much of the keynote.  But what I caught was good.  Here are some notes, mostly from twitter. And also see Rob G’s slides below

Rob Guralnick: Biodiversity Discovery and Documentation in the Information and Attention Age

    Day 2. Part 2. Short talks.

    Day 2. Part 3. Lunch

    Went to lunch with a small group to the People’s Sandwich of Portland.  Took the light rail over the river.  Portland has some really nice features – like free light rail in the city.

    Day 2. Part 4. Lightning Talks

    Day 2. Part 5. Birds of a feather


    Then there were breakout sessions which were called “Birds of a feather”  I went to one discussing open access and fair use issues. 

    Day 2. Part 6. Wrap up and other general information

    And then I had to go home.  The meeting wrapped up as I was leaving. Here are some ending tweets:

    Some other general info tweets:

    Post meeting posts and blogs

    Summary of #iEVOBIO Day 1 #evolution #phylogenetics #informatics #opensource

    Well, just getting around to writing up some thoughts on the iEVOBIO meeting I went to earlier this week.  It was really quite excellent so here are some thoughts/notes.  Today I am writing about the background and Day 1.  Most of this is simply a catalog of what happened along with some twitter details … In a few days I will write up a post on what I think it meant ….

    The background: how I heard about iEVOBIO (skip to below if you just want to know about what happened in the meeting)  

    The first I heard about regarding the meeting was Dec 7, 2009, in a Direct Message on Twitter from @rdmpage.  That would be Rod Page, who I had never met, but followed remotely via twitter, his blog, his software and his papers.  He wrote

    Hi Jonathan, hope you got my email about speaking at iEvoBio in June. No pressure, just checking that it made it into your in box.

    I had known about Rod for a long time since I had used his software since I was in grad school.  For example, I used to use Treeview for all phylogenetic tree viewing/drawing etc.  It seems from the history, it has been available since 1996.  Not 100% when I started using it, but it was around then.  Then I switched over to using Treeview X a few years later.  And I have used on and off some of his other software.  More recently I have followed his blog/tweets/web sites closely.

    When Rod invited me, I was on a mini vacation in Monterrey and I had not actually seen his email yet (I am ALWAYS behind in reading email).  So I found the email, inviting me to give a Keynote at this cool sounding iEVOBIO meeting focusing on informatics for phylogenetics, evolution and biodiversity.  Sounded great actually.  Especially the part about Open Source:

    iEvoBio and its sponsors are dedicated to promoting the practice and philosophy of Open Source software developmentand reuse within the research community. For this reason, if a submitted talk concerns a specific software system for use by the research community, that software must be licensed with arecognized Open Source License, and be available for download, including source code, by a tar/zip file accessed through ftp/http or through a widely used version control system like cvs, Subversion, git, Bazaar, or Mercurial

    I also liked the notion of a challenge – in this case there was a challenge for new visualization methods for evolutionary data. In summary the challenge was:

    From phylogenetic trees to population networks, whether on printed pages or in GoogleEarth, visualizing evolution is a key part of our discipline. Inspired by the challenges and opportunities visualizing presents for our field, the first iEvoBio challenge is “To create a new visualization tool or platform to support evolutionary science”.

    Alas, since I was on vacation I did not have all my schedule information with me, so I said I was not sure. Fortunately, when I got back, it looked like good timing with the Evolution meeting just before so I said sure.

    Going to iEVOBIO (skip to below if you just want to know about what happened in the meeting)

    Anyway – jump to last week, skipping over some of the preparatory stuff for the meeting. I was planning on being in Oregon for almost a week, including the SSE meeting just before iEVOBIO and a meeting for my iSEEM project in Eugene before that.  But I just could not deal with being away for that long including over the weekend, after having not really taken any time off in a while.

    So I went home and skipped SSE2010 and then headed back to iEVOBIO on Monday the 28th. I flew on Southwest from Sacramento to Oregon, took the light rail into the city, and walked the last bit to my hotel.  I arranged to have dinner with Aaron Darling, a Research Scientists working in/with my lab who was at SSE.  We had a good dinner and then I went back to my room and stayed up until about 3:30 AM working on my keynote talk.

    I really wanted to include some new stuff and also include some background on microbes and microbial diversity and so worked very late making new slides, piecing together slides from multiple talks, and then trying to delete slides since my talk was way way way too long.  The final project I did not finish that night.

    I set my alarm on my phone and asked for a wake up call and got about 3 hours sleep. I got up around 6:30 worked on my slides for an hour, and then took a showed and heading downstairs where I borrowed a hotel bike (god, I love Portland – free bikes at my hotel) and biked along the river, over the bridge, and after a little hunting around found where I could park and lock the bike.  And I went in.

    I worked on my talk for another 30 minutes in an isolated corner and then went over to the main part of the conference center. Finally I was done.  I was amazed at how crowded it was.  Were all of those people there for iEVOBIO?  Alas, no – SSE2010 was still going (I did not realize it would still be on).

    After asking around I found the meeting room and met the one and only Rod Page (we had never met).  I made sure my laptop would connect with their system and then headed out to get coffee – there was a Starbucks in the hallway outside the meeting room.  Alas there was a giant line and my talk was in 25 minutes.  Fortunately, Aaron Darling was in the line and he agreed to purchase a latte for me.  I went back in, made sure everything was set, and paced around until I got my coffee from Aaron and then it was time for the meeting to start.

    The meeting itself: Day1 part 1: keynote by me 

    The meeting kicked off with a few details from some of the organizers including Rod Page and Todd Vision.  We found out who the other organizers were (Rod Page (University of Glasgow), Cecile Ane (University of Wisconsin at Madison), Rob Guralnick (University of Colorado at Boulder), Hilmar Lapp (NESCent), and Cynthia Parr (Encyclopedia of Life).  We also found out who helped fund the meetings (US National Evolutionary Synthesis Center (NESCent), and the Society of Systematic Biologists (SSB). I am no longer sure exactly what else they said.  But there seems to have been at least one tweet about the intro:

    • toranaga Todd Vision mentions how computational biology is a guild, full of people that take great pride in their craft.

    And then Rod introduced me.  Pretty funny actually.  He gave me grief for writing about bad omics words and yet inventing and then using phylogenomics for everything.  And then my talk was on.  Here it is on slideshare.

    Also – there were a few tweets about my talk including the following:

    I got asked some great questions afterwards including one by Joe Felsenstein, one by Arlin Stoltzfus, and one by James McInerney (well, McInerney mostly gave me grief about how he disagreed with me about the extent of lateral gene transfer).

    Day1 part 2:  Short talks 

    After my talk, thankfully for all involved, there was a coffee break.  And then we were back with short, ~15 min talks. These are listed below with some information, most of it from Twitter.

    Day 1: part 3: Lunch – here are some tweets that came out around that time …

    Day 1 part 4: Challenge talks

    And then the post lunch challenge talks began.  These related to visualization tools entered into the meeting challenge mentioned above and described here.

    Day 1. Part 5. Lightning talks.   

    • dgaston83 Now time for the lightning talks. Gives me a good idea what to expect for mine tomorrow. Glad I’m on day 2!#ievobio
    • toranaga #ievobio lightening talks! 5 minutes and then the gong goes off

    Day 1. Part6. Software bazaar and demos

    Then there were was the software bazaar and challenge demonstrations, which alas, I skipped most of because of the lack of sleep the night before.  It seemed quite packed in there and I was just exhausted. So I went back to my hotel, riding the bike I had borrowed from the hotel back, slowly, along the river.

    Here is what I missed:

    • Software bazaar
      • W. Berendsohn: The EDIT Platform for Cybertaxonomy
      • R.J. Challis: Pipefinder – semantic pipelines made easy
      • B. Gemeinholzer: DNA Bank Network Ð a virtual linkage of natural history collections’ voucher specimens and documentation with physical DNA, sequences, and publications
      • M.J. FavŽ: eFECTIV: Shape analysis using elliptical harmonics
      • T.M. Keesey: Names on Nodes: Automating the Application of Taxonomic Names within a Phylogenetic Context
      • S. Lewis: Functional Gene Ontology Annotation across Species using PAINT
      • S. McKay: GBrowse_syn
      • M. Porter: iBarcode-nextgen: tools for next generation biodiversity analysis
      • D. Rosauer: Biodiverse, a tool for spatial analysis of biological diversity
      • R. Scherle: The Dryad Digital Repository
      • C.L. Strope: indel-Seq-Gen version 2.0
      • M. Youngblood: mt-tRNA-Draw
    • Challenge demonstrations
      • M. Porter: GenGIS
      • K. Urie: VoLE (Viewer of Life in EOL)
      • V. Gopalan: Nexplorer3
      • A. Hill: PhyloBox
      • S. Smits: jsPhyloSVG

    Summary of Day 1. 

    Here are some tweets summarizing Day 1:

    • oatp Open science and data sharing at Evolution 2010 andiEvoBio: Posted by petersuber to oa.notes oa.biology oa.new oa… http://bit.ly/bOocHh
    • chrisfreeland @rdmpage @cydparr sounds like #ievobio is going well & interesting. v. sorry I couldn’t work it in!
    • trinaeroberts Demos of Biodiverse, GenGIS, various tree visualizers… very cool stuff at #iEvoBio today!
    • justsayinnn someone explain to me how the ievobio wants open source yet they charged for registration???
    • justsayinnn data visualization stuff was way cool today at #ievobio

    At the end of the day, I had dinner with Steven Kembel and Tom Sharpton (@toronaga) who I work with on a Gordon and Betty Moore Foundation funded project we call iSEEM.  Dinner and conversation were great.  I then went for a walk along the river and went back to my room.

    Twisted tree of life award #5: Nicholas Wade & use of higher, lower, ladders, etc

    Nicholas Wade has a new article in the New York Times critiquing some aspects of the human genome project (A Decade Later, Gene Map Yields Few New Cures – NYTimes.com)

    Whether one agrees with his critiques or not, I hope that everyone can recognizes that one section on evolution is, well, awful. Wade writes

    First was the discovery that the number of human genes is astonishingly small compared with those of lower animals like the laboratory roundworm and fruit fly. The barely visible roundworm needs 20,000 genes that make proteins, the working parts of cells, whereas humans, apparently so much higher on the evolutionary scale, seem to have only 21,000 protein-coding genes.

    While Mr. Wade may want to believe he and humans in general are somehow “higher” on some evolutionary ladder than other species, I have some news for him

    THERE IS NO FU*$ING EVOLUTIONARY LADDER.

    Humans are neither higher nor lower than any other organisms. This is an antiquated and inane view of evolution. Sure, humans are smart. Sure we are more complex in some aspects than, say, some bacteria. But new features evolve on ALL branches in the tree of life. And some organisms lose features present in their ancestors. The evolution of complexity is, well complex, sure, but please, “higher” and “lower” organisms? An evolutionary ladder? Uggh.

    I do not pay much attention to human GWAS studies, but if Wade’s understanding of them is akin to his understanding of evolution, well, I would then infer that GWAS studies have revolutionized all of medicine. For his butchering of evolution, I am giving Nicholas Wade my 6th coveted “Twisted tree of life award

    —————————-

    More on this topic can be found at:
    Larry Moran’s Sandwalk
    Larry Moran has a good discussion of the genes in the human genome issue (from 2007)
    PZ Myers at Pharyngula Chimes in

      Holy lateral transfer batman; amazing story on fungal to aphid transfer from Nancy Moran

      As many know, I generally do not write a lot about papers in non open access journal because I like readers to be able to access all the papers which I write about. But this is one of the exceptions to my normal rule. An amazing paper was published a few days ago in Science by Nancy Moran and Tyler Jarvik. Lateral Transfer of Genes from Fungi Underlies Carotenoid Production in Aphids — Moran and Jarvik 328 (5978): 624 — Science
      I first found out about this from Ed Yong’s blog post here (just a note – his Not Exactly Rocket Science is such a frigging incredible blog). He really does the whole story on this so I am just posting a bit here.
      Anyway Moran and Jarkiv paper focuses on genes in the aphid genome that encode enzymes for carotenoid synthesis. These enzymes are involved in red and/or green coloring seen in the pea aphids. Recently the pea aphid genome was sequenced (a paper about this was published in PLoS Biology ) and it was analysis of the genome data that helped lead Moran and Jarvik to the study reported in the recent issue of Science.
      In their study they report a detailed evolutionary and phylogenetic analysis of the carotenoid synthesis genes found in the aphid genome and show quite convincingly that these genes do not appear to be of “normal” descent. That is, they seem to have an ancestry separate from many of the “normal” animal genes in the genome. Instead, these genes are related to genes from fungi. In fact, these genes are embedded in an evolutionary sense, in a group of genes which are all from fungi and thus Moran and Jarvik conclude the most likely explanation is that some time in relatively recent pea aphid evolutionary history, these genes were acquired from some fungus.
      About to have some eye drops put in my eyes so gotta go for now, but just wanted to get something out there about this fascinating work. For more on this story – there is lots out there, such as the following:

      Moran, N., & Jarvik, T. (2010). Lateral Transfer of Genes from Fungi Underlies Carotenoid Production in Aphids Science, 328 (5978), 624-627 DOI: 10.1126/science.1187113

      . (2010). Genome Sequence of the Pea Aphid Acyrthosiphon pisum PLoS Biology, 8 (2) DOI: 10.1371/journal.pbio.1000313

      ResearchBlogging.org

      Most important paper ever in microbiology? Woese & Fox, 1977, discovery of archaea

      Well, today in my “Microbial phylogenomics” class at UC Davis we are discussing what I think might be the most important paper (well, actually, series of papers) in the history of microbiology. The papers are the ones where Carl Woese, George Fox and colleagues outline the evidence for the existence of a “hidden” third major branch in the tree of life – what is now known as the archaea. The evidence for this third branch was first laid out in a series of papers in 1977 including:

      Now Woese, Fox and others in Woese’s group had been leading up to these publications in ways for years (I note, there were some pretty incredible people involved in these studies in the years before 1977 too including Mitch Sogin, now at MBL, David Stahl, Chuck Kurland, Norm Pace, etc but that is another story). They had been determining the nucletide sequences of small fragments of rRNAs from different species, especially from different organisms that did not have nuclei – the so-called “prokaryotes”. And they were using these sequences to infer the phylogenetic relationships among these microbes.
      Consider for example, the paper by SJ Sogin et al in 1972 “Phylogenetic measurement in procaryotes by primary structural characterization. Sogin SJ, Sogin ML, Woese CR. J Mol Evol. 1971;1(1):173-84. This paper laid out some of the arguments for why rRNA sequence information might re-write our concepts of classification of prokaryotes. From this and many of the other papers from Woese and Fox and others before 1977 it had been shown that one could use rRNA sequence information to more accurately infer relationships among “prokaryotes” than had been done previously with other types of information. Today this notion that we can use sequence information to infer the evolutionary history of microbes is taken for granted but back in the early 1970s it was not. And in addition, many people probably just did not care too much about the exact details of microbial phylogeny and classification.
      But this changed in the 1977 with that series of papers outlined above. What these papers showed was that hidden beneath everyone’s noses was a separate, previously unknown, major split in the prokaryotes into two distinct lineages. One of these included all the standard bacteria people were familiar with like E. coli and B. subtilis and one of them included some pretty weird wacked out bugs that thrived in extreme conditions. For example, look at the phylogenetic tree from Fox et al.
      This tree (made using a distance based clustering algorithm where the distances represent a measure of the similarity of the catalog of short ologonucleotides found in each species) shows the normal bacteria on one side (down below) and methanogens and their relatives on another side. I like the last line of the abstract, which to an evolutionary microbiologist can be considered equivalent to Watson and Crick’s “It has not escaped our notice …”. Here Fox et al. say “These organisms appear to be distantly related to typical bacteria”
      The Bach et al. paper has similarly interesting, cool nuggets. However, alas, it is not available in PubMed Central as are the other two papers here I am not focusing on it. What is great though is that the other two papers are freely available to anyone to read in Pubmed Central and also at the PNAS web site. Yay for access. Too bad the other paper is not freely available.
      Anyway, fortunately, the most critical of these papers is the Woese and Fox paper from PNAS which is freely available And it is in this paper that they full argument is laid out. Consider the abstract:

      ABSTRACT A phylogenetic analysis based upon ribosomal RNA sequence characterization reveals that living sys.tems represent one of three aboriginal lines of descent: (i) the eubacteria, comprising all typical bacteria; (ii) the archaebacteria, containing methanogenic bacteria; and (iii) the urkaryotes, now represented in the cytoplasmic component of eukaryotic cells.

      In this paper they lay out the evidence for the existence of at least three main branches in the Tree of Life. Interestingly, for the phylogenetically minded people out there, they do not show an evolutionary tree in the paper. What they show is what is known as a similarity matrix (the inverse in essence of the distance matrices many people may be used to seeing) where a score is given for the similarity between organisms in the fingerprints of their 16S/18S rRNAs).
      If one scans through the matrix one can clearly see three clusters of similarity scores
      From this table, Woese and Fox infer the existence of three primary branches in the tree of life. This is laid out in a few paragraphs starting with one at the bottom of page 5088.

      A comparative analysis of these data, summarized in Table 1, shows that the organisms clearly cluster into several primary kingdoms. The first of these contains all of the typical bacteria so far characterized …. (lots of names here) … It is appropriate to call this urkingdom the eubacteria.

      And then a second paragraph discusses the second group
      A second group is defined by the 18S rRNAs of the eukaryotic cytoplasm-animal, plant, fungal, and slime mold (unpublished data). … (They call this lineage the urkaryotes).
      And then the third paragraph lays out the revolution:

      Eubacteria and urkaryotes correspond approximately to the conventional categories “prokaryote” and “eukaryote” when they are used in a phylogenetic sense. However, they do not constitute a dichotomy; they do not collectively exhaust the class of living systems. There exists a third kingdom which, to date, is represented solely by the methanogenic bacteria, a relatively unknown class of anaerobes that possess a unique metabolism based on the reduction of carbon dioxide to methane (19-21). These “bacteria” appear to be no more related to typical bacteria than they are to eukaryotic cytoplasms. Although the two divisions of this kingdom appear as remote from one another as blue-green algae are from other eubacteria, they nevertheless correspond to the same biochemical phenotype. The apparent antiquity of the methanogenic phenotype plus the fact that it seems well suited to the type of environment presumed to exist on earth 3-4 billion years ago lead us tentatively to name this urkingdom the archaebacteria. Whether or not other biochemically distinct phenotypes exist in this kingdom is clearly an important question upon which may turn our concept of the nature and ancestry of the first prokaryotes.

      Mind you, the whole paper is worth reading, but those three paragraphs lay out a revolution in how one thinks about the tree of life. Now admittedly, some of our notions of the tree of life have changed since 1977 and there is much more of a feeling of mixing and merging between branches than was appreciated back then. And some definitely feel that the archaebacteria (or archaea as they are known today) are not per se a third branch in the tree of life but rather than there are four or five major branches and that archaea may not in fact be a “monophyletic grouping”. But whether you think archaea truly represent a third branch in the tree of life or not, this paper fundamentally altered how we think about the tree and about microbes. The work was even written up in the New York Times and got a lot of press (not that that is proof of anything – but it got microbial phylogeny into the public’s mind).
      I think it is worth having all biology students read and understand this paper. Which is why I now try to cover it in basically all classes whenever I can. I could go on and on, but I will simply end with their last paragraph:

      With the identification and characterization of the urkingdoms we are for the first time beginning to see the overall phylogenetic structure of the living world. It is not structured in a bipartite way along the lines of the organizationally dissimilar prokaryote and eukaryote. Rather, it is (at least) tripartite, comprising (i) the typical bacteria, (ii) the line of descent manifested in eukaryotic cytoplasms, and (iii) a little explored grouping, represented so far only by methanogenic bacteria.

      Citations
      Woese CR, & Fox GE (1977). Phylogenetic structure of the prokaryotic domain: the primary kingdoms. Proceedings of the National Academy of Sciences of the United States of America, 74 (11), 5088-90 PMID: 270744

      Fox GE, Magrum LJ, Balch WE, Wolfe RS, & Woese CR (1977). Classification of methanogenic bacteria by 16S ribosomal RNA characterization. Proceedings of the National Academy of Sciences of the United States of America, 74 (10), 4537-4541 PMID: 16592452

      Balch WE, Magrum LJ, Fox GE, Wolfe RS, & Woese CR (1977). An ancient divergence among the bacteria. Journal of molecular evolution, 9 (4), 305-11 PMID: 408502


      Some related posts