Phylojeanomics and The Jeans of Life

Well after the PLoS Nature dust up, I thought we needed a little humor in our lives.  So here is an old Levis add with a distinct evolution theme.  Maybe Jindal should propose a Levis boycott?

Forget Lincoln-Douglas – How about a Lincoln-Darwin debate?

In case you did not see it – it is worth seeing the discussion of Lincoln vs. Darwin in Newsweek (How Darwin and Lincoln Shaped Us).  They set up the discussion by pointing out that they had the same birthday.  They say Lincoln was more important.  Not going to argue but not sure they are right.  My favorite section on this article:

This questioning spirit is one of the most appealing facets of Darwin’s character, particularly where it finds its way into his published work. Reading “The Origin of Species,” you feel as though he is addressing you as an equal. He is never autocratic, never bullying. Instead, he is always willing to admit what he does not know or understand, and when he poses a question, he is never rhetorical. He seems genuinely to want to know the answer. He’s also a good salesman. He knows that what he has to say will not only be troubling for a general reader to take but difficult to understand—so he works very hard not to lose his customer. The book opens not with theory but in the humblest place imaginable: the barnyard, as Darwin introduces us to the idea of species variation in a way we, or certainly his 19th-century audience, will easily grasp—the breeding of domestic animals. The quality of Darwin’s mind is in evidence everywhere in this book, but so is his character—generous, open-minded and always respectful of those who he knew would disagree with him, as you might expect of a man who was, after all, married to a creationist.

Evolution education, Jindal and the election

There is an interesting piece on the “Science Education Act” in Louisiana in the New Scientist (see New legal threat to school science in the US ). by Amanda Gefter. This act seems to be designed to “lip ID in “through the back door” and is promoting itself as a bill for “Academic Freedom” Personally, I am all for academic freedom, including the ability to study and discuss all sorts of controversial things. However, it is clear this is not what the bill is really about. It is about teaching religion as science. The New Scientist reports

“Supporters of the new law clearly hope that teachers and administrators who wish to raise alternatives to evolution in science classes will feel protected if they do so. The law expressly permits the use of “supplemental” classroom materials in addition to state-approved textbooks. The LFF is now promoting the use of online “add-ons” that put a creationist spin on the contents of various science texts in use across the state, and the Discovery Institute has recently produced Explore Evolution, a glossy text that offers the standard ID critiques of evolution (see “The evolution of creationist literature”). Unlike its predecessor Of Pandas and People, which fared badly during the Dover trial, it does not use the term “intelligent design”.”

All I can say is that if McCain picks Bobby Jindal (the governor of LA and supporter of this bill) as his running mate it will be the ultimate proof that McCain is no longer the independent thinker he used to be and is instead a complete tool of others.

It is worth reading this article if you care about science education.

Only Nature could turn the success of PLoS One into a model of failure

Now, mind, you I like Nature as a publishing unit. They publish some very fine journals. Now, most of them are not Open Access, so I choose not to publish there if I can avoid it. But I still like them. And many of the editors and reporters there are excellent – smart, creative, insightful and such. But Nature the publisher can also be completely inane when it comes to writing about Open Access and PLoS. In a new article by Declan Butler, Nature takes another crack at the PLoS “publishing model”

The problem with PLoS now is … wait for this … the success of PLoS One. PLoS One it turns out is publishing a lot of papers (including one by me, today). And bringing in a decent amount of money to PLoS apparently (note for full disclosure – I am involved in PLoS Biology as “Academic Editor in Chief” and PLoS Computational Biology as an Academic Editor … although I should note I am not involved in financial discussions at PLoS in any way).
So why is the success of PLoS One a problem? Well, because it allows Nature to do the old good cop bad cop routine and to write, again, about the “failings” of the PLoS publication model. Now, mind you, the article does not quote a single source for what the PLoS publication model is. But they do say it has failed. From what I can tell here is the logic of the failure argument:
  1. Nature believes PLoS’ model for success revolved solely around PLoS Biology and PLoS Medicine and some of the other other PLoS journals being self sustaining after a few years.
  2. Analysis of some financial information suggests that PLoS Biology and Medicine currently are not breaking even
  3. PLoS One is apparently wildly successful and thus is brining in some money to PLoS.
  4. PLoS One publishes a lot of papers (they discuss this a bit and imply that this is a bad thing because some of the papers must be bad. Note – they do not back this up with any evidence. Silly for me to ask a science journal to use evidence)
  5. Therefore, the entire PLoS Publication model is a failure.
The problems with this logic are, well, large. Here are some:
  1. Does Nature really think that there ever was a single “model” for how PLoS should be evaluated?
  2. If so, where is the documentation of what this model actually was?
  3. Even if there was a PLoS model and even if it turns out to be not exactly what PLoS is doing now, what is the big deal? If you were a stockholder of any company and they told you “we are never going to change our business model no matter what happens in the world around us” I would recommend you not buy their stock. It is simply farcical to expect any entity to stick to a single simple model forever.
  4. Does not Nature supplement some of their bigger journals with their higher volume other journals?
  5. Most companies these days use high profile entities such as PLoS Biology and PLoS Medicine to attract attention to other portions of their company in order to help bring in money. Is this somehow not allowed by PLoS? Doesn’t Nature do the same thing?
  6. If you look at the figure Nature shows of PLoS $$$, it shows income rising in 2007 and expenses going down. How did that get turned into a bad thing?
So – I still do like Nature publishing because much of the time it has high quality stuff. It even has high quality stuff commenting/criticizing the Open Access movement and pointing out some of the challenges with it. But this article by Butler is not an impressive piece of work. I really wanted to give him an award but could not think of what to give.
See also (thanks to Bora for pointing out a bunch of these links)

My first PLoS One paper …. yay: automated phylogenetic tree based rRNA analysis

ResearchBlogging.org
Well, I have truly entered the modern world. My first PLoS One paper has just come out. It is entitled “An Automated Phylogenetic Tree-Based Small Subunit rRNA Taxonomy and Alignment Pipeline (STAP)” and well, it describes automated software for analyzing rRNA sequences that are generated as part of microbial diversity studies. The main goal behind this was to keep up with the massive amounts of rRNA sequences we and others could generate in the lab and to develop a tool that would remove the need for “manual” work in analyzing rRNAs.

The work was done primarily by Dongying Wu, a Project Scientist in my lab with assistance from a Amber Hartman, who is a PhD student in my lab. Naomi Ward, who was on the faculty at TIGR and is now at Wyoming, and I helped guide the development and testing of the software.

We first developed this pipeline/software in conjunction with analyzing the rRNA sequences that were part of the Sargasso Sea metagenome and results from the word was in the Venter et al. Sargasso paper. We then used the pipeline and continued to refine it as part of a variety of studies including a paper by Kevin Penn et al on coral associated microbes. Kevin was working as a technician for me and Naomi and is now a PhD student at Scripps Institute of Oceanography. We also had some input from various scientists we were working with on rRNA analyses, especially Jen Hughes Martiny

We made a series of further refinements and worked with people like Saul Kravitz from the Venter Institute and the CAMERA metagenomics database to make sure that the software could be run outside of my lab. And then we finally got around to writing up a paper …. and now it is out.

You can download the software here. The basics of the software are summarized below: (see flow chart too).

  • Stage 1: Domain Analysis
    • Take a rRNA sequence
    • blast it against a database of representative rRNAs from all lines of life
    • use the blast results to help choose sequences to use to make a multiple sequence alignment
    • infer a phylogenetic tree from the alignment
    • assign the sequence to a domain of life (bacteria, archaea, eukaryotes)

  • Stage 2: First pass alignment and tree within domain
    • take the same rRNA sequence
    • blast against a database of rRNAs from within the domain of interest
    • use the blast results to help choose sequences for a multiple alignment
    • infer a phylogenetic tree from the alignment
    • assign the sequence to a taxonomic group

  • Stage 3: Second pass alignment and tree within domain
    • extract sequences from members of the putative taxonomic group (as well as some others to balance the diversity)
    • make a multiple sequence alignment
    • infer a phylogenetic tree

From the above path, we end up with an alignment, which is useful for things such as counting number of species in a sample as well as a tree which is useful for determining what types of organisms are in the sample.

I note – the key is that it is completely automated and can be run on a single machine or a cluster and produces comparable results to manual methods. In the long run we plan to connect this to other software and other labs develop to build a metagenomics and microbial diversity workflow that will help in the processing of massive amounts of sequence data for microbial diversity studies.

I should note this work was supported primarily by a National Science Foundation grant to me and Naomi Ward as part of their “Assembling the Tree of Life” Program (Grant No. 0228651). Some final work on the project was funded by the Gordon and Betty Moore Foundation through grant #1660 to Jonathan Eisen and the CAMERA grant to UCSD.

Wu, D., Hartman, A., Ward, N., & Eisen, J. (2008). An Automated Phylogenetic Tree-Based Small Subunit rRNA Taxonomy and Alignment Pipeline (STAP) PLoS ONE, 3 (7) DOI: 10.1371/journal.pone.0002566

Looming news

Just having some fun with/at Carl Zimmer who claims he has some news to post very soon. See

Top five metaphors Darwin considered and rejected for "The Tree of Life"


Well, as I am desperately trying to prevent Hollywood from corrupting the term “Tree of Life” and I got to thinking. Why did Darwin and others have to use the metaphor of a tree to represent the branching history of organisms through evolutionary time? Why did other metaphors not get used? Well, thanks to a little research I did by communicating with Darwin directly (as I did when he announced his endorsement of Obama), I have found out that Darwin went through many other metaphors before settling on the “Tree of Life.” (note the tree-like figure here which is based on the one figure in the Origin of Species).

And here are the top 5 other such metaphors for what is now known as “The Tree of Life”

  1. The coral of life. Darwin particularly liked this one as he did some work on marine organisms. But ultimately he rejected it because he was worried about anti-evolutionists killing coral to get back at Darwin.
  2. The watershed of life. Rivers exhibit branching patterns much like trees. The big problem Darwin saw was that sometimes separate river branches reconnect to each other, which did not follow his model for descent. If only Darwin knew about lateral gene transfer.
  3. The blood vessels of life. Darwin was desperate to find visceral connections for people to evolution. This one would have been great. The big problem here was the “going” and “coming” nature of arteries and veins.
  4. The shrub of life. Thus turns out to have been one of Darwin’s favorites because it captures the richness of diversity more than a sparsely branching tree. However, shrub, even then, was used as a derogatory term to refer to height challenged individuals. And Darwin did not want to upset this key constituency so he avoided this term.
  5. The lungs of life. While this has some positive features (e.g., it is unidirectional like a tree), thankfully Darwin did not stick to this or we would have competition from “LOL” for all the domain names.

    Other possibilities included “the Plumbing System of Life” and “the Highway Map of Life”. If you know of others Darwin may have considered, let me know.

Hello world!

Welcome to WordPress.com. This is your first post. Edit or delete it and start blogging!

Calling all evolutionary biologists – protect the "Tree of Life" from extinction by Hollywood

OK, I am getting worried. As I wrote a few weeks ago, there is a new movie coming out supposedly next year called “The Tree of Life” starring Brad Pitt, Sean Penn and others. Well, the movie sounds like it could be good. But though this may sound like a cool thing to those of you out there who study the Tree of Life in some way, we are at risk here. There will soon be 100s of blogs, web pages, news stories, etc. writing about the Tree of Life movie. And uses of the Tree of Life by evolutionary biologists will lose their google rankings. The term “The Tree of Life” is at risk of a form of extinction.

So what are we to do? I am calling on all evolutionary biologists and bloggers and biologists in general to to link to other sites that use the term as much as possible. For example, you can add a link to my blog, which is after all called “The Tree of Life.”
So please, add links to some Tree of Life pages. And start using the term yourself as much as possible.
Here are some places you should link to
Please post additional links in the comments here and please help save “The tree of life”
———————————-
Note – here are some sites responding and helping save “The tree of life”

Jelly Bellies in the New York Times

Well, Jelly Bellies, a semi-local institution (they are based in Fairfield for those who have not seem them from I-80) has gotten some good press in the Times today. I hear they have a nice tour that kids and adults like but we have not done it yet.  I must say, I like their Sport Beans when I go for bike rides.  Anyway, just good to see local businesses getting some props in the Times.