Cellular Consequences of Genetic variation

Friday, September 15, 2006

Comparative Interactomics on the rise (II)

Two papers on interaction network alignment caught my attention recently. Both of them are supposably more efficient ways of doing network comparison than the previously published PathBlast from the Ideker lab.

The first one called Græmlin was published in Genome Research and the second one developed by Mehmet Koyuturk is called Mule(PDF) and is in press in the Journal of Computational Biology.

More on the buzz department, the MIT technology reviews elected comparative interactomics one of 10 “emerging technologies” of the year. Highlighting Ideker and Palsson as two of the people pushing the idea forward.

That's it, I figure the meme gained enough traction to "spread" in the wild and we can expect several variations on the algorithms to surely continue to flow along with even more interaction networks.
Psi-Network-Blast anyone ?

Open access publishing (physicists viewpoint)

There is a very interesting discussion going on about open access publishing in a physics blog. The author also links to a recent editorial in Nature Physics.

A lot of the discussions are on how to certify content after submission to a pre-print server and how expensive should the whole process really be. Going through it the thing that most impressed me is that everyone seems to accept naturally the usefulness of a preprint server. Ken Muldrew in one of the comments says:

"something like the arXiv is sort of a bridge between conference talks and publications; a new phenomenon that doesn’t replace the old ways but rather adds to them, like email as it relates to phone conversations and page-written letters."

So why are there not more bioinformatics manuscripts in pre-print servers ? Most bioinformatic/computational biology journals accept submissions from papers already in pre-print servers. I subscribe to the arXiv feed on quantitative biology but most papers seem to a bit away from biology when compared to work published in say Bioinformatics, PLoS Comp Bio and BMC Bionformatics. This recent manuscript about horizontal gene transfer shows that arXiv does accept the type of work that I might participate in so I will try to in the future submit there first.

Thursday, September 14, 2006

Marc Vidal interview in a podcast

(via My Biotech Life) Rick pointed me to a podcast series called Futures in Biotech. I am listening to a two part interview with Marc (Interactomes) Vidal, explaining his interest in parts and networks on a very accessible level. There are some interesting historical little stories for those more familiar with the field.

Monday, September 11, 2006

Postgenomic Upgrade

I don't remember reading any announcement but I guess Stew upgraded Postgenomic. It's full of extra information that will give anyone a chance to procrastinate for some time. Did you ever wonder what is the Gunning-Fog Index of your blog ? Your incoming bloglove ? (whatever that means) Your most popular blog post ? Go have a look at PG.

He also added a section on original research that I guess is still experimental. My two posts on the correlation between protein age and protein interactions are there but I guess he had to mark them by hand.

On a related note, Stew released a greasemonkey script to add links to postgenomic from connotea. I would love to see journals starting to link directly to postgenomic without us having to use greasemonkey scripts. This would give bloggers an extra little incentive to write some comments on papers and for the journals it's free content and extra attention on their sites.

Now .. back to my thesis ...

Friday, September 08, 2006

Building reality online (the lonelygirl15 meme)

I am not a big fan of video sites so today was the first day I heard about lonelygirl15. She is a very famous video blogger with videos seen a total of about 2 million times. The interesting thing is that she does not actually exist. She is part of an art project intent on doing something like a wiki soap:

Thank you so much for enjoying our show so far. We are amazed by the overwhelmingly positive response to our videos; it has exceeded our wildest expectations. With your help we believe we are witnessing the birth of a new art form. Our intention from the outset has been to tell a story-- A story that could only be told using the medium of video blogs and the distribution power of the internet. A story that is interactive and constantly evolving with the audience.

Even if people lose interest now that they know she is not real, the fact was that they were drawn into her fabricated life. Is this ethically correct ?
I guess what I am getting at is that we are getting better at creating virtual reality. The truth is what the majority of people perceive to be true, the wiki kind of truth. This is already the case in physical reality with mass media having a strong effect on what is actually perceived. What I think is that this manipulation of masses and crowd effects are much stronger online.

Tuesday, September 05, 2006

Social network dynamics in a conference setting

(disclaimer: This was not peer reviewed and is not serious at all :)

To study the dynamics in social network topology we decided evaluate how some nodes (also called humans) interact in defined experimental conditions. We used the scientific meeting setting that we think can serve as a model for this type of studies. We observed human-human interactions during the meeting breaks by taking snapshots and calculating inter-human distances. We defined an arbitrary cut-off to determine the binary interactions between all the humans present in the study.

The first analysis we preformed was under the so call "conference breaks" model where our nodes are allowed to interact for brief time intervals after being subjected lengthy lectures.
We observed an interesting clustered network topology that can be described with a power law distribution. Most nodes in the network have few interactions while a small fractions of humans was found to consistently interact with a large number of other nodes. We found also some nodes that did not show any interactions in our studies even when several "conference breaks" were preformed. We believe that these could be pseudo-humans that were included in our study by mistake. These pseudo-humans might be on the way to extinction from the humeone.

Having built this network of human-human interaction on a large scale we decided to investigate what human properties might be correlated with human hubs. We used previous large-scale studies of human properties like height, gender and number of papers published to test this.
We show here that although gender shows a significant correlation with human hubness, the best predictor for hubs in the conference breaks networks is actually number of papers published. We tried to refine this further by introducing a new human measurement we call "hypeness". Hypeness of a human was calculated as a modification of the number of papers published weighted by the impact factor of the journals where the papers were published and also the number of times cited in popular media articles. We show here that hypeness does significant better at predicting hub nodes in this network.

Given that networks are dynamic we set out to map the changes in network structure with time. To simulate this we perturbed the gathering using a small-compound (EtOH) that we administered in liquid form. With time we observed a noticeable change in the network. Although the overall topological properties were maintained, the nature of the hubs changed dramatically. In this new network state that we call the "drunk" state, the best predictor for the highly connected hubs is clearly gender. We believe this clearly proves that social networks in conference settings are very dynamic with time.

To prove that gender was indeed the best indicator of hubness and not some strange artifact we used deletions studies. Random female nodes where struck with a sudden case of "sleepiness" and the perturbed network was observed. We show here that random female deletion leads to a rapid collapse of the network. The same is not observed with random deletion of the hypest nodes, proving our initial proposition.

Propagation of Errors in Review Articles

Thomas J. Katz signs a small letter in Science warning us about the propagation of errors in review articles. The author gives a scary example of an incorrect citation propagated through 9 reviews (if I counted correctly). The cited paper does not contain the experiment that all the reviews mention and as it seems it was actually never published anywhere. Very scary.
As more science moves online with more individual voices, will this propagation of errors be accentuated or reduced?

How to recognize you have become senior faculty

I am back from holidays and trying to plow trough the RSS feeds/content alerts that accumulated in these two weeks. I might post on couple of things that catch my eye.
Here is a funny editorial from Gregory A Petsko talking about the project to sequence Homo neanderthalensis.
The editorial is actually more about senior faculty members and in particular how to identify one:

- You are senior faculty if you can actually remember when more than 10% of submitted grants got funded.
- You are senior faculty if you can remember when there was only one Nature.
You are senior faculty if you still get a lot of invitations to meetings, but they're all to deliver after-dinner talks.
- You are senior faculty if students sometimes ask you if you ever heard Franklin in person, and they mean Benjamin, not Aretha.
- You are senior faculty if a junior colleague wants to know what it was like before computers, and you can tell her.
- You are senior faculty when the second joint on the little finger of your left hand is the only joint that isn't stiff at the end of a long seminar.
- You are senior faculty if you sleep through most of those long seminars.
- You are senior faculty if you visit the Museum of Natural History, and the dummies in the exhibit of Stone Age man all remind you of people you went to school with.
- You are senior faculty if you find yourself saying "Back in my day" or "When I was your age" at least twice a week.
- You are senior faculty if you actually know what investigator-initiated, hypothesis-driven research means.
- You are senior faculty if you occasionally think that maybe you should attend a faculty meeting once in a while.
- You are senior faculty when your CV includes papers you can't remember writing.

Monday, September 04, 2006

Bio::Blogs #3

The third edition of Bio::Blogs was released a couple of days ago in business|bytes|genes|molecules.
I particularly enjoyed the nice discussions going on in evolgen , about the rifts in scientific communities and in Neil's blog regarding structural genomics data.

The next Bio::Blogs will be edited by Sandra Porter. Send your links and offers to host future editions to bioblogs{at}gmail.com.

Sunday, August 27, 2006

Bio::Blogs #3 - call for submissions

The third edition of Bio::Blogs will be up on the 1st of September, edited by mndoci. Send your submissions to the usual email bioblogs {at} gmail.com. There were few submissions so far. It might be a slow month if a lot of people took some time off, or maybe most people are waiting for the last day as usual.

I am back from holidays, trying to digest the emails and RSS feeds accumulated in two weeks. I miss the beach already :).

Tuesday, August 15, 2006

Interactome Networks conference

I am going for a two week holidays tomorrow. I really need some boring relaxing days by the beach without thinking to much about anything :).

After that I am going to a conference, Interactome Networks on the Wellcome Trust Genome Campus in Hinxton, UK. I will give a short talk about my last project - "Specificity and evolvability in eukaryotic protein interaction networks". I will try to blog some of the talks, either here or in Nodalpoint.
Here is the current list of talks and posters. If by any chance you are not going and are particularly interested in some of the titles let me know and I will try to have a look.

Sunday, August 13, 2006

Science Foo Camp

I am to sleepy to post a coherent account of what happened today at scifoo, I'll try to do it tomorrow in Nodalpoint. It's an amazing mixture of people that they gathered here, social/bio/physics/computer scientists, science fiction writers (futurists?), journal editors, open access and open science advocates and googlers.

To ilustrate how chaotic the meeting has been, here is a picture of the session planing board they put up.

So anyone can just grab a pen and fill up a slot. The only problem so far was actually having too many parallel interesting sessions.

Thursday, August 10, 2006

Science Foo Camp

I am going to California this weekend to attend the Science Foo Camp. An event organized by Nature/O'Reilly and hosted by Google. Yeap I get to visit the Googleplex :) (I admit it am I geek). It has been fun to see the event getting set up. It started with a wiki page seeded with some ideas from O'Reilly and some instructions. Then everyone started editing their bios and suggesting/offering talks.
I will be blogging my impressions of the event over at Nodalpoint but probably a good way to keep track of the event is to take a look at the scifoo tag in connotea.

(official announcement)

Wednesday, August 09, 2006

Blogging ~~science~~ .. something like science

(via Open Reading frame and Uncertain Principles) This is what happens when you spend to much time in the lab. Dylan Stiles reports on his analysis of ... his own ear wax?! Creative post to say the least. :).

Tuesday, August 08, 2006

Identifying protein-protein interfaces

There were several interesting papers on protein interaction in the last week. For example, in PLoS Computational Biology, Kim et al described an improvement to a method that classifies interaction interfaces according to their geometry. The results of the analysis are available on their site SCOPPI.
It seems reasonable to believe that there is a limited number of protein interaction types (predicted to be around 10 thousand), much the same way that there is probably a limited number of folds used in nature. This database, along side others like the iPFAM database, provide templates on which to model other protein interactions with reasonable homology. As first proposed by Aloy and Russell, if we know that two proteins interact, for example with a yeast-two-hybrid experiment we might be able to use these databases to identity and model the interaction interface between the two proteins.

Here is an example of two complexes taken from the paper. The nodes and edges view hides the fact that RBP and RCC1 interact through different interfaces.

Although is has been very useful to look at protein networks as of bunch of nodes connected by edges for some analysis it would be much more informative to know what are the interacting interfaces.

The authors used the database to claim that:
- hub proteins interact with proteins using many distinct faces (what they actually show is that domains that interact with many different other domain types have more distint faces);
- two thirds of gene fusions conserve the binding orientation;
- the apparent poor conservation of interfaces is due to the diversity of interactions and partners (in my opinion it is more a suggestion that proof);
- the interfaces common to archae, bacteria and eukaryotes and mostly symmetric homo-dimers, suggesting that asymmetric and hetero interactions evolved from symmetric homo-dimers.

<speculation> Maybe if we could convert the current human interactome into more than nodes and edges we could try to see if some of disease causing polymorphisms can be explained by how they affect the interfaces. Maybe we could use this to do interaction KOs instead of whole proteins </speculation>

Saturday, August 05, 2006

PLoS ONE is accepting manuscripts - Update

(via PLoS blog and Open) The new PLoS ONE journal is now accepting manuscripts for review. ~~They are still taking care of a few bugs and at this moments I could only use the site with Internet Explorer.~~
We can know have a look at the editorial board and the journal policies. There are some hints of how the user comments and ratings are going to work.

Most journals have some form of funneling, either by perceived impact or subject scope. It is going to be interesting to see what happens with PLoS ONE, given that there is no editorial selection on these criteria.

Update - It is actually working fine with Firefox1.5. When I was seeing it before the the page layout was misbehaving. Sorry for the confusion.

Friday, August 04, 2006

Meta blogging

Neil Saunders' blog as moved to a new home. Fabrice Jossinet is back blogging about RNA and bioinformatics in Propeller Twist.
If you are interested in RNA and microbiology go check out the new blog of Rosie Redfield. She runs a a microbiology lab at the University of British Columbia and will be blogging about their current research.

If you are interested in a better way to track the comments and responses to your comments in other blogs have a look at coComments. You can have an RSS feed and/or a box in your blog with your comments and responses to the comments that you wish to track.

Wednesday, August 02, 2006

Tangled Bank #59

The latest issue of Tangled Bank is up at Science and Reason. It is the first time I participate with a submission. If you are into physics the author of the blog, Charles Daney, is considering starting a physics carnival.

Tuesday, August 01, 2006

Bio::Blogs#2

The second edition of Bio::Blogs is up at Neil's blog. Go check it out and participate with your insightful comments :). There is a lot of conference blogging on this issue, a nice way to get updates on the conferences you might have missed.
I am happy to say that we have volunteers for the next two editions. Deepak offered to host for September 1st and Sandra Porter will host the October 1st edition.

Saturday, July 29, 2006

The likelihood that two proteins interact might depend on the proteins' age - part 2

Abstract
It has been previously shown [1] that S. cerevisiae proteins preferentially interact with proteins of the same estimated likely time of origin. Using a similar approach but focusing on a less broad evolutionary time span I observed that the likelihood for protein interactions depends on the proteins’ age. I had show this previously for the interactome of S. cerevisiae [2] and here I extend the analysis to show that the same is also observed for the interactome of H. sapiens. Importantly the observation does not depend on the experimental method used since removing the yeast-two-hybrid interactions does not alter the result.

Methods and Results
Protein-protein interactions for H.sapiens were obtained from the Human Protein Reference database and from two high-throughput studies excluding any interactions derived from protein complexes. I considered only proteins that were represented in this interactome (i.e. with one or more interactions).
As before I created groups of H. sapiens proteins with different average age using the reciprocal best blast hit method to determine the most likely ortholog in eleven other eukaryotic species (see figure 1 for species names). For a more detailed description of the group selection and the construction of the phylogenetic tree please see the previous post [2].
It is important to note that the placement of C. familiaris does not correspond with other published phylogenetic trees it might be due to the proteins selected for the tree construction. I should consider using different combinations of ancestral proteins to check the robustness of the tree.

In table 1 we can see the likelihood for protein interactions to occur within the ancestral proteins of group A and between the ancestral proteins and other groups of decreasing average age. As published by Qin et al. and as I had observed before for S. cerevisiae, the interactions within groups of the same age (group A) are more likely than between groups of proteins of different times of origin. Also, the likelihood for a protein to interact with an ancestral protein depends on the age of this protein. Confirming the pervious observation that the younger the protein is the less likely it is to interact with an ancestral protein.

I redid the analysis excluding yeast-two-hybrid interactions from the dataset. As it can be see in table 2, the results are qualitatively the same. There is a small increase in the likelihood of interaction with the ancestral proteins for the youngest group (highlighted in red in table 2) that is likely due to lack of data.

Caveats and possible continuations

I still have to test the statistical significance of these observations and control for possible other effects like protein size and protein expression that could explain these results.
I am interested in continuing this further as an open project. Fallowing the suggestion of Roland Krause I will soon start a wiki page to dump the data bits accumulated for open discussion. Hopefully more people will join in and maybe we can together shape up a small communication.

[1]Qin H, Lu HH, Wu WB, Li WH. Evolution of the yeast protein interaction network. Proc Natl Acad Sci U S A. 2003 Oct 28;100(22):12820-4. Epub 2003 Oct 13
[2]Beltrao. P The likelihood that two proteins interact might depend on the proteins' age Blog post