Deliverables

Receive your data fully analyzed and focus on relevant interactions!

The exhaustivity of our ULTImate Y2H™ screens and the availability of a large data set of interactions in our database allowed us to develop sophisticated bioinformatics tools to analyze the results:

  > Prey identification
The 5p and 3p raw sequences of all prey fragments are used to create contigs (longest continuous sequence). Contigs are subsequently submitted to BLAST analysis after removal of repeat sequences and sequence quality check with PHRED algorithm.

  > Interaction and functional domains
Because our domain-based libraries are highly complex and screened at saturation (10-fold library coverage on average), multiple, independent fragments are isolated for each interactant. The "smallest common denominator" shared by all the fragments derived from the same protein defines the Selected Interacting Domain (SID®). This experimental interaction domain is shown in comparison with known functional or structural domains identified in the preys.

  > The confidence score
Hybrigenics Services assigns to each protein interaction a statistical confidence score, the Predicted Biological Score (PBS®), that ranks interacting proteins according to technical parameters such as the number of independent prey fragments. This statistical analysis takes into account not only the data from your screen but also information derived from all the screens performed for the same organism at Hybrigenics (reciprocal interactions, cycles, highly connected protein domains).

 

The PBS is computed as an e-value which varies between 0 and 1 and gives the probability for the interaction to be non specific (the same way a BLAST analysis is computed). We attributed thresholds to the computed e-values to define 4 categories ranging from A to D:

 •    PBS A, B, C characterize high confidence interactions, interactions supported by several independent fragments and/or involved in reciprocal interactions or cycles.
 •    PBS D is assigned to interactions identified through one unique fragment or multiple identical ones. They can be interactions hardly detectable by the Y2H technique (weak or transient interactions, low abundance of the transcript in the library) or represent false-positive technical background.

 •    The PBS E is a special category and represents interactions involving prey domains connected at least 10 times in screens with Human, Mouse, Drosophila and Arabidopsis cDNA libraries and 6 times for all other organisms. This arbitrary threshold allows us to flag highly connected (non-specific) protein domains. Our experience with more than 4,500 screens performed so far has allowed us to classify highly connected proteins in different categories: i) Proteins that are known to be highly connected due to their biological function (modifying enzyme, chaperones, protein degradation enzymes…) and which therefore easily exceed the threshold of 6 when many screens are performed within the same organism; ii) Proteins with a prey interacting domain that contains a known protein interaction motif (e.g. PDZ, Ras Association Domain, …) or a biochemically promiscuous motif (e.g. highly charged residues). Their higher connectivity may be specific or the consequence of the biochemical promiscuity (interaction biochemically relevant but not necessarily occurring in a physiological setting).

  •    The PBS F flags experimentally proven artifacts of the Y2H technology. These can be LexA or Gal4 protein binders, binders of the DNA sequence upstream of the reporter gene…

 

You will find more details about the computation in our Drosophila protein interaction map paper (Formstecher E., et al. "Protein interaction mapping: A Drosophila case study." (2005) Genome Research).

> Discover much more than just the interaction !

The results of each screen are displayed on 3 different files:
  > A Results Summary pdf document
  > An Excel® worksheet including raw data
  > A “DomSight™” pdf file visualizing both the minimal interacting domain
     derived from the screen and known functional domains on the identified preys.

An example of each results file can be provided upon request, contact us !

Excel® is a registered trademark of Microsoft Corp.