Version 2 (modified by vw253, 7 years ago) (diff)


PomBase Planning Meeting ?? Feb 2015


Action item review

Outstanding action items carried forward from previous meeting

Action items from

New (and continuing) Agenda items

  • Has the new person been appointed? Is the browser overhaul in progress? (Re Paul will ensure that that the Ensembl browser developers get the PomBase?? suggestions before the browser overhaul)
  • Compact display progess and outstanding
  • exporting GO and FYPO data in EMBl /Genbank files
  • I still don't see the centromere feature, what was the outcome of this Q? check minutes

Centromere feature is not visible on region in detail view. Do we think it should be?;site=ensemblunit

  • Document update procedure and time taken (Mark) Done on Confluence (Q how long is this?)

  • EsyN update

Hosting high throughput datasets

  • None pending at high priority

Display of new data types

Phenotyping of wild strains, how to display these?

From Dan:

Here is what think might be interesting/useful to represent.

The three main ones are:

  1. SNPs, indels using Ensembles variation tools. I think I jhave already sent a VCF for indels and SNPs. If not I'm happy to send this again.
  2. Tf1 transposons that are variable between strains as well. Could also be supplied in VCF.
  3. Variation wiggle tracks (like pi, which is the average pairwise differences, for example).

Also: Some of the variants are associated with one or more traits (they are QTLS). I think QTLs are displayed on some other species. This would be a small amount of data, perhaps 100 or so QTLs. I am not sure how much of this data gets displayed in gene pages. If its does then fine. Otherwise I could easily make a table of gene-by-gene stats (such as number of Snps, indels, pi, and andy QTLs).

I'd be very happy to come along & help/chat about getting this data into pombase.

Priority Jira tickets

  • I have cleared out some of the smaller tickets to V 48, but if they are quick they can be done and closed. As the gene pages are getting longer and longer due to the volume of curated data we feel that it is becoming a priority to address the display of GO and phenotype data and address some of the redundancy in order for the users to be able to consume the data effectively. This should be the next larger project to tackle development wise. Also we should start to add a large volume of multi gene phenotypes in the next couple of months, so the other 2 tickets under the parent PB-1836 (PB-1839 and PB-1840), so we are happy to punt anything else bar critical bugs until these are done.
  • Follow up on the items for Ensembl, are they all raised and in progress?


Usage stats



Phenotype ontology


  • Kim is finalising multi gene phenotypes (and their storage in Canto), many annotation transfer speed ups.
    • We have a large volume of multi gene phenotype data ready to input so this will be the issues to tackle immediately after the compact display. There are related tickets already for how this should look on the gene pages.

Other general issues



Update on community curation

Literature (triage) status

Item March 2013 April 2013 May 2013 Aug 2013 May7 2014 July 23 2014 Oct 2014
All publications 9755 9761 9773 9989 10356 10400 10522
Curatable publications 4735 4740 4780 4896 5017 5070 4873
Publications with Approved sessions 580 600 647 712 1083 1187 1598
Publications with active sessions 245 247 265 246 220 246 225
Publications with session needing approval 14 4 4 18 6 11 19
community curatable publications - - - 351 526 613 733
community curated publications with approved sessions - - - 58 159 178 207
curatable publications without sessions - - - 3730 3256 3177 2518
  • numvers of annotatable papers have dropped due to retriage and classification of some papers that are probably of low value for curation.

All annotation types


Next priorities



News and Outreach

Next planning meeting