From Brian.Dalrymple csiro.au Thu Jan 10 18:05:49 2013
From: Brian.Dalrymple csiro.au (by way of Jill Maddox <jillian.maddox alumni.unimelb.edu.au>)
To: Multiple Recipients of <sheepmodels animalgenome.org>
Subject: Sheep RNA-Seq data for genome annotation - three weeks left to
submit
Date: Thu, 10 Jan 2013 18:05:49 -0600
Dear All,
Firstly, thanks to those groups that have already started to submit their
data.
The closing date for submissions of sheep RNA sequence data for the sheep
genome annotation project at ENSEMBL remains 31st January 2013.
There are therefore three weeks left to submit data to be used in the
annotation of Oar v3.1.
If you have any questions not answered by the information below, please
contact me at brian.dalrymple csiro.au. I will be at PAG2013 from Saturday
12th to Tuesday 15th January and can answer questions face to face.
Note: the information below only applies to data not already deposited in a
public archive. Public data will be obtained direct from the archive.
Data can be submitted in two ways. The first is on discs/drives, sent to:
Dr Steve Searle
Wellcome Trust Sanger Institute
Wellcome Trust Genome Campus
Hinxton
Cambridge
CB10 1SA
UK
If submitting on discs/drives, I would appreciate receiving a short note with
the submission information (see below) so that the ISGC has a full picture
of how much data is being submitted. Please send it to
brian.dalrymple csiro.au.
We encourage users with large amounts of data to submit on drives directly
to ENSEMBL as the transfer via the internet to Australia may be slow and
prone to transfer issues.
Alternatively, data can be submitted via the internet to the ISGC in
Brisbane; we will then package the data up to send to ENSEMBL. To submit via
the internet you will need to contact Sean McWilliam
(sean.mcwilliam csiro.au) before 31st January 2013. Sean will provide you
with the details of the upload procedure. We are using the Australian
Cloudstor system, which requires a voucher for each file uploaded and has a
maximum file size of 100Gb (submit multiple files/datasets in archives).
For both submission methods we need the following information about each
dataset:
Submitter
Breed
Tissue
Sequencing platform
Read length
Single end or paired end
Stranded or not
Amplified or not
Approximate size of the dataset
FASTQ files of the reads are preferred; alternatively, BAM files of the
complete sets of reads (including unaligned reads) can be submitted.
ENSEMBL is keen to release the aligned data as BAM files as well as the gene
models themselves. If this is a concern to you please indicate this in the
information file.
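As an illustration only, the per-dataset information requested above could be
written to a small plain-text information file along the following lines.
This is a sketch, not a format prescribed by the ISGC or ENSEMBL: the field
names, the `DatasetInfo` class, the `render` function and the
"allow BAM release" line are all assumptions, and the values shown are
placeholders to be replaced with your own details.

```python
from dataclasses import dataclass, fields


@dataclass
class DatasetInfo:
    """One record per dataset; field names are hypothetical, not an official schema."""
    submitter: str
    breed: str
    tissue: str
    sequencing_platform: str
    read_length: str
    layout: str              # "single end" or "paired end"
    stranded: bool
    amplified: bool
    approximate_size: str
    file_format: str         # "fastq" (preferred) or "BAM"
    allow_bam_release: bool = True   # set to False if release of aligned BAMs is a concern


def render(info: DatasetInfo) -> str:
    """Render the record as 'Label: value' lines, one per field."""
    lines = []
    for f in fields(info):
        label = f.name.replace("_", " ").capitalize()
        value = getattr(info, f.name)
        if isinstance(value, bool):
            value = "yes" if value else "no"
        lines.append(f"{label}: {value}")
    return "\n".join(lines) + "\n"


# Placeholder values only; substitute the details of your own dataset.
example = DatasetInfo(
    submitter="<name and institution>",
    breed="<breed>",
    tissue="<tissue>",
    sequencing_platform="<platform>",
    read_length="<read length in bp>",
    layout="paired end",
    stranded=False,
    amplified=False,
    approximate_size="<approximate size>",
    file_format="fastq",
)

print(render(example))
```

One such file per dataset, sent alongside the data (or with the short note
for disc/drive submissions), would cover every item in the list above.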
The data will only be used by ENSEMBL and only for the purposes of
construction of gene models and the annotation of the sheep reference genome
assembly.
Thanks very much
Brian
Dr Brian Dalrymple
Leader of the ISGC sheep reference genome project