This directory contains the Sep. 2009 (TWGS Meug_1.1/macEug2) assembly of the
wallaby genome (NCBI Project ID: 12586, Accession: GCA_000004035.1), as well as
repeat annotations and GenBank sequences.
This assembly was produced by the National Center for Biotechnology Information (NCBI).
For more information on the wallaby genome, see the project website:
Files included in this directory:
macEug2.2bit - contains the complete wallaby/macEug2 genome sequence
in the 2bit file format. Repeats from RepeatMasker and Tandem Repeats
Finder (with period of 12 or less) are shown in lower case; non-repeating
sequence is shown in upper case. The utility program, twoBitToFa (available
from the kent src tree), can be used to extract .fa file(s) from
this file. A pre-compiled version of the command line tool can be
macEug2.agp.gz - Description of how the assembly was generated from
macEug2.fa.gz - "Soft-masked" assembly sequence in one file.
Repeats from RepeatMasker and Tandem Repeats Finder (with period
of 12 or less) are shown in lower case; non-repeating sequence is
shown in upper case.
macEug2.fa.masked.gz - "Hard-masked" assembly sequence in one file.
Repeats are masked by capital Ns; non-repeating sequence is shown in
upper case.
macEug2.fa.out.gz - RepeatMasker .out file. RepeatMasker was run with the
-s (sensitive) setting. RepeatMasker version: June 30 2010 (open-3-2-9)
RepeatMasker library version: 20090604
macEug2.trf.bed.gz - Tandem Repeats Finder locations, filtered to keep repeats
with period less than or equal to 12, and translated into UCSC's BED
format.
est.fa.gz - Wallaby ESTs in GenBank. This sequence data is updated once a
week via automatic GenBank updates.
md5sum.txt - checksums of files in this directory
mrna.fa.gz - Wallaby mRNA from GenBank. This sequence data is updated
once a week via automatic GenBank updates.
xenoMrna.fa.gz - GenBank mRNAs from species other than that of
the genome. This sequence data is updated once a week via automatic
GenBank updates.
macEug2.chrom.sizes - Two-column tab-separated text file containing assembly
sequence names and sizes.
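The soft-masking convention and the chrom.sizes layout described above can be
illustrated with a short sketch. It assumes a POSIX shell and awk; fasta_stats
is a hypothetical helper name, not a UCSC tool. It reads FASTA from standard
input and prints, for each sequence, its name, its length, and the number of
lower-case (soft-masked) bases, separated by tabs. The first two columns follow
the same name/size layout as macEug2.chrom.sizes. The lower-case count assumes
that masked bases are written only as a, c, g, t or n.

```shell
# Print "name<TAB>length<TAB>soft-masked bases" for each FASTA record on stdin.
# Hypothetical helper for illustration; not part of the UCSC tools.
fasta_stats() {
    awk '
        # Emit the record collected so far, if any.
        function emit() {
            if (name != "") printf "%s\t%d\t%d\n", name, len, low
        }
        /^>/ {
            emit()
            name = substr($1, 2)   # header text up to the first whitespace
            len = 0; low = 0
            next
        }
        {
            len += length($0)
            # gsub returns the number of matches; "&" leaves the line unchanged.
            # Assumption: soft-masked bases are limited to a, c, g, t, n.
            low += gsub(/[acgtn]/, "&")
        }
        END { emit() }
    '
}
```

For example, "gzip -dc macEug2.fa.gz | fasta_stats | cut -f1,2" should list the
same names and sizes as macEug2.chrom.sizes. The two listings are not
guaranteed to be in the same order, so sort both before comparing them.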
If you plan to download a large file or multiple files from this
directory, we recommend that you use ftp rather than downloading the
files via our website. To do so, ftp to hgdownload.cse.ucsc.edu
[username: anonymous, password: your email address], then cd to the
directory goldenPath/macEug2/bigZips. To download multiple files, use
the "mget" command:
mget <filename1> <filename2> ...
- or -
mget -a (to download all the files in the directory)
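Once files have been downloaded, by ftp or by any other method, they can be
checked against md5sum.txt. The following is a minimal sketch, assuming that
GNU coreutils md5sum is installed and that md5sum.txt uses the standard md5sum
output format. verify_downloads is a hypothetical helper name. The
--ignore-missing option, available in recent versions of GNU coreutils, skips
files listed in md5sum.txt that were not downloaded.

```shell
# Check downloaded files against md5sum.txt in the same directory.
# $1: directory holding the downloads and md5sum.txt (default: current directory)
# Hypothetical helper; assumes GNU coreutils md5sum with --ignore-missing.
verify_downloads() {
    # Run in a subshell so the caller's working directory is not changed.
    ( cd "${1:-.}" && md5sum -c --ignore-missing md5sum.txt )
}
```

The function returns a non-zero status if any downloaded file fails its
checksum, or if none of the listed files is present.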
Alternative methods to ftp access:
Using an rsync command to download the entire directory:
rsync -avzP rsync://hgdownload.cse.ucsc.edu/goldenPath/macEug2/bigZips/ .
For a single file, e.g. chromFa.tar.gz
Or with wget, all files:
With wget, a single file:
To unpack the *.tar.gz files:
tar xvzf <file>.tar.gz
To uncompress the fa.gz files: