Segway: a way to segment the genome

Hoffman MM, Buske OJ, Wang J, Weng Z, Bilmes J, Noble WS. 2011. Unsupervised pattern discovery in human chromatin structure through genomic segmentation. Submitted. (BibTeX)

The free Segway software package contains a novel method for analyzing multiple tracks of functional genomics data. Our method uses a dynamic Bayesian network (DBN) model, which enables it to analyze the entire genome at 1-bp resolution even in the face of heterogeneous patterns of missing data. This method is the first application of DBN techniques to genome-scale data and the first genomic segmentation method designed for use with the maximum resolution data available from ChIP-seq experiments without downsampling. Our software has extensive documentation and was designed from the outset with external users in mind. Researchers at other universities and institutes have already installed and used Segway for their own projects.

Please ask Michael for a copy of the Segway manuscript.

Segmentation

View the segmentation in the UCSC Genome Browser. NCBI36 (hg18). GRCh37 (hg19).

Here is a brief description of the various classes of segment labels:

MnemonicDescription
Ddead zone
L0–1low zone
F0–1FAIRE only
R0–5repression
C0CTCF (strong)
C1CTCF (weak)
H3K9me1H3K9me1 only
TF0–2transcription factor activity
TSStranscription start site
GSgene body (start)
E/GMenhancer/gene middle
GM0–1gene body (middle)
GE0–2gene body (end)

Download the segmentation for further analysis. NCBI36 (hg18). GRCh37 (hg19). (~165 MB, gzipped BED).

Documentation

Read the documentation, which begins with a quick start. The documentation is also available as a PDF.

Installation

The easy way to install segway and its prerequisites, and set up your environment properly to use them is to use our interactive install script. Just type this command from bash on your UNIX system:

python <(wget -O - http://noble.gs.washington.edu/proj/segway/install.py)

Segway requires the use of a cluster management system. Currently, we support SGE/OGE and Platform LSF. If you would like to use Segway on another system, please open a ticket in the issue tracker. You can also run Segway on SGE via the Amazon EC2 compute cloud.

Support

For support of Segway, please write to the segway-users mailing list, rather than writing the authors directly. Using the mailing list will get your question answered more quickly. It also allows us to pool knowledge and reduce getting the same inquiries over and over. Questions sent to the mailing list will receive a higher priority than those sent to us individually.

Specifically, if you want to report a bug or request a feature, please do so using the Segway issue tracker. We are interested in all comments on the package, and the ease of use of installation and documentation.

If you do not want to read discussions about other people's use of Segway, but would like to hear about new releases and other important information, please subscribe to the segway-announce mailing list. Announcements of this nature are sent to both segway-users and segway-announce.

Useful links

Running Segway in the Amazon Compute Cloud by Jay Hesselberth, University of Colorado Denver

Source code

Version 1.1.0

Notes on the segmentation

The underlying signal data for the segmentation presented above is available in bedGraph and bigWig formats (NCBI36/hg18). Use this browser file to load all the bigWigs. We produced these signal files using Wiggler from original data available from the Encode DCC.

We produced the original segmentations for NCBI36. We used liftOver (minMatch=0.99) to convert segmentations to GRCh37, and then filtered out any overlapping regions.

Michael Hoffman < mmh1 at uw period edu >