The Wiggle pipeline
The wiggle pipeline produces something equivalent to a wiggle plot commonly
seen when viewing NGS data in generic genome browsing systems. This provides
a simple view of the quantitated density of reads along either a specific
region, or over the whole genome, giving a more quantitative way of looking
at your raw data than looking at the raw reads directly, but with no bias
for specific regions.
The wiggle view isn't particularly useful for analysing your data, but is
mostly used as a better way to take an initial view of how your data is
distributed over your genome.
Options
The options you can set for this pipeline are:
- Which area you want to look at. You can choose to cover either the
whole genome, the current chromosome or the currently visible region. Looking
at smaller regions will allow you to place your probes more densely, since it
is the total number of probes which tends to be limiting. When you change
this selection SeqMonk will automatically suggest sensible values for the
width of your probes, but you are free to igore these and set your own size.
- The width and step size for your probes. These will be set automatically
based on the size of the region you're analysing (the program aims to make
10 million probes and adjusts the size of each probe to try to match this
figure). By default the step size and probe width will be the same so probes
don't overlap, but you can alter this value to whatever you like. You should
be aware that reducing the suggested step size could generate very large numbers
of probes which could take a long time and a lot of resources to calculate.
- Whether the results should be log2 transformed. The default is to
quantitate on a linear scale to emphasize any differences, but you can choose
to use a log scale if this makes more sense for your data. If your data is
on a linear scale you will probably need to adjust the automatically calibrated
y-scale to allow you to see the high outliers.
- Whether you want to adjust your counts based on the total number of reads in
your datasets. Setting this option will allow for a more direct comparison of
your samples