You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+8Lines changed: 8 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,3 +1,11 @@
1
+
# v0.15.0
2
+
## Changes
3
+
-**Breaking**: Added five new columns to the output of `profile`:
4
+
-`cpg_label` - A pass-through of the optional `cpg_label` field in the provided regions or background file
5
+
-`mean_combined_methyl_delta` - This is the raw delta value that is used to generate `mean_combined_methyl_zscore`. This will only be populated if background statistics are provided.
6
+
-`median_total_coverage`, `median_hap1_coverage`, and `median_hap2_coverage` - These metrics are calculated by extracting the coverage values from the pb-CpG-tools inputs and calculating the median of each. For hap1 and hap2, only the CpGs with phased information will be used.
7
+
-**Breaking**: Added the `cpg_label` column to the `build` output, this is a pass-through of the label from the provided profiles
8
+
1
9
# v0.14.2
2
10
## Fixed
3
11
- Fixed an issue where CpG segments with fewer CpGs than the `--min-cpgs` parameter could get created. This primarily occurred with groups of isolated CpGs, and affected `segment`, `joint-segment`, and `signature` modes. These small isolated segments are now filtered after the segmentation step, and therefor removed from all outputs.
Copy file name to clipboardExpand all lines: docs/profile_guide.md
+16-10Lines changed: 16 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -162,6 +162,7 @@ If a background profile is provided, relative metrics (such as Z-scores) will al
162
162
163
163
Fields:
164
164
*`chrom`, `start`, `end` - the region definition, copied from the input region file
165
+
*`cpg_label` - a pass-through of the optional `cpg_label` field from the background file; empty string if not provided
165
166
*`summary_label` - a summarization of the methylation status for this region, possible options are below:
166
167
*`NoData` - indicates no CpGs were found inside the region
167
168
*`Uncategorized` - indicates that CpGs were present, but there was not enough evidence to label this region with any of the following labels
@@ -183,18 +184,22 @@ Fields:
183
184
*`mean_meth_delta` - the difference in mean methylation ratios between the two haplotypes; `mean_meth_delta = mean_hap2_methyl - mean_hap1_methyl`
184
185
*`mean_abs_meth_delta_zscore` - if a background region file was provided, this value is the Z-score comparing `abs(mean_meth_delta)` against the background profile; positive values indicate _more_ evidence of ASM in this dataset relative to the population, negative values indicate _less_
185
186
*`mean_combined_methyl` - the mean (average) combined methylation ratio; "combined" here indicates that phasing (i.e. haplotypes) is not considered
186
-
*`mean_combined_methyl_zscore` - if a background region file was provided, this value is the Z-score comparing `mean_combined_methyl` against the background profile; positive values indicate this dataset is hyper-methylated relative to the the population, negative values indicate hypo-methylation
187
+
*`mean_combined_methyl_delta` - if a background region file was provided, this value is the raw delta value comparing the sample methylation to the population mean: `mean_combined_methyl - population_mean_combined_methyl`; positive values indicate this dataset is hyper-methylated relative to the the population, negative values indicate hypo-methylation
188
+
*`mean_combined_methyl_zscore` - if a background region file was provided, this value is the corresponding Z-score for `mean_combined_methyl_delta`
187
189
*`num_phased_cpgs` - the number of CpGs in the region with haplotagged reads on both haplotypes
188
190
*`num_partial_cpgs` - the number of CpGs in the region with haplotagged reads on only one haplotype
189
191
*`num_unphased_cpgs` - the number of CpGs in the region with no haplotagged reads
192
+
*`median_total_coverage` - the median coverage across all CpGs in the region
193
+
*`median_hap1_coverage` - the median coverage for CpGs with haplotype 1 information
194
+
*`median_hap2_coverage` - the median coverage for CpGs with haplotype 2 information
@@ -205,6 +210,7 @@ Example cohort background profiles can be found in the [data folder](../data/).
205
210
206
211
Fields:
207
212
*`chrom`, `start`, `end` - the region definition
213
+
*`cpg_label` - a pass-through of the optional `cpg_label` field from the profile files; empty string if not provided
208
214
*`data_category` - the label assigned to the dataset; "ALL" indicates the full cohort; there is one line in the file for each combination of region and `data_category`
209
215
*`num_phased` - the number of datasets with `num_phased_cpgs > 0` for this region
210
216
*`num_unphased` - the number of datasets with `num_phased_cpgs == 0` for this region
Copy file name to clipboardExpand all lines: docs/user_guide.md
+3-2Lines changed: 3 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,11 +6,12 @@ Table of contents:
6
6
*[Data files](../data/)
7
7
8
8
# Main workflows
9
-
There are currently four main workflows that are supported by MethBat:
9
+
There are currently five main workflows that are supported by MethBat:
10
10
1.[Rare methylation analysis](./profile_guide.md#rare-methylation-analysis) - Identify regions in a single dataset exhibiting a "rare" methylation patterns relative to a collection of background datasets; requires pre-defined regions such as all known CpG islands.
11
11
2.[Cohort methylation analysis](./profile_guide.md#cohort-methylation-analysis) - Identify regions exhibiting different methylation patterns between case and control datasets; requires pre-defined regions such as all known CpG islands.
12
12
3.[Segmentation](./segmentation_guide.md) - Segment (or divide) CpGs for an individual dataset into regions with a shared methylation pattern; no pre-defined regions required.
13
-
4.[Signature generation](./signature_guide.md) - Identify regions exhibiting different methylation patterns between case and control datasets; no pre-defined regions required.
13
+
4.[Joint segmentation](./joint_segmentation_guide.md) - Segment CpGs by averaging methylation values across a cohort and then segmenting the averaged values; no pre-defined regions required.
14
+
5.[Signature generation](./signature_guide.md) - Identify regions exhibiting different methylation patterns between case and control datasets; no pre-defined regions required.
14
15
15
16
# Supported upstream processes
16
17
The following upstream processes are supported as inputs to MethBat:
0 commit comments