HiChIP Data Sets
To download one of the data sets, simply use the wget command:
wget https://s3.amazonaws.com/dovetail.pub/HiChIP/fastqs/HiChiP_CTCF_2M_R1.fastq.gz
wget https://s3.amazonaws.com/dovetail.pub/HiChIP/fastqs/HiChiP_CTCF_2M_R2.fastq.gz
For testing purposes, we recommend using the 2M reads data sets, for any other purpose we recommend using the 800M reads data set.
Sequenced (human) libraries:
Library |
Link |
---|---|
GM12878 CTCF 2M |
|
GM12878 CTCF (deep sequencing) |
|
GM12878 H3K27Ac (deep sequencing) |
|
GM12878 H3K4me3 (deep sequencing) |
Human, hg38, Peak files from ENCODE project
Data used for HiChIP Comparative Analysis (Mouse, mm10)
To get a list of all the files generated from the HiChIP Comparative Analysis tutorial, including the required reference genomes, you can use the command:
aws s3 ls s3://dovetail.pub/HiChIP/compare_samples/
Use wget to download any given file, replacing “s3://” with “https://s3.amazonaws.com/”, followed by the remaining path to the file. For example:
wget https://s3.amazonaws.com/dovetail.pub/HiChIP/compare_samples/Reference_Genome/mm10.fa
Data Set |
Link |
---|---|
Fastqs (Sample A) |
|
Fastqs (Sample B) |
Note: The full dataset, including input files and generated output is ~183Gb (roughly 5h with a network speed of 10Mb/s).