This is a preprint.
Characterization of non-coding variants associated with transcription factor binding through ATAC-seq-defined footprint QTLs in liver
- PMID: 39386531
- PMCID: PMC11463493
- DOI: 10.1101/2024.09.24.614730
Characterization of non-coding variants associated with transcription factor binding through ATAC-seq-defined footprint QTLs in liver
Abstract
Non-coding variants discovered by genome-wide association studies (GWAS) are enriched in regulatory elements harboring transcription factor (TF) binding motifs, strongly suggesting a connection between disease association and the disruption of cis-regulatory sequences. Occupancy of a TF inside a region of open chromatin can be detected in ATAC-seq where bound TFs block the transposase Tn5, leaving a pattern of relatively depleted Tn5 insertions known as a "footprint". Here, we sought to identify variants associated with TF-binding, or "footprint quantitative trait loci" (fpQTLs) in ATAC-seq data generated from 170 human liver samples. We used computational tools to scan the ATAC-seq reads to quantify TF binding likelihood as "footprint scores" at variants derived from whole genome sequencing generated in the same samples. We tested for association between genotype and footprint score and observed 693 fpQTLs associated with footprint-inferred TF binding (FDR < 5%). Given that Tn5 insertion sites are measured with base-pair resolution, we show that fpQTLs can aid GWAS and QTL fine-mapping by precisely pinpointing TF activity within broad trait-associated loci where the underlying causal variant is unknown. Liver fpQTLs were strongly enriched across ChIP-seq peaks, liver expression QTLs (eQTLs), and liver-related GWAS loci, and their inferred effect on TF binding was concordant with their effect on underlying sequence motifs in 80% of cases. We conclude that fpQTLs can reveal causal GWAS variants, define the role of TF binding site disruption in disease and provide functional insights into non-coding variants, ultimately informing novel treatments for common diseases.
Figures
Similar articles
-
ATAC-STARR-seq reveals transcription factor-bound activators and silencers within chromatin-accessible regions of the human genome.Genome Res. 2022 Aug 25;32(8):1529-1541. doi: 10.1101/gr.276766.122. Genome Res. 2022. PMID: 35858748 Free PMC article.
-
Profiling of chromatin accessibility identifies transcription factor binding sites across the genome of Aspergillus species.BMC Biol. 2021 Sep 6;19(1):189. doi: 10.1186/s12915-021-01114-0. BMC Biol. 2021. PMID: 34488759 Free PMC article.
-
The native cistrome and sequence motif families of the maize ear.PLoS Genet. 2021 Aug 12;17(8):e1009689. doi: 10.1371/journal.pgen.1009689. eCollection 2021 Aug. PLoS Genet. 2021. PMID: 34383745 Free PMC article.
-
Regulatory SNPs: Altered Transcription Factor Binding Sites Implicated in Complex Traits and Diseases.Int J Mol Sci. 2021 Jun 16;22(12):6454. doi: 10.3390/ijms22126454. Int J Mol Sci. 2021. PMID: 34208629 Free PMC article. Review.
-
Detect accessible chromatin using ATAC-sequencing, from principle to applications.Hereditas. 2019 Aug 15;156:29. doi: 10.1186/s41065-019-0105-9. eCollection 2019. Hereditas. 2019. PMID: 31427911 Free PMC article. Review.
References
Publication types
Grants and funding
LinkOut - more resources
Full Text Sources
Miscellaneous