Genome Biol Evol. 2023 Jul 06. pii: evad126. [Epub ahead of print]
Multiple distal cis-regulatory elements (CREs) often cooperate to regulate gene expression, and the presence of multiple CREs for a gene has been proposed to provide redundancy and robustness to variation. However, we do not understand how attributes of a gene's distal CRE landscape-the CREs that contribute to its regulation-relate to its expression and function. Here, we integrate three-dimensional chromatin conformation and functional genomics data to quantify the CRE landscape composition genome-wide across ten human tissues and relate their attributes to the function, constraint, and expression patterns of genes. Within each tissue, we find that expressed genes have larger CRE landscapes than non-expressed genes and that genes with tissue-specific CREs are more likely to have tissue-specific expression. Controlling for the association between expression level and CRE landscape size, we also find that CRE landscapes around genes under strong constraint (e.g., loss-of-function intolerant and housekeeping genes) are not significantly smaller than other expressed genes as previously proposed; however, they do have more evolutionarily conserved sequences than CREs of expressed genes overall. We also show that CRE landscape size does not associate with expression variability across individuals; nonetheless, genes with larger CRE landscapes have a relative depletion for variants that influence expression levels (eQTL). Overall, this work illustrates how differences in gene function, expression, and evolutionary constraint are reflected in features of CRE landscapes. Thus, considering the CRE landscape of a gene is vital for understanding gene expression dynamics across biological contexts and interpreting the effects of non-coding genetic variants.
Keywords: cis-regulatory element; gene regulation; regulatory landscape; tissue-specific gene expression