Here we present an adaptation of NimbleGen 2.1M-probe array sequence capture for whole exome sequencing using the Illumina Genome Analyzer (GA) platform.The protocol involves two-stage library construction.The specificity of exome enrichment was approximately 80% with 95.6% even coverage of the 34 Mb target region at an average sequencing depth of 33-fold.Comparison of our results with whole genome shot-gun resequencing results showed that the exome SNP calls gave only 0.97% false positive and 6.27% false negative variants.Our protocol is also well suited for use with whole genome amplified DNA.The results presented here indicate that there is a promising future for large-scale population genomics and medical studies using a whole exome sequencing approach.
As an index of functional divergence, expression divergence between duplicate gene copies has been observed and correlated with protein coding sequence divergence and bias in gene functional classes. However, the changes in the cis-regulatory region of the duplicate genes which is thought to have important role in expression divergence, has not been explored on the genome-wide scale. We analyzed functional genomics data for a large number of duplicated gene pairs formed by ancient polyploidy events in Arabidopsis thaliana. The divergence in cis-regulatory regions between two copies is positively correlated with the magnitude difference of expression. Moreover, we find that highly expressed duplicate gene pairs have a more diverged cis-regulatory region than weakly expressed gene pairs. We also show that the correlation between expression functional constraint and protein functional constraint is different in old and young duplicate pairs. Our results suggest that cis-regulatory sequence divergence contributes to the expression divergence of duplicate genes formed by genome-wide duplication. Cis-regulatory region diverges faster in highly expressed duplicate pairs. The diversify selection strengths that act on cis-regulatory region and protein coding region are negatively correlated in young duplicate pairs under expression constraint.