**Genome-Wide Association Study and Selection Signatures Detect Genomic Regions Associated with Seed Yield and Oil Quality in Flax**

**Frank M. You 1,2,\* ID , Jin Xiao 1,3, Pingchuan Li <sup>1</sup> , Zhen Yao <sup>2</sup> , Gaofeng Jia 1,4, Liqiang He <sup>1</sup> , Santosh Kumar <sup>5</sup> , Braulio Soto-Cerda 6,7, Scott D. Duguid <sup>2</sup> , Helen M. Booker <sup>4</sup> ID , Khalid Y. Rashid <sup>2</sup> and Sylvie Cloutier 1,6,\***


Received: 20 July 2018; Accepted: 3 August 2018; Published: 6 August 2018

**Abstract:** A genome-wide association study (GWAS) was performed on a set of 260 lines which belong to three different bi-parental flax mapping populations. These lines were sequenced to an averaged genome coverage of 19× using the Illumina Hi-Seq platform. Phenotypic data for 11 seed yield and oil quality traits were collected in eight year/location environments. A total of 17,288 single nucleotide polymorphisms were identified, which explained more than 80% of the phenotypic variation for days to maturity (DTM), iodine value (IOD), palmitic (PAL), stearic, linoleic (LIO) and linolenic (LIN) acid contents. Twenty-three unique genomic regions associated with 33 quantitative trait loci (QTL) for the studied traits were detected, thereby validating four genomic regions previously identified. The 33 QTL explained 48–73% of the phenotypic variation for oil content, IOD, PAL, LIO and LIN but only 8–14% for plant height, DTM and seed yield. A genome-wide selective sweep scan for selection signatures detected 114 genomic regions that accounted for 7.82% of the flax pseudomolecule and overlapped with the 11 GWAS-detected genomic regions associated with 18 QTL for 11 traits. The results demonstrate the utility of GWAS combined with selection signatures for dissection of the genetic structure of traits and for pinpointing genomic regions for breeding improvement.

**Keywords:** flax; genome-wide association study (GWAS); selective sweep; genotyping by sequencing (GBS); bi-parental population; single nucleotide polymorphism (SNP); seed yield; plant height; maturity; fatty acid composition
