The optimal extraction of information from untargeted metabolomics analyses is a continuing challenge. Here, we describe an approach that combines stable isotope labeling, liquid chromatography? mass spectrometry (LC?MS), and a computational pipeline to automatically identify metabolites produced from a selected metabolic precursor. Here, we identified the subset of the soluble metabolome generated from phenylalanine (Phe) in Arabidopsis thaliana, which we refer to as the Phe-derived metabolome (FDM) In addition to identifying Phe-derived metabolites present in a single wild-type reference accession, the FDM was established in nine enzymatic and regulatory mutants in the phenylpropanoid pathway. To identify genes associated with variation in Phe-derived metabolites in Arabidopsis, MS features collected by untargeted metabolite profiling of an Arabidopsis diversity panel were retrospectively annotated to the FDM and natural genetic variants responsible for differences in accumulation of FDM features were identified by genome-wide association. Large differences in Phe-derived metabolite accumulation and presence/absence variation of abundant metabolites were observed in the nine mutants as well as between accessions from the diversity panel. Many Phe-derived metabolites that accumulated in mutants also accumulated in non-Col-0 accessions and was associated to genes with known or suspected functions in the phenylpropanoid pathway as well as genes with no known functions. Overall, we show that cataloguing a biochemical pathway?s products through isotopic labeling across genetic variants can substantially contribute to the identification of metabolites and genes associated with their biosynthesis.