Characterizing sediment sources by non-negative matrix factorization of detrital geochronological data

This paper explores an inverse approach to the problem of characterizing sediment sources' (“source” samples) age distributions based on samples from a particular depocenter (“sink” samples) using non-negative matrix factorization (NMF). It also outlines a method to determine the optimal number...

Celý popis

Uložené v:

Podrobná bibliografia
Vydané v:	Earth and planetary science letters Ročník 512; s. 46 - 58
Hlavní autori:	Saylor, J.E., Sundell, K.E., Sharman, G.R.
Médium:	Journal Article
Jazyk:	English
Vydavateľské údaje:	Elsevier B.V 15.04.2019
Predmet:	Book Cliffs Grand Canyon Gulf of Mexico low rank approximation mixture modeling U–Pb mixture modeling Grand Canyon Gulf of Mexico Book Cliffs low rank approximation U–Pb
ISSN:	0012-821X, 1385-013X
On-line prístup:	Získať plný text
Tagy:	Pridať tag Žiadne tagy, Buďte prvý, kto otaguje tento záznam!

Popis
Shrnutí:	This paper explores an inverse approach to the problem of characterizing sediment sources' (“source” samples) age distributions based on samples from a particular depocenter (“sink” samples) using non-negative matrix factorization (NMF). It also outlines a method to determine the optimal number of sources to factorize from a set of sink samples (i.e., the optimum factorization rank). We demonstrate the power of this method by generating sink samples as random mixtures of known sources, factorizing them, and recovering the number of known sources, their age distributions, and the weighting functions used to generate the sink samples. Sensitivity testing indicates that similarity between factorized and known sources is positively correlated to 1) the number of sink samples, 2) the dissimilarity among sink samples, and 3) sink sample size. Specifically, the algorithm yields consistent, close similarity between factorized and known sources when the number of sink samples is more than ∼3 times the number of source samples, sink data sets are internally dissimilar (cross-correlation coefficient range >0.3, Kuiper V value range >0.35), and sink samples are well-characterized (>150–225 data points). However, similarity between known and factorized sources can be maintained while decreasing some of these variables if other variables are increased. Factorization of three empirical detrital zircon U–Pb data sets from the Book Cliffs, the Grand Canyon, and the Gulf of Mexico yields plausible source age distributions and weights. Factorization of the Book Cliffs data set yields five sources very similar to those recently independently proposed as the primary sources for Book Cliffs strata; confirming the utility of the NMF approach. The Grand Canyon data set exemplifies two general considerations when applying the NMF algorithm. First, although the NMF algorithm is able to identify source age distribution, additional geological details are required to discriminate between primary or recycled sources. Second, the NMF algorithm will identify the most basic elements of the mixed sink samples and so may subdivide sources that are themselves heterogeneous mixtures of more basic elements into those basic elements. Finally, application to a large Gulf of Mexico data set highlights the increased contribution from Appalachian sources during Cretaceous and Holocene time, potentially attributable to drainage reorganization. Although the algorithm reproduces known sources and yields reasonable sources for empirical data sets, inversions are inherently non-unique. Consequently, the results of NMF and their interpretations should be evaluated in light of independent geological evidence. The NMF algorithm is provided both as MATLAB code and a stand-alone graphical user interface for Windows and macOS (.exe and .app) along with all data sets discussed in this contribution. •Non-negative matrix factorization (NMF) identifies source age distributions and contributions from sink samples.•The number of sink samples should be multiple times the number of source samples, dissimilar, and well-characterized.•We provide a new method to determine the optimum number of sources for factorization.•The power of NMF is demonstrated through successful factorization of both synthetic and empirical data sets.•We provide open-source programs for Windows and macOS as well as MATLAB scripts and example data sets to implement NMF.
ISSN:	0012-821X 1385-013X
DOI:	10.1016/j.epsl.2019.01.044