Efficient Training on Very Large Corpora via Gramian Estimation

Krichene, Walid; Mayoraz, Nicolas; Rendle, Steffen; Zhang, Li; Yi, Xinyang; Hong, Lichan; Chi, Ed; Anderson, John

arXiv:1807.07187 (stat)

[Submitted on 18 Jul 2018]

Abstract: We study the problem of learning similarity functions over very large corpora using neural network embedding models. These models are typically trained using SGD with sampling of random observed and unobserved pairs, with a number of samples that grows quadratically with the corpus size, making it expensive to scale to very large corpora. We propose new efficient methods to train these models without having to sample unobserved pairs. Inspired by matrix factorization, our approach relies on adding a global quadratic penalty to all pairs of examples and expressing this term as the matrix-inner-product of two generalized Gramians. We show that the gradient of this term can be efficiently computed by maintaining estimates of the Gramians, and develop variance reduction schemes to improve the quality of the estimates. We conduct large-scale experiments that show a significant improvement in training time and generalization quality compared to traditional sampling methods.
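
The Gramian reformulation mentioned in the abstract can be illustrated with a small sketch. It assumes the simplest setting, where the similarity of a pair is the dot product of two d-dimensional embeddings and the penalty is the sum of squared similarities over all pairs; the paper's generalized Gramians, estimators, and variance reduction schemes are not reproduced here. All names (`gramian`, `penalty_naive`, `penalty_gramian`, `penalty_grad_wrt_u`) are hypothetical, and plain Python lists are used only to keep the example dependency-free.

```python
import math
import random


def gramian(rows):
    """Return the d x d Gramian sum_i x_i x_i^T of a list of d-dimensional rows."""
    d = len(rows[0])
    return [[sum(r[a] * r[b] for r in rows) for b in range(d)] for a in range(d)]


def penalty_naive(U, V):
    """Sum of squared dot products over all n * m pairs: O(n * m * d) work."""
    return sum(sum(ua * va for ua, va in zip(u, v)) ** 2 for u in U for v in V)


def penalty_gramian(U, V):
    """Same quantity as the matrix inner product <U^T U, V^T V>: O((n + m) * d^2) work."""
    Gu, Gv = gramian(U), gramian(V)
    d = len(Gu)
    return sum(Gu[a][b] * Gv[a][b] for a in range(d) for b in range(d))


def penalty_grad_wrt_u(u, Gv):
    """Gradient of sum_j <u, v_j>^2 with respect to u, which equals 2 * Gv @ u."""
    d = len(u)
    return [2.0 * sum(Gv[a][b] * u[b] for b in range(d)) for a in range(d)]


# Assumed toy sizes: n left embeddings, m right embeddings, dimension d.
rng = random.Random(0)
d, n, m = 3, 5, 7
U = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(n)]
V = [[rng.gauss(0, 1) for _ in range(d)] for _ in range(m)]

# The two formulations agree up to floating-point rounding.
print(math.isclose(penalty_naive(U, V), penalty_gramian(U, V), rel_tol=1e-9))
```

The point of the sketch is that the gradient with respect to one embedding depends on the other side only through its d x d Gramian, so no unobserved pair has to be enumerated or sampled. In this toy version the Gramian is computed exactly; the abstract describes maintaining estimates of it during training instead.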

Subjects:	Machine Learning (stat.ML); Computation and Language (cs.CL); Machine Learning (cs.LG)
Cite as:	arXiv:1807.07187 [stat.ML]
	(or arXiv:1807.07187v1 [stat.ML] for this version)
	https://doi.org/10.48550/arXiv.1807.07187

Submission history

From: Walid Krichene
[v1] Wed, 18 Jul 2018 23:45:33 UTC (733 KB)
