Abstract
Deep learning models are traditionally used in big data scenarios. When there is not enough training data to fit a large model, transfer learning re-purposes the learned features from an existing model and re-trains the lower layers for the new task. Bayesian inference techniques can be used to capture the uncertainty of the new model, but this comes at a high computational cost. In this paper, the runtime performance of a Stochastic Gradient Markov Chain Monte Carlo method is compared on two different hardware architectures, namely GPU and multi-core CPU. In contrast to the widespread use of GPUs for deep learning, significant advantages are found in using modern CPU architectures.