2009 | OriginalPaper | Chapter
PLDA: Parallel Latent Dirichlet Allocation for Large-Scale Applications
Authors : Yi Wang, Hongjie Bai, Matt Stanton, Wen-Yen Chen, Edward Y. Chang
Published in: Algorithmic Aspects in Information and Management
Publisher: Springer Berlin Heidelberg
Activate our intelligent search to find suitable subject content or patents.
Select sections of text to find matching patents with Artificial Intelligence. powered by
Select sections of text to find additional relevant content using AI-assisted search. powered by
This paper presents PLDA, our parallel implementation of Latent Dirichlet Allocation on MPI and MapReduce. PLDA smooths out storage and computation bottlenecks and provides fault recovery for lengthy distributed computations. We show that PLDA can be applied to large, real-world applications and achieves good scalability. We have released
MPI-PLDA
to open source at http://code.google.com/p/plda under the Apache License.