Full metadata record
DC Field | Value | Language
dc.contributor.author | Chien, Jen-Tzung | en_US
dc.date.accessioned | 2019-10-05T00:09:48Z | -
dc.date.available | 2019-10-05T00:09:48Z | -
dc.date.issued | 2019-01-01 | en_US
dc.identifier.isbn | 978-1-4503-6201-6 | en_US
dc.identifier.uri | http://dx.doi.org/10.1145/3292500.3332267 | en_US
dc.identifier.uri | http://hdl.handle.net/11536/152980 | -
dc.description.abstract | This tutorial addresses the advances in deep Bayesian mining and learning for natural language, with ubiquitous applications ranging from speech recognition to document summarization, text classification, text segmentation, information extraction, image caption generation, sentence generation, dialogue control, sentiment classification, recommender systems, question answering and machine translation, to name a few. Traditionally, "deep learning" is taken to be a learning process where the inference or optimization is based on a real-valued deterministic model. The "semantic structure" in words, sentences, entities, actions and documents drawn from a large vocabulary may not be well expressed or correctly optimized in mathematical logic or computer programs. The "distribution function" in a discrete or continuous latent variable model for natural language may not be properly decomposed or estimated. This tutorial presents the fundamentals of statistical models and neural networks, and focuses on a series of advanced Bayesian models and deep models, including the hierarchical Dirichlet process, Chinese restaurant process, hierarchical Pitman-Yor process, Indian buffet process, recurrent neural network (RNN), long short-term memory, sequence-to-sequence model, variational auto-encoder (VAE), generative adversarial network (GAN), attention mechanism, memory-augmented neural network, skip neural network, stochastic neural network, predictive state neural network and policy neural network. We present how these models are connected and why they work for a variety of applications on symbolic and complex patterns in natural language. Variational inference and sampling methods are formulated to tackle the optimization of complicated models. Word and sentence embeddings, clustering and co-clustering are merged with linguistic and semantic constraints. A series of case studies are presented to tackle different issues in deep Bayesian mining, learning and understanding. Finally, we point out a number of directions and outlooks for future studies. | en_US
dc.language.iso | en_US | en_US
dc.subject | deep learning | en_US
dc.subject | Bayesian learning | en_US
dc.subject | natural language processing | en_US
dc.title | Deep Bayesian Mining, Learning and Understanding | en_US
dc.type | Proceedings Paper | en_US
dc.identifier.doi | 10.1145/3292500.3332267 | en_US
dc.identifier.journal | KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING | en_US
dc.citation.spage | 3197 | en_US
dc.citation.epage | 3198 | en_US
dc.contributor.department | Published under the name of National Chiao Tung University | zh_TW
dc.contributor.department | National Chiao Tung University | en_US
dc.identifier.wosnumber | WOS:000485562503048 | en_US
dc.citation.woscount | 0 | en_US
Appears in Collections: Conference Papers
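
The abstract above lists the variational auto-encoder (VAE) among the deep Bayesian models covered and notes that variational inference is formulated to tackle the optimization of complicated models. As a purely illustrative sketch of that idea — not code from the tutorial or its materials; the decoder, dimensions and all names below are hypothetical — the following Python snippet estimates the VAE evidence lower bound (ELBO) for a single data point:

```python
# Illustrative sketch only: a one-sample Monte Carlo estimate of the VAE
# ELBO. Not from the tutorial; model sizes and names are hypothetical.
import numpy as np

rng = np.random.default_rng(0)

def elbo(x, mu, log_var, decoder):
    """ELBO estimate for one observation x.
    mu, log_var: encoder outputs defining q(z|x) = N(mu, diag(exp(log_var)));
    decoder: maps a latent z to Bernoulli means over the dimensions of x."""
    # Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I)
    eps = rng.standard_normal(mu.shape)
    z = mu + np.exp(0.5 * log_var) * eps
    # Reconstruction term: log p(x|z) under a Bernoulli decoder
    p = decoder(z)
    rec = np.sum(x * np.log(p + 1e-9) + (1 - x) * np.log(1 - p + 1e-9))
    # KL(q(z|x) || N(0, I)) in closed form for a diagonal Gaussian posterior
    kl = 0.5 * np.sum(np.exp(log_var) + mu**2 - 1.0 - log_var)
    return rec - kl

# Toy usage with a hypothetical (untrained) linear-sigmoid decoder
D, K = 8, 2                      # data and latent dimensions (arbitrary)
W = rng.standard_normal((K, D))  # hypothetical decoder weights

def decoder(z):
    return 1.0 / (1.0 + np.exp(-(z @ W)))

x = rng.integers(0, 2, size=D).astype(float)
mu, log_var = np.zeros(K), np.zeros(K)
print("ELBO estimate:", elbo(x, mu, log_var, decoder))
```

The KL term is computed analytically because the posterior and prior are both diagonal Gaussians, while the reconstruction term is a single-sample Monte Carlo estimate made differentiable (in a real training setup) by the reparameterization trick.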