標題: Intelligent compilation of patent summaries using machine learning and natural language processing techniques
作者: Trappey, Amy J. C.
Trappey, Charles V.
Wu, Jheng-Long
Wang, Jack W. C.
管理科學系
Department of Management Science
關鍵字: Artificial intelligence;Machine learning;Natural language processing;Deep learning;Patent analysis
公開日期: 1-一月-2020
摘要: Patents are a type of intellectual property with ownership and monopolistic rights that are publicly accessible published documents, often with illustrations, registered by governments and international organizations. The registration allows people familiar with the domain to understand how to re-create the new and useful invention but restricts the manufacturing unless the owner licenses or enters into a legal agreement to sell ownership of the patent. Patents reward the costly research and development efforts of inventors while spreading new knowledge and accelerating innovation. This research uses artificial intelligence natural language processing, deep learning techniques and machine learning algorithms to extract the essential knowledge of patent documents within a given domain as a means to evaluate their worth and technical advantage. Manual patent abstraction is a time consuming, labor intensive, and subjective process which becomes cost and outcome ineffective as the size of the patent knowledge domain increases. This research develops an intelligent patent summarization methodology using artificial intelligence machine learning approaches to allow patent domains of extremely large sizes to be effectively and objectively summarized, especially for cases where the cost and time requirements of manual summarization is infeasible. The system learns to automatically summarize patent documents with natural language texts for any given technical domain. The machine learning solution identifies technical key terminologies (words, phrases, and sentences) in the context of the semantic relationships among training patents and corresponding summaries as the core of the summarization system. To ensure the high performance of the proposed methodology, ROUGE metrics are used to evaluate precision, recall, accuracy, and consistency of knowledge generated by the summarization system. The Smart machinery technologies domain, under the sub-domains of control intelligence, sensor intelligence and intelligent decision-making provide the case studies for the patent summarization system training. The cases use 1708 training pairs of patents and summaries while testing uses 30 randomly selected patents. The case implementation and verification have shown the summary reports achieve 90% and 84% average precision and recall ratios respectively.
URI: http://dx.doi.org/10.1016/j.aei.2019.101027
http://hdl.handle.net/11536/154214
ISSN: 1474-0346
DOI: 10.1016/j.aei.2019.101027
期刊: ADVANCED ENGINEERING INFORMATICS
Volume: 43
起始頁: 0
結束頁: 0
顯示於類別:期刊論文