An Expected Win Rate-Based Real Time Bidding Strategy for Branding Campaign by the Model-Free Reinforcement Learning Model

doi:10.1109/ACCESS.2020.3016824

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Shih, Wen-Yueh	en_US
dc.contributor.author	Lu, Yi-Shu	en_US
dc.contributor.author	Tsai, Hsiao-Ping	en_US
dc.contributor.author	Huang, Jiun-Long	en_US
dc.date.accessioned	2020-10-05T02:02:04Z	-
dc.date.available	2020-10-05T02:02:04Z	-
dc.date.issued	2020-01-01	en_US
dc.identifier.issn	2169-3536	en_US
dc.identifier.uri	http://dx.doi.org/10.1109/ACCESS.2020.3016824	en_US
dc.identifier.uri	http://hdl.handle.net/11536/155487	-
dc.description.abstract	The bidding strategy plays the most important role to help the Demand Side Platforms (DSPs) making bidding decisions on a large number of bid requests in Real Time Bidding (RTB) to satisfy the different objectives of campaigns under the lifetime and budget constraints. In this paper, we focus on branding campaign whose objective is to obtain as many impressions as possible under the lifetime and budget constraints. To achieve the objectives of branding campaigns, we propose a novel expected win rate-based bidding strategy for branding campaign under the lifetime and budget constraints by utilizing a model-free reinforcement learning model. Specifically, to prevent missing good opportunities resulting from submitting extremely low bid prices, the concept of the base winning price is introduced to determine the lower bound of expected winning price. In addition, to obtain more impressions, the concept of the DSP-specified budget spending plan is proposed to determine the proper winning prices. The base expected win rate is then calculated based on the base winning price and the winning price determined by the DSP-specified budget spending plan. Since RTB is a dynamic environment, we propose a novel expected win rate-based bidding strategy named EWDQN which utilizes Deep Q Network (DQN) to dynamically determine the expected win rate according to the base expected win rate and the current status of the RTB market, and then determines the bid price according to the expected win rate. To the best of our knowledge, this is the first research applying the reinforcement learning technique on the bidding strategies for branding campaign. To measure the performance of EWDQN, several experiments are conducted on two real datasets. Experimental results show that EWDQN outperforms the-state-of-the-art bidding strategies for branding campaign in terms of the number of obtained impressions and CPM (cost per thousand impressions).	en_US
dc.language.iso	en_US	en_US
dc.subject	Advertising	en_US
dc.subject	Predictive models	en_US
dc.subject	Adaptation models	en_US
dc.subject	Real-time systems	en_US
dc.subject	Learning (artificial intelligence)	en_US
dc.subject	Logistics	en_US
dc.subject	Computer science	en_US
dc.subject	Real time bidding	en_US
dc.subject	online advertising	en_US
dc.subject	bidding strategy	en_US
dc.subject	reinforcement learning	en_US
dc.subject	demand side platform	en_US
dc.subject	branding campaign	en_US
dc.title	An Expected Win Rate-Based Real Time Bidding Strategy for Branding Campaign by the Model-Free Reinforcement Learning Model	en_US
dc.type	Article	en_US
dc.identifier.doi	10.1109/ACCESS.2020.3016824	en_US
dc.identifier.journal	IEEE ACCESS	en_US
dc.citation.volume	8	en_US
dc.citation.spage	151952	en_US
dc.citation.epage	151967	en_US
dc.contributor.department	資訊工程學系	zh_TW
dc.contributor.department	Department of Computer Science	en_US
dc.identifier.wosnumber	WOS:000564184100001	en_US
dc.citation.woscount	0	en_US
顯示於類別：	期刊論文