การทำนายยอดการดูวิดีโอโดยใช้การแบ่งกลุ่มยอดการดูวิดีโอและแบบจำลองเชิงเส้นหลายตัวแปร

เอกพล วงศ์ศุภรัตน์กุล

Please use this identifier to cite or link to this item: https://cuir.car.chula.ac.th/handle/123456789/70369

Title:	การทำนายยอดการดูวิดีโอโดยใช้การแบ่งกลุ่มยอดการดูวิดีโอและแบบจำลองเชิงเส้นหลายตัวแปร
Other Titles:	Prediction of view count of online videos using clustering view pattern with multivariate linear model
Authors:	เอกพล วงศ์ศุภรัตน์กุล
Advisors:	สุกรี สินธุภิญโญ
Other author:	จุฬาลงกรณ์มหาวิทยาลัย. คณะวิศวกรรมศาสตร์
Advisor's Email:	Sukree.S@Chula.ac.th
Subjects:	แบบจำลองเชิงเส้น ความน่าจะเป็น Linear models (Statistics) Probabilities
Issue Date:	2562
Publisher:	จุฬาลงกรณ์มหาวิทยาลัย
Abstract:	ในงานวิจัยนี้ เราตั้งเป้าหมายในการออกแบบแบบจำลองที่ทำนายยอดการดูระยะสั้นของวิดีโอบนยูทูบ เราเสนอแบบจำลองเอฟ7เอ็นเอ็มแอลซึ่งเป็นแบบจำลองที่สามารถจัดกลุ่มรูปแบบยอดการดูวิดีโอและกำจัดรูปแบบที่ผิดปกติ แบบจำลองนี้ประกอบด้วย 4 อย่าง อย่างแรกคือการจัดกลุ่มรูปแบบโดยใช้แบบจำลองการจัดกลุ่ม จากนั้นกลุ่มที่มีจำนวนน้อยซึ่งถูกกำหนดเป็นรูปแบบที่ไม่ได้เกิดขึ้นบ่อยจะถูกกำจัดออกไป ต่อมาจัดกลุ่มรูปแบบวิดีโอจากชุดข้อมูลทดสอบโดยใช้แบบจำลองเพื่อนบ้านใกล้เคียงที่สุด 1 อันดับ อย่างสุดท้ายคือรูปแบบแต่ละกลุ่มจะกลายเป็นชุดข้อมูลสำหรับแบบจำลองเชิงเส้นหลายตัวแปรซึ่งนำไปใช้ฝึกฝนเฉพาะกลุ่ม ผลการทดลองพบว่าแบบจำลองเอฟ7เอ็นเอ็มแอลที่ใช้แบบจำลองการจัดกลุ่มที่เหมาะสมทำให้ค่าความผิดพลาดจากการทำนายยอดการดูในวันที่ 30 ลดลง 27% จากแบบจำลองที่ดีที่สุดที่นำมาเปรียบเทียบจากงานวิจัยอื่น
Other Abstract:	In this research, we aim to design a model, which accurately predicts the short-term view count of videos on YouTube. We present F7NML, the First 7-day Normalization for clustering with Multi-variate Linear model, a predictive model that can group the patterns and remove outliers. First, it groups the patterns into many groups using the clustering model, which is presented in the paper. Then, it removes the groups of rare patterns, which are called outliers. Next, the video view count in the test dataset is matched into the groups using 1-nearest neighbor. Finally, Multivariate Linear model is trained for each group specifically. The experimental results show that F7NML with an appropriate clustering model reduces error when it was compared to the best baseline model from the literature by about 27% on the 30th-day view count prediction.
Description:	วิทยานิพนธ์ (วท.ม.)--จุฬาลงกรณ์มหาวิทยาลัย, 2562
Degree Name:	วิทยาศาสตรมหาบัณฑิต
Degree Level:	ปริญญาโท
Degree Discipline:	วิทยาศาสตร์คอมพิวเตอร์
URI:	http://cuir.car.chula.ac.th/handle/123456789/70369
URI:	http://doi.org/10.58837/CHULA.THE.2019.1126
metadata.dc.identifier.DOI:	10.58837/CHULA.THE.2019.1126
Type:	Thesis
Appears in Collections:	Eng - Theses

Files in This Item:

File	Description	Size	Format
6170983121.pdf		1.56 MB	Adobe PDF	View/Open

Show full item record