논문윤리하기 논문투고규정
  • 오늘 가입자수 0
  • 오늘 방문자수 319
  • 어제 방문자수 536
  • 총 방문자수 2790
2024-11-20 12:51pm
논문지
HOME 자료실 > 논문지

발간년도 : [2023]

 
논문정보
논문명(한글) [Vol.18, No.1] Real-Time Processing System of E-Commerce User Data Based on Spark Streaming
논문투고자 Du Zhang, Wenchao Jia, Eunsung Kim, Hoekung Jung
논문내용 The advent of the e-commerce era has changed the way people shop, and at the same time, users generate a large amount of data when shopping. These data can be analyzed by offline calculation, but the results of offline analysis lack real-time performance. In this paper, by processing the log data and business data of e-commerce users in real-time, the feedback of the processing results can be quickly realized. The Spark big data computing framework has the advantages of real-time computing capability and high throughput. Spark Streaming, as an extension of Spark core, is the real-time stream processing component of the Spark computing platform. In this paper, the data is processed in real-time through Spark. Through Maxwell, real-time monitoring of business data changes in the MySQL database is performed, and the monitored data is sent to Kafka. Log data is directly sent to Kafka. Spark Streaming consumes the data in Kafka, then performs specific processing on the data according to the requirements, and the processed data is written to the Elasticsearch. In order to achieve exactly once consumption of data, this paper realizes at least one consumption of data by manually submitting offsets. Elasticsearch supports idempotent writes, so it can achieve exactly once consumption of downstream data. Manually submitted offsets are stored in Redis. Finally, specific queries can be performed on the processing results according to business requirements
첨부논문
   18-1-02.pdf (852.9K) [2] DATE : 2023-03-03 08:58:04