A Comprehensive Dataset for Analyzing the Chinese Automotive Market
The SRNI-CAR dataset addresses the limitations in current automotive industry datasets, offering a robust solution for various business analytics needs. Spanning the period from 2016 to 2022, it encompasses a wide-ranging repository of data pertinent to China’s automobile sector, encompassing crucial facets such as sales data, online reviews, and industry news.
What makes the SRNI-CAR dataset stand out is its combination of industry news, development insights, automotive marketing data, consumer online reviews, and sales information. It also introduces previously unavailable variables like model launch dates and brand entry dates. This broader scope opens up new possibilities for research compared to existing datasets, improving both analytical accuracy and interpretability.
In the automotive sector, the SRNI-CAR dataset holds significant business value. Sales data helps automakers and marketers track market trends, while review data aids in understanding consumer preferences, assessing marketing effectiveness, and analyzing product strengths and weaknesses. Additionally, including industry news, development insights, and automotive publicity empowers automakers to stay informed about industry trends and competition.
Variables | Description |
---|---|
Car series | Name of the car series. |
Brand | Name of the brand. |
Year | Year in which the car series was sold. |
Month | Month in which the car series was sold. |
Car model type | Car model category: Sedan, SUV and MPV. |
Brand energy type | Brand category based on energy type of vehicle produced. |
Size | Vehicle size: mini, minivan, minibus, small, compact, mid-size, larger than mid-size, full-size. |
Brand country of origin | Country in which the brand was created. |
Model launch date | Year when the car series was launched on the Chinese market. |
Brand establishment date | Year the brand was created. |
Brand entered China date | Year when the brand officially entered the Chinese market. |
Sales | Total sales of the car series in the month. |
Variables | Description |
---|---|
Car series | Name of the car series. |
Brand | Name of the brand. |
Size | Vehicle size: mini, minivan, minibus, small, compact, mid-size, larger than mid-size, full-size. |
Car model type | Car model category: Sedan, SUV and MPV. |
User ID | Name that users use when making online reviews. |
Year of review | Year when the user reviews. |
Month of review | Month when the user reviews. |
Specific model purchased | Specific model of a car series purchased by a user. |
Official price | Official prices for specific models purchased. |
Car energy type | Vehicle energy type: gasoline vehicle, diesel vehicle, hydrogen vehicle, and so on. |
Brand energy type | Brand category based on energy type of vehicle produced. |
Brand country of origin | Country in which the brand was created. |
Brand establishment date | Year the brand was created. |
Brand entered China date | Year when the brand officially entered the Chinese market. |
Model launch date | Year when the car series was launched on the Chinese market. |
Year of purchase | Year the user purchased the model. |
Month of purchase | Month the user purchased the model. |
Sales | Total sales of the car series in the month when the user purchased the model. |
Experience duration | Months between purchase date and review posting date. |
Province | Province in which the user purchased the model. |
City | City in which the user purchased the model. |
Transaction price | Real transaction price of the model. |
Average energy consumption | Gasoline, diesel, electricity, or hydrogen consumed for every 100 kilometers traveled. |
Mileage | Kilometers the user has driven the model at the date the review was posted. |
Overall rating | User’s overall rating of the vehicle purchased. |
Exterior rating | User’s rating of the exterior of the vehicle. |
Interior rating | User’s rating of the interior of the vehicle. |
Space rating | User’s rating of the space of the vehicle. |
Features rating | User’s rating of the feature of the vehicle. |
Power rating | User’s rating of the power of the vehicle. |
Energy consumption rating | User’s rating of the energy consumption of the vehicle. |
Driving rating | User’s rating of the driving of the vehicle. |
Comfort rating | User’s rating of the comfort of the vehicle. |
Advantage | The advantages of the model as perceived by the user. |
Disadvantage | The disadvantage of the model as perceived by the user. |
Exterior comments | User’s comments on the exterior of the vehicle. |
Interior comments | User’s comments on the interior of the vehicle. |
Space comments | User’s comments on the space of the vehicle. |
Features comments | User’s comments on the feature of the vehicle. |
Power comments | User’s comments on the power of the vehicle. |
Energy consumption comments | User’s comments on the energy consumption of the vehicle. |
Driving comments | User’s comments on the driving of the vehicle. |
Comfort comments | User’s comments on the comfort of the vehicle. |
Variables | Description |
---|---|
Title | Title of the information. |
Pageview | Number of times the information was viewed. |
Number of comments | Number of comments the information received. |
Text | Text content contained in the information. |
Release date | Date on which this information was published. |
Author | Person who posted the information. |
Source | Source of the information. |
Information type | Whether the information is original, compiled, a press release, or reprinted from another platform. |
Information label | Labels chosen by the author that summarizes the information, based on its content. |
Data type | Format (size) | Download |
---|---|---|
Sales | csv (3.6 MB) | link |
Reviews | csv (480 MB) | link |
News and Information | csv (224 MB) | link |
If you publish work based on, or using, this dataset, we would appreciate citations to the following:
@inproceedings{ding23srnicar,
author = {Ruixin Ding and Bowei Chen and James M. Wilson and Zhi Yan and Yufei Huang},
title = {SRNI-CAR: A Comprehensive Dataset for Analyzing the Chinese Automotive Market},
booktitle = {Proceedings of 2023 IEEE International Conference on Big Data (BigData)},
pages = {3405--3412},
month = {12},
year = {2023}
}
Ruixin Ding (2589583d@student.gla.ac.uk)
Bowei Chen (bowei.chen@glasgow.ac.uk)
This work is licensed under
a Creative Commons Attribution-NonCommercial 4.0 International License.