The key successful factors of video and mobile game crowdfunding projects using a lexicon-based feature selection approach

Chen, Mu-Yen; Chang, Jing-Rong; Chen, Long-Sheng; Shen, En-Li

doi:10.1007/s12652-021-03146-4

The key successful factors of video and mobile game crowdfunding projects using a lexicon-based feature selection approach

Original Research
Published: 23 March 2021

Volume 13, pages 3083–3101, (2022)
Cite this article

Download PDF

Journal of Ambient Intelligence and Humanized Computing Aims and scope Submit manuscript

The key successful factors of video and mobile game crowdfunding projects using a lexicon-based feature selection approach

Download PDF

Mu-Yen Chen¹,
Jing-Rong Chang²,
Long-Sheng Chen ORCID: orcid.org/0000-0002-2967-9956² &
…
En-Li Shen²

2887 Accesses
11 Citations
Explore all metrics

Abstract

The emergence of crowdfunding has given many capital demanders a new fund-raising channel, but the overall project success rate is very low. Many scholars have begun to discover key suscessful factors of crowdfunding projects. Previous studies have used questionnaires survey to identify important project features. In addition to requiring a lot of manpower and time, there may also be sampling bias. Moreover, related studies also reported that the project description will affect the success of the crowdfunding project, but there is no research to tell fundraisers which success factors should be included in the content of the project description. Besides, in recent years, game crowdfunding projects have been attracted lots of attention in terms of total fundraising amount and number of projects. Moreover, in traditional feature selection and text mining approaches, the selected terms are un-organized and hard to be explained. Therefore, this study will focus on real video and mobile game project descriptions to replace conventional questionnaires. To solve these issues, we present a lexicon-based feature selection method. We attempt to define “content features” and build lexicons to determine the attributes’ values. Three feature selection methods including decision tree (DT), Least Absolute Shrinkage and Selection Operator (LASSO), and support vector machine–recursive feature elimination (SVM–RFE) will be employed to find organized candidate key successful factors. Then, support vector machines (SVM) will be used to evaluate the performances of candidate feature subsets. Finally, this study has identified the key successful factors for video and mobile games, respectively. Based on the experimental results, we can give fundraisers some useful suggestions to improve the success rate of crowdfunding projects.

User-Generated Short Video Content in Social Media. A Case Study of TikTok

Imbalanced data preprocessing techniques for machine learning: a systematic mapping study

Article 09 November 2022

Vitor Werner de Vargas, Jorge Arthur Schneider Aranda, … Jorge Luis Victória Barbosa

Conceptualising and measuring social media engagement: A systematic literature review

Article Open access 11 August 2021

Mariapina Trunfio & Simona Rossi

1 Introduction

With the rapid development of the Internet, crowdfunding has emerged accordingly (Mollick 2014). Zhang and Chen (2019) defined crowdfunding as a fundraising method in which individuals, groups or companies raise funds from the public through the Internet (Cox et al. 2018; Kim et al. 2017). It’s reported that the total amount of funds raised has exceeded US$5.2 billion, and the total number of participants has exceeded 18.47 million in Kickstarter (Kickstarter.com August 2020). Crowdfunding brings a totally new channel of fundraising for straterups which often encounter insufficient resources in the early stage (Cosh et al. 2009; Kim et al. 2017). Different from traditional fundraising channels, fundraisers even could build relationships with investors on the platform, and eventually become partners or customers. Moreover, entrepreneurs can also use crowdfunding to attract customers to participate in the development of new products to improve products and better meet consumer needs (Petruzzelli et al. 2019).

But it’s reported that the success rate of crowdfunding is very low. According to statistics of Kickstarter on August 2020, the overall success rate is only about 37.97%. It can be seen that initiating a crowdfunding project is not easy to succeed. Consquently, how to improve the success rate of crowdfunding projects is one of major concerns of all fundraisers. To achieve this goal, lots of researchers have paid attention on this issue. For instances, Liang et al. (2020) found that the number of pictures, the number of videos, the number of comments, and the number of updates have a positive impact on the success of crowdfunding, but the readability is negative. Fernandez-Blanco et al. (2020) found that comments and updates are helpful to the development of crowdfunding projects. Wang et al. (2017) believe that the text content of the crowdfunding project description will affect the decision-making of supporters. Lagazio and Querci (2018) detailed text content is more effective than introduction videos. Zhou et al. (2018) found that the length, readability, semantics of the project description and the fundraiser’s past fundraising experience will affect the success of fundraising. Kim et al. (2017) stated that detailed project descriptions have a positive impact on the success of fundraising projects. These studies reported that the project description will affect the success of the crowdfunding project, but there is no research to tell fundraisers which success factors should be included in the content of the project description.

Moreover, related works usually use qualitative research methods with data surveyed by questionnaires. In addition to requiring a lot of manpower and time, there may also be sampling bias. In recent years, text minig and feature selection have been succesfully applied to deal with the huge text reviews in social media (Wang et al. 2017). Feature selection mainly processes numerical data, although the natural language processing technology (NLP) in text mining technology can process text data. But, if we only use both feature selection and text mining approaches, we cannot obtain organized factors. The selected terms will be quite difficult to be explained, and the number of extracted features will be very huge (Chen et al. 2015).

Besides, in terms of total fundraising amount and number of projects, game projects have been ranked as top one in crowdfunding platdforms. Unfortunately, only few existing literatures focus on the research of game crowdfunding. To solve this problem, we present a lexicon-based feature selection method which aims to discover the crucial content features of project descriptions toward “video game” and “mobile game”.

To sum up, this study will collect real video games and mobile games projects as our research data. In our proposed method, we attempt to define “content features” that may affect the success of game crowfunding projects based on literature review, and establishing lexicons to determine the value of features. The natural language processing (NLP) technology in text mining can be used to process collect text data. Then, three feature selection methods including decision trees (DT), Least Absolute Shrinkage and Selection Operator (LASSO), support vector machine–recursive feature elimination (SVM–RFE) will be employed to select candidate key successful factors, and finally support vector machines (SVM) will be performed to evaluate the performance of candidate feature subsets. The discovered key success factors can provide fundraisers with a basis when establishing crowdfunding projects to help them increase the success rate of video game and mobile game projects.

2 Literature review

2.1 Crowdfunding

Crowdfunding is an innovative fundraising method for many entrepreneurs (Yang et al. 2020; Borrero-Dominguez et al. 2020). Traditional fundraising methods are very difficult for most of startups because they cannot propose equivalent values (Ramadani 2009). So, investors cannot assess risks (Schwienbacher and Larralde 2010). Unlike traditional fundraising channels, crowdfunding has lower costs and is very suitable for startups, non-profit organizations and individuals (Vanacker and Manigart 2010; Liang et al. 2020). Now, crowdfunding has become one of the most important fundraising methods (Cumming et al. 2020).

In crowdfunding, fundraisers initiate projects on platforms, express their products, services or ideas through projects, and use the power of the Internet to obtain funds (Ziegler et al. 2018; Hollas 2013; Colgren 2014; Mollick 2014; Mollick and Robb 2016). Gerber et al. (2012) believe that the use of crowdfunding can also build relationships and form a social network. While sponsors invest to obtain income or return, some sponsors wanted to gain recognition from others (Bretschneider et al. 2014). Results of crowdfunding could be considered as an indicator of a new product or service before entering the market (Mollick 2014; Meyskens and Bird 2015).

Most of researches aim to discover the success or failure factors of crowdfunding projects (Short et al. 2017), but there are also some studies that explore the contribution behavior and motivation of supporters (Xu et al. 2016; Cox et al. 2018; Shneor and Munim 2019). Other scholars have studied game crowdfunding projects. For examples, Lax (2017) interviewed crowdfunding creators in three different game industries. He found six success factors, including project rewards, project goals, the quality of products and projects, project team, fundraiser information and project preparation. Smith (2015) discussed the fundraising process of game fundraising projects and observed common occurrences between developers and players to study how interaction affects product development and production. Some published works (Wang et al. 2017; Kim et al. 2017; Lagazio and Querci 2018; Zhou et al. 2018) have reported that the project description is important for the success of the crowdfunding project, but no available studies can tell fundraisers which success factors should be included in the content of the project description. Therefore, this study will use the real game crowdfunding project for experiments and analysis, and try to discover key content features of game crowdfunding projects.

2.2 Factors of games

From the available literature, relatively few studies focus on game crowdfunding. In order to find the key successful factors of game crowdfunding projects, this study attempts to find the potential factors related to games from literature. For examples, Caci et al. (2018) analyzed players who play “Pokémon GO”, discussed the interaction between players’ play motivation, player’s personality, and play habits, and conducted research on individual differences in play motivation and personality characteristics. The work of Chan et al. (2017) shows the information, interactivity, and entertainments of mobile games have a positive impact on the trust and loyalty of players. Hsiao and Chen (2016) identified the factors that affect the perceived value and loyalty of games, including fun, flexibility of play time, interactivity, and rich rewards. Chen et al. (2010) defined 19 online game quality elements for massively multiplayer online role-playing games. Zhao and Fang (2009) showed that online game technical factors (game stories, game images, game duration, game control, and the quality of game service) has a significant impact on the game fun of players; game fun and social norms have a positive impact on game intentions; social norms, the quality of online game communities and game intentions are important predictive indicators for online game loyalty. Thereofre, we used results of literature review to build content features.

2.3 Text mining

Text mining can be regarded as a process of editing, organizing, and analyzing a large number of documents. Its main task is to convert text into data for analysis through language analysis and natural language processing (Dreisbach et al. 2019). In this era of information explosion, these large amounts of unstructured or semi-structured text data need to be processed with text mining technology to discover hidden knowledge (Thomaz et al. 2017). Miner et al. (2012) defined the main procedures of text mining as data retrieval and processing, word segmentation, feature selection, classification and clustering, text representation and interpretation.

Nowadays, lots of studies use text mining to find the potential characteristics in the introduction of crowdfunding projects. For examples, Wang et al. (2017) analyze the text content of crowdfunding project descriptions and the emotions of the project founders when making the project descriptions. Du et al. (2015) studied the quality and source credibility of the description of crowdfunding projects, and analyzed the impact on the success of crowdfunding projects. Text mining is also used in various areas. Loureiro et al. (2020) used text mining technology to analyze the full text of VR and AR-related journals and conference papers. Zhong et al. (2020) combined deep learning and text mining to automatically analyze the hazard records of building construction.

Moreover, because traditional questionnaire surveys are prone to experimental effects and the information brought by online text content will be more objective, massive, and less sample biased than using questionnaires (Schuckert et al. 2015). As a result, this study will use text mining methods to process the unstructured text content of crowdfunding projects. But, if we only use text mining approaches which acquire knowledge form term-document matrix (TDM), we cannot obtain organized factors, and the number of extracted features will be very huge (Chen et al. 2015). Therefore, this study proposed a lexicon-based feature selection method which uses NLP and lexicons to construct content features from project descriptions.

2.4 Feature selection

Feature selection aims to find important features from original feature set (Zhao et al. 2020). By using selected crucial features, it can improve the prediction accuracy of the classifier and reduce the training time (Devi and Sabrigiriraj 2018; Dash and Liu 1997). Therefore, this study will use three feature selection methods to find the important factors that will affect the success of crowdfunding from the project features and content features of crowdfunding.

2.4.1 Decision trees

DT is an easy-to-understand classification method. It is not only easy to use but also can quickly find rule conditions for high-dimensional data (Yang 2019; Ma et al. 2016; Sok et al. 2016). In this study, the leaf nodes of rule tree will be considered as important factors. Being one of famous feature selection methods, DT has been successfully applied in various fields. For examples, Kwon et al. (2020) used decision trees to determine important factors which would affect the shopping behavior of second-hand stores. Namazkhan et al. (2020) used decision trees to analyze key factors affecting household natural gas consumption. Chen et al. (2021) employed decision trees to select the crucial factors of increasing customers’ satisfaction from online customer reviews. Consequently, this study will use DT as one of the feature selection methods for crowdfunding projects.

2.4.2 LASSO

LASSO is a regression analysis method that can perform feature selection and regularization at the same time. The ultimate goal of reducing the variables to zero is to obtain the feature subset that minimizes the prediction error of the variable (Tibshirani 1996). LASSO obtains a more refined model by establishing a valve function, and reduces the sum of coefficients by the square of the least square method, and compresses the sum of absolute values of coefficients to less than the constant 1 (Wang et al. 2018).

LASSO has been suceesfully applied in various fields. For examples, McEligot et al. (2020) used LASSO to determine the most relevant variables that cause breast cancer. Schmidt et al. (2020) utilized LASSO to detect and identify global position system (GPS) fraud. Guenther and Sawodny (2019) employed LASSO to determine the important factors affecting the temperature comfort of open offices in Singapore. Therefore, this study will use SVM–RFE as one of the feature selection methods.

2.4.3 SVM–RFE

SVM–RFE is another famous feature selection technique. It can find relatively important features from a large number of features. Through the weight vector obtained during training, all the features are arranged in descending order, and the feature with the smallest coefficient is deleted in each down generation, and then retrained and sorted. Repeat the above steps to the end. We can get all the features in descending order (Frank et al. 2006; Witten et al. 2011). SVM–RFE has been successfully applied in many areas. For examples, Wang et al. (2019) used SVM–RFE to identify autism. Shao et al. (2017) applied SVM–RFE to predict the price of electricity in electricity market analysis. In the work of Liu et al. (2017), they used SVM–RFE to locate protein sub-cells in biomedicine. Chang et al. (2020) applied SVM–RFE as one of the feature selection methods to determine the important factors of influencing O2O trust (Chang et al. 2020). As a result, this study will use SVM–RFE as one of the feature selection methods.

2.5 SVM

SVM (Cortes and Vapnik 1995) is a supervised machine learning model used for classification and regression analysis. Because SVM can be applied to various linear and non-linear classification problems, and has a complete theoretical framework and tools, it has been widely used (Paul et al. 2016). For examples, Gamal et al. (2019) utilized SVM with other a variety of classifiers to perform sentiment analysis and classification of Arab tweets and comments on twitter. Vijayakumar and Muhammad (2019) employed SVM, NB and maximum entropy with natural language processing methods to indentify sapm comments on the online forum. Krammer et al. (2019) utilized SVM, RBF, and MLP to analyze online comments for abnormal behavior. To sum up, SVM has an excellent performance on text classification, so this study will use SVM to evaluate the performance of selected candidate feature subset.

3 Proposed methodology

This section will introduce the implementation procedure of the proposed lexicon-based feature selection method. The procedure shown in Fig. 1 can be divided into nine steps. First, we collect data from a real crowdfunding website. Next, the “project features” and “content features” have been induced from the relevant literature. In the content features, we focus on the aspect of the game’s success. Then, we set up the lexicons for content features, and compute attributes’ value based on built lexicons. Later, feature selection methods will be utilized to find the candidate feature subsets, and finally the SVM classifier has been performed to evaluate the performance of the selected feature subsets. After that, we can identify important features, and make specific suggestions accordingly. The following describes the steps of the method used in this study.

Step 1: Collect data

We use Kickstarter (https://www.kickstarter.com), one of the most popular crowdfunding platforms in the world, as the data sources. The scope of the collection is the “video game” project and “mobile game” under the “game” category in the platform. Therefore, this research will be divided into experiment #1 (video game projects) and experiment #2 (mobile game projects).

Python language will be employed to write a web crawler tool. The crawled data is divided into two parts. The first part includes content features which will capture the content of project descriptions. The second part involves project features, including the target amount, video clip, number of updates, number of comments, project duration, fundraising experience, number of pictures, description length, and number of investors.

Step 2: Define factors

This study divides the features into two parts, project features and content features. The project features will be induced from related literature of crowdfunding projects. The content features are from the literature related to the game theme to find out the aspects and words related to the success of the game. For content features, we will gather feature related keywords to build lexicons. In one lexicon, the keywords’ similar words, synonyms, and antonyms also will be included in the lexicon. Based on constructed lexicons, we can calculate the values of features. We’ll clarify how to define project fetaures and content feature, respectively, as follows.

Step 2.1 Define project features

According to relevant literature on crowdfunding, we find out project features that may affect the success of crowdfunding projects, and give clear definitions, since these features have not been used for game crowdfunding projects. The definitions of project features have been shown in Table 1.

Table 1 Project features

The key successful factors of video and mobile game crowdfunding projects using a lexicon-based feature selection approach

Abstract

Similar content being viewed by others

User-Generated Short Video Content in Social Media. A Case Study of TikTok

Imbalanced data preprocessing techniques for machine learning: a systematic mapping study

Conceptualising and measuring social media engagement: A systematic literature review

1 Introduction

2 Literature review

2.1 Crowdfunding

2.2 Factors of games

2.3 Text mining

2.4 Feature selection

2.4.1 Decision trees

2.4.2 LASSO

2.4.3 SVM–RFE

2.5 SVM

3 Proposed methodology

4 Results

4.1 Data collection and prepreparation

4.2 Data preprocessing

4.3 Feature selection

4.4 Evaluated by SVM

4.4.1 Results of experiment #1 (video game)

4.4.2 Results of experiment #2 (mobile game)

4.4.3 Concluding remarks

5 Discussions and conclusions

5.1 Discussions

5.2 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation