Abstract
In this paper, we consider a dataset compiled from online job adverts for consecutive fixed periods, to identify whether repeated and automated observation of skills requested in the job market can be used to predict the relevance of skillsets and the predominance of skills in the near future. The data, consisting of co-occurring skills observed in job adverts, is used to generate a skills graph whose nodes are skills and whose edges denote the co-occurrence appearance. To better observe and interpret the evolution of this graph over a period of time, we investigate two clustering methods that can reduce the complexity of the graph. The best performing method, evaluated according to its modularity value (0.72 for the best method followed by 0.41), is then used as a basis for the SCODIS framework, which enables the discovery of in-demand skillsets based on the observation of skills clusters in a time series. The framework is used to conduct a time series forecasting experiment, resulting in the F-measures observed at 72%, which confirms that to an extent, and with enough previous observations, it is indeed possible to identify which skillsets will dominate demand for a specific sector in the short-term.
Keywords
Co-funded by the European Union’s Horizon 2020 research and innovation programme under the QualiChain Project, Grant Agreement No. 822404.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
- 7.
References
Smith, D., Ali, A.: Analyzing computer programming job trend using web data mining. Issues Inf. Sci. Inf. Technol. 11, 203–214 (2014)
Sodhi, M.S., Son, B.-G.: Content analysis of OR job advertisements to infer required skills. J. Oper. Res. Soc. 61, 1315–1327 (2010)
Coulter, N., Monarch, I., Konda, S.: Software engineering as seen through its research literature: a study in co-word analysis. J. Am. Soc. Inf. Sci. 49, 1206–1223 (1998)
Blondel, V., Guillaume, J.-L., Lambiotte, R., Lefebvre, E.: Fast unfolding of communities in large networks. J. Stat. Mech: Theory Exp. 2008, P10008 (2008)
Sibarani, E.M., Scerri, S., Morales, C., Auer, S., Collarana, D.: Ontology-guided job market demand analysis: a cross-sectional study for the data science field. In: SEMANTiCS, pp. 25–32 (2017)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E 69, 026113 (2004)
Xu, X., Yuruk, N., Feng, Z., Schweiger, T.A.J.: SCAN: a structural clustering algorithm for networks. In: KDD, pp. 824–833 (2007)
Callon, M., Courtial, J.P., Laville, F.: Co-word analysis as a tool for describing the network of interactions between basic and technological research: the case of polymer chemistry. Scientometrics 22(1), 155–205 (1991)
Lee, P., Lakshmanan, L.V.S., Milios, E.E.: Incremental cluster evolution tracking from highly dynamic network data. In: ICDE, pp. 3–14 (2014)
Hopcroft, J., Khan, O., Kulis, B., Selman, B.: Tracking evolving communities in large linked networks. Proc. Nat. Acad. Sci. 101(suppl 1), 5249–5253 (2004)
Dadzie, A.-S., Sibarani, E., Novalija, I., Scerri, S.: Structuring visual exploratory analysis of skill demand. Web Semant. 49, 51–70 (2018)
Fortunato, S., Lancichinetti, A.: Community detection algorithms: a comparative analysis. In: The Fourth International ICST Conference on Performance Evaluation Methodologies and Tools, pp. 27:1–27:2 (2009)
Sibarani, E.M., Scerri, S.: Generating an evolving skills network from job adverts for high-demand skillset discovery. In: Cheng, R., Mamoulis, N., Sun, Y., Huang, X. (eds.) WISE 2020. LNCS, vol. 11881, pp. 441–457. Springer, Cham (2019). https://doi.org/10.1007/978-3-030-34223-4_28
Todd, P.A., McKeen, J.D., Gallupe, R.B.: The evolution of IS job skills: a content analysis of IS job advertisements from 1970 to 1990. MIS Q. 19(1), 1–27 (1995)
Wowczko, I.A.: Skills and vacancy analysis with data mining techniques. Informatics 2, 31–49 (2015)
Aken, A., Litecky, C., Ahmad, A., Nelson, J.: Mining for computing jobs. IEEE Softw. 27(1), 78–85 (2010)
Surakka, S.: Analysis of technical skills in job advertisements targeted at software developers. Inform. Educ. 4(1), 101–122 (2005)
Kennan, M.A., Willard, P., Cecez-Kecmanovic, D., Wilson, C.S.: A content analysis of Australian IS early career job advertisements. Austr. J. Inf. Syst. 15(2), 169–190 (2009)
Asur, S., Parthasarathy, S., Ucar, D.: An event-based framework for characterizing the evolutionary behavior of interaction graphs. In: KDD, pp. 913–921 (2007)
Kim, M.-S., Han, J.: A particle-and-density based evolutionary clustering method for dynamic networks. In: VLDB, pp. 622–633, August 2009
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Sibarani, E.M., Scerri, S. (2020). SCODIS: Job Advert-Derived Time Series for High-Demand Skillset Discovery and Prediction. In: Hartmann, S., Küng, J., Kotsis, G., Tjoa, A.M., Khalil, I. (eds) Database and Expert Systems Applications. DEXA 2020. Lecture Notes in Computer Science(), vol 12392. Springer, Cham. https://doi.org/10.1007/978-3-030-59051-2_25
Download citation
DOI: https://doi.org/10.1007/978-3-030-59051-2_25
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-59050-5
Online ISBN: 978-3-030-59051-2
eBook Packages: Computer ScienceComputer Science (R0)