Introduction

The ongoing European project JuxtaLearn aims at fostering learning and curiosity in different fields of science (or STEM) by combining or “juxtaposing” the understanding of domain concepts with performing. Concretely, the students’ performance is substantiated in the form of creative video making and editing activities. We see this way of learning by performing and presenting as a variant of Papert’s “constructionism” (Papert and Harel 1991) and as similar to learning by teaching (Gartner et al. 1971). In this context, we are interested in studying the role of video as a medium for learning in different (including passive) forms of usage.

The design of learning activities in JuxtaLearn is guided by previously identified threshold concepts (Meyer and Land 2003). Threshold concepts are the basis for reinforcing deeper understanding and further creative production through scaffold reflections focused on essential elements. To identify such concepts and to explore how they are understood and appropriated by teachers and students, a series of face-to-face workshops have been conducted. After an initial workshop with science teachers, a second workshop also involved a group of six A-level students. As misconceptions surrounding threshold concepts are difficult to pin down, the workshop was structured through a role reversal in which the students taught the teachers, to elicit a deeper understanding of the gaps in the students’ knowledge. In this context, learning analytics techniques have been used to extract structured representations of the underlying conceptual relations.

From another angle, we have also tried to identify processes of understanding around videos by analyzing existing web-based learning communities, namely the Khan Academy. Since the videos are also available on YouTube EDU as a less “educationally guided” environment, we were able to compare comments from both contexts. For our analysis, we only look at comments, e.g., questions and answers from discussions on the video page and not the videos themselves. We are particularly interested in extracting information from these texts to shed light on the following aspects:

  • Associations of concepts as indicators of the authors’ mental constructs (which may be adequate or inadequate from a scientific point of view);

  • Concepts that are frequently addressed in questions as indicators for possible origins of problems of understanding;

  • Associations between concepts often used in answers as indicators for missing relations in the original understanding.

We use these indicators to infer possible misconceptions or “stumbling blocks.” As known from classical learner modeling (cf. Wenger 1987), we have to distinguish between missing knowledge about concepts and/or relations and misconceptions as often idiosyncratic constructions of incorrect knowledge. An example of a misconception (beyond missing knowledge) would be an incorrect association between two or more concepts.

Related work and background

Conceptual change

When learning new and complex concepts in school, students interpret the knowledge taught in relation to their prior knowledge (Krüger 2007). In everyday life, they might simply add newly gained knowledge to their existing knowledge base, if some new information is combinable with the existing knowledge. This process is called enrichment (Chi 2008; Inagaki and Hatano 2008; Carey 1985). But the typical school situation is different, here, the students have to face cognitive conflicts, because the corresponding knowledge structures often differ from each other (Vosniadou 2007). Vosniadou and Brewer (1992) offer an illustrative example for such a cognitive conflict and the possible resulting changes of conceptual models: if children learn that the earth is a sphere, this information may be in contradiction with their everyday knowledge, since they do not fall down from earth. This conflict ends up in two ways: either the formation of misconceptions, erroneous conclusions (Chi 2008), or a revision of prior knowledge, which describes the transformation from wrong into correct knowledge (e.g., Chi 2008; Vosniadou 2007; Hatano and Inagaki 2002; Carey 1985). Vosniadou and Brewer (1992) report that some children, who do not combine but instead just add new information, created a new mental model with two worlds as a result—one flat earth, where people can not fall down from and another one rotating as a solar object in the sky. Consequently, these misconceptions are based on false beliefs, accumulated false beliefs, or ontological miscategorisation (Chi 2008). Concurrently the conflict provides the possibility for a revision, which means that students restructure their existing knowledge, in this case with regard to the law of gravity, which leads to the correct knowledge. This revision can be supported by the influence of social processes (Hatano and Inagaki 1997) as they are envisioned in the JuxtaLearn learning process, and by the meta-conceptual awareness of own beliefs (Vosniadou 2003). The question remains how to make the students’ conceptual models visible for them or even their teachers? Since domain knowledge in the field of STEM learning has to be constructed in the form of complex relations (e.g., structural, temporal, or causal) between a variety of concepts, we conclude that text networks based on online comments are suitable to reveal the student’s knowledge and to represent their problems of understanding.

A network perspective on conceptual models and conceptual change

In various scenarios of learning, knowledge building and knowledge production, humans externalize their knowledge in terms of “knowledge artifacts,” which are often represented in the form of texts, and thus susceptible to being analyzed by text mining (Heyer et al. 2006). Content analysis, as a form of artifact analysis, can be used to reduce qualitative textual data into clusters of conceptual categories aiming to unfold patterns and relationships of meaning (Julien 2008). Although several (semi-)automated methods can be used to detect these patterns from content, e.g., statistical methods based on the Vector Space Model (VSM) or Latent Dirichlet Allocation (Blei 2012) as a probabilistic method, to date these methods have barely been used on learning data (He 2013). Content analysis in the context of learning data has primarily been used for the clustering of resources, e.g., the grouping of e-learning resources according to their similarity (e.g., Hung 2012; Tane et al. 2004). Sherin (2012) found that even without using semantic background knowledge, a VSM-based clustering of spoken word transcripts is an adequate instrument to identify student’s concepts and the dynamics of their mental constructs. Additionally, He (2013) provided evidence that similar techniques are suitable for grouping learners’ main topics in student-to-teacher online questions and peer-to-peer chat messages related to online video learning lessons.

The aforementioned methods are based on the “bag of word” model, in which the given order of words in a text is of no relevance to the analysis (Blei 2012). A method that takes into account the words’ positioning is “Network Text Analysis” (NTA). NTA is a text mining method, which is based upon the assumption that knowledge can be modeled as a network of concepts (Carley et al. 2013a). Against this background, a concept is a single idea, which is represented by one or more words in a network (nodes). The links representing semantic relationships between these words (edges) are differing in strength, directionality, and type based on the words’ position to each other in the text (Carley et al. 2013a). The union of all relations builds the semantic network (Carley et al. 2013a), similar to the relational network of a concept map. Similar to text networks, concept maps are networks, in which knowledge is represented by concepts and their relationships to each other. They differ from text-based semantic networks inasmuch as they are arranged hierarchically with the ontological root concepts at the top (Novak and Cañas 2008). In the context of knowledge construction research, concept maps are often used to trace the student’s knowledge development (Engelmann and Hesse 2010; Engelmann et al. 2009; Schreiber and Engelmann 2010).

Jacobsen and Kapur (2010) have suggested to conceive learners’ mental models or “ontologies” as scale-free networks, which would allow applying known characteristics of such networks to theories of conceptual change. According to Barabási (2009), the evolution of scale-free networks can be explained by the mechanisms of “preferential attachment.” Applied to learners’ ontologies, preferential attachment means that newly learned concepts are most frequently associated or linked to those concepts that are already more densely connected than others. From this, Jacobsen and Kapur (2010) conclude that such “hubs,” i.e., nodes with a relatively high degree centrality, represent root categories of knowledge domains. Hoppe et al. (2012) support this theory in a study in which the volunteers had to create concept maps on the subject of global warming. This study clearly showed a scale-free nature of the maps in terms of an inverse power law degree distribution (a known structural characteristic of scale-free networks). This implies that there are more hubs than to be expected in a randomly connected network. Also, Hoppe et al. (2012) could show that certain graph-theoretical structural measures known, correlate with quality judgements of the maps by independent experts. Interestingly, the “density” measure shows a highly significant negative medium correlation with criterion of map “completeness.” Again the scale-free model provides a clear explanation: In a growing scale-free network, the density is anti-proportional to the size of the network, i.e., the smallest networks will show the highest density. Based on this characterization, the authors hypothesize that newly appearing hubs represent ‘hot spots of conceptual change’ (Hoppe et al. 2012, p. 297), whereby this change describes a restructuring process, in which learners revise their false beliefs and misconceptions on the relational or ontological level (Chi 2008). If the number of edges around a node is reduced suddenly, this may indicate a qualitative change of understanding or paradigm shift (Hoppe et al. 2012). Viewing concept maps as networks allows for applying a variety of techniques known from Social Network Analysis (cf. Wasserman and Faust 1994). As an example, the betweenness centrality measure is suggested as a possible indicator to identify ‘bridge concepts’ that link different knowledge domains (Hoppe et al. 2012). It is important to note that the results of NTA (see above) are also networks that can be further analyzed in the same way as concept maps. This would also allow the comparison of textual input (e.g., from student wikis) with concept maps.

Our basic idea and approach is to use content analysis techniques to generate network representations from knowledge artifacts originally created by students or experts and to apply structural and differential (comparative) measures to these representations in order to detect similarities or mismatches. In this approach, expert maps or ontologies can be used as “normative” references for comparison, e.g., to indicate deviations from standard domain knowledge and possibly possible misconceptions. Regarding the evolution of maps, also certain structural features and anomalies, can also be detected.

Network text analysis

Method

We have used the method of “Network Text Analysis” (NTA) as a basis of our analyses of textual artifacts. As defined by Carley et al. (2013a, b), the NTA workflow consists of three main steps: (1) data selection and extraction, (2) text pre-processing and (3) network analysis, which we have applied by the AutoMap/ORA toolset for NTA (Carley et al. 2013a, b). As a first step, the data of interest will be selected, depending on the terms of reference context, for instance, the selection of comments belonging to a certain person or video. In the second step, the pre-processing functions are intended to prepare the textual data for subsequent analyses. Unneeded and unwanted concepts will be reduced removed through simple text cleaning functions such as the removal of extra spaces. Furthermore, this step serves to apply (a) a stemming for reducing words to their root stem by removing suffixes from words, (b) a delete list, which is required for the removal of non-relevant stop words (articles, auxiliary verbs, etc.), and (c) a manually generated thesaurus, used for replacing synonym concepts with the more standard form, for combining n-grams and to correct spelling errors. The next step of pre-processing is the identification and classification of concepts. Relevant concepts will be detected by analyzing the words’ frequencies based on the following principle: words and n-grams that appear more than x-times are considered as relevant and will be included into further analyses, whereby x depends on the size of the corpus. The classification is done by the determination of categories based on the words appearing in a concept list that includes the frequency of all words and can be reduced to the most important key words using a threshold defined by the researcher. After specifying the categories, every single concept will be assigned to one of them. Therefore, an ontology-based meta-thesaurus will be created, which is later used for generating the network. As a result of the processes described before, multimodal networks will be created, whereby the modality of the network depends on the number of categories that can be identified by the researcher.

The analysis process has been applied to the transcripts of an initial role reversal workshop surrounding the STEM topics “moles” in chemistry, “potential difference” in physics and “cells” in biology. In each lesson, two students with different school marks (both excellent/mixed/both middle) taught two teachers. The transcripts were analyzed with the method described above.

Results

As a result of this analysis process, a multimodal network of categorized concepts was generated, graphically represented using ORA. Categories defined are: pedagogical concepts, domain concepts, general concepts (i.e., concepts that are neither domain specific nor pedagogic concepts), tools, and actors, whereby we have declared actors and domain concepts as the most relevant categories. These categories are represented in a meta-thesaurus derived from ontology. Within the thesaurus, the actor category represents all acting persons in the lessons; teachers have been labeled as T1–6, the researcher staff as R1–4 and students as S1–6. The domain concept category represents discipline-specific topics associated with the lesson subjects.

The number of connections between one actor and surrounding topics (also called “degree”) indicates the thematic richness of an actor’s contributions. Regarding the whole semantic network, the degrees reflect that both physics students (S5 and S6), as well as one of the biology students (S1), who score high in their school marks,also have a higher total degree in the network than the other students with middle school marks. Figure 1 depicts an excerpt of the resulting network, a two-mode network of actors and domain concepts, which contains 16 actor nodes (black circular nodes) and 71 domain concept nodes (rectangular). This excerpt only shows six actors (students) and the domain concepts with a connection to at least one of these actors. In Fig. 1, every subject area and corresponding sub-activity in the workshop (chemistry, biology, and physics) form a cohesive cluster in the overall network as illustrated by gray circles in Fig. 1.

Fig. 1
figure 1

Partial view of a network generated from a teacher-student workshop, depicting connections of students (S1 to S6) to domain concepts in blue (two-mode network) and relations between domain concepts in grey. (Color figure online)

Other relevant structural properties of the extracted network are nodes that represent hubs (nodes with a high total degree centrality) or nodes that bridge over between other concepts or between areas of discourse; which in this case are the domains (high in betweenness centrality). Figure 2 shows the top 6 knowledge items ranked according to their total degree centrality and betweenness centrality in the domain concept × domain concept network.

Fig. 2
figure 2

Domain concepts with highest betweenness centralities (on the left) and highest total degree (on the right) in a domain concept × domain concept network

Concerning total degree centrality, we could find the three main topics of the workshops “mole,” “potential difference,” and “cell” within the top 6 ranking. Furthermore, the nodes with a high betweenness centrality seem to bridge over different ontological areas. Mole, for instance, builds a fundamental connection between the physics-related cluster and the biology’s cluster. Furthermore, cell, second highest in betweenness centrality, also connects the three discipline clusters with each other.

Based on this first example, we claim that text networks can not only represent and characterize the specific foci of the workshops, but also provide ontological information, which is related to the student’s conceptualization of specific science topics.

Analysis of video comments

Content analysis

While the NTA workflow described above was conducted on material generated in the context of the JuxtaLearn project, for further content analysis, we focused on material found outside our project, therefore, externalizing our analysis. The approach to content analysis presented here focuses on the attention given to a specific topic in an online discussion around learning materials which in this case are represented by online discussions, composed of comments, and around educational videos. This approach aims to extend the NTA workflow by focusing on the content of single comments instead of the complete discussion.

We used a quantitative approach implementing regular expressions-based matching with the concept lists constructed during the NTA workflow and the preprocessed text base which was generated during this workflow. In principal, this generates a sparsely populated feature vector for each comment based on the occurrence frequency of concepts from each list. We use these vectors to construct a measure we call “semantic richness” which tries to quantify the relation between the domain specific concepts and the general concepts mentioned in each comment. This measure follows the assumption that a comment that uses more domain concepts than general concepts compared to a different comment which uses a lesser ratio of domain and general concepts is more “on topic” thus semantically richer. During our analysis of external data, we derived different methods of evaluating this ratio and the following will present our results.

Khan Academy and YouTube videos

As a starting point for our content analysis, we have chosen Khan Academy’s learning videos and the accompanying discussion in the comments section. The Khan Academy website provides a significant amount of videos on different STEM topics and offers the option to enter into a learning dialog with other students, the same scenario which is envisioned for the JuxtaLearn platform. The website offers message boxes below the videos to enter this dialog through a scaffold question/answer construct rather than an open comment section. The videos themselves are hosted on YouTube EDU, which also enables users to comment on the same videos without assistive scaffolding. The library covers science topics such as biology, chemistry, and physics on different levels ranging from junior high school to university and holds more than 4.300 videos with an average length of 10 min (Khan Academy 2013). For this case study, we used both three videos with the titles “The Mole and Avogadro’s Number,” “Diffusion and Osmosis,” and “Voltage - Difference between electrical potential (voltage) and electrical potential energy.” In total, we have extracted 1.284 comments Khan Academy’s web service. These textual artifacts have been used as a sample for our analysis. Khan Academy provides the aforementioned scaffold discussion elements and encourages discussion on the topic of the video, which means we can assume that artifacts extracted from the discussion will be about the topic of the video as well. Table 1 gives an overview of the amount of artifacts used in our study. It shows the number of questions and answers for each area the video topic can be assigned to, as well as their sum (number of comments) for easy comparison with other sources.

Table 1 Type and number of extracted comments on STEM videos at Khan Academy’s website

YouTube EDU is a subsection of YouTube that focuses on educational videos. The idea behind it is to provide everything related to education, spanning from short lessons for supplementing school learning to full courses from universities, and other professional material from educators around the globe. As mentioned above, this includes the Khan Academy videos. For the purpose of comparison, we extracted video comments (using the appropriate Google API) for the same videos. In total we got 1.201 comments from YouTube EDU distributed among the three videos as shown in Table 2. YouTube EDU does not provide any scaffolding for discussion resulting in a mixture of comments and smaller discussion compared to Khan Academy’s strict question/answer construct.

Table 2 Subject and number of extracted comments on STEM videos from YouTube EDU

Data sampling and first observations

The aforementioned strict construct of questions/answers seems to be enforced by a strong moderation by the staff of Khan Academy, because the discussion artifacts have a very low amount of noise, e.g., spam comments. This is reflected by the high amount of very short and poorly written comments extracted from YouTube EDU. Additionally, the artifacts extracted from Khan Academy seem to focus on the video’s topic rather than on the video itself, while the opposite is true for comments from YouTube EDU, which mostly contain short comments that focus on e.g., the quality of the video. This observation can be confirmed by analyzing the content based on the list of domain, and other concepts, we constructed during the NTA workflow. Employing a quantitative approach we counted the occurrences of general and domain concepts in artifacts from both platforms, representing these values as feature vectors for general and domain concepts. Table 3 shows the results from this approach.

Table 3 Results for our semantic richness calculations

The two combinatory measures were introduced to create a singular value describing the semantic richness. We introduced two different formulas for both versions of feature vectors (those using all occurrences, therefore, with duplicates/those using only the first occurrence), the first normalizes the value to a range of 0–1, but favors shorter comments as a comment containing only a domain concept will have a semantic richness of 1. The second favors longer comments, but the value is uncapped, meaning it can grow infinitely when only domain concepts are found (the +1 is needed to avoid division by 0).

The two combinatory measures were introduced to create a singular value describing the semantic richness. We introduced two different formulas for both versions of feature vectors (those using all occurrences, therefore, with duplicates/those using only the first occurrence), the first normalizes the value to a range of 0–1, but favors shorter comments as a comment containing only a domain concept will have a semantic richness of 1. The second favors longer comments, but the value is uncapped, meaning it can grow infinitely when only domain concepts are found (the +1 is needed to avoid division by 0).

The numbers clearly indicate both longer and semantically richer artifacts on Khan Academy, which might be a result of the platforms focus on provision of educational content and discussion or simply the suspected strong moderation.

Introduction of “Signal Concepts”

Idea

The idea behind the introduction of signal concepts to supplement our use of general and domain concepts, is the observation that certain general concepts (e.g., related to explanation request) may indicate specific problems of understanding. These signal concepts occur in combination with one or more domain concepts and “signal” a specific relationship either between the author and a domain concept or between two domain concepts. This distinction is reflected by two types of signal concepts: unary and binary. Unary signal concepts refer only to one domain concept and typically express a specific information need or problem of understanding in the part of the author. Binary signal concepts reference two domain concepts in combination or inter-relation. Examples of unary signal concept are “help_needed” or “explain,” which may indicate that the author has a problem in understanding the connected domain concept. A typical example for the binary type is “difference_between,” which may indicate that the author thinks or inquires about a difference between two domain concepts. Signal concepts provide a resource for teachers interested in learning about potential problems that their students have. Thus, our approach indicates a topic or a domain concept that may be worth focusing on in a future lesson.

Figure 3 gives a partial view of an exemplary network of type (signal concept) × (domain concept) focusing on “difference_between.” This network highlights an inherent problem with analyzing signal concepts through our standard NTA workflow: many instances of “difference pairs” will appear around the signal concept difference_between, but as a consequence of the underlying bi-partite graph representation we cannot formally distinguish these pairs without using semantic background knowledge (e.g., to separate out “osmosis” and “diffusion”). That is, the NTA workflow aggregates all occurrences of a signal concept into one single node in a kind of combinatorial multiplication that does no longer show the original instance pairs.

Fig. 3
figure 3

Network showing an excerpt centered around “difference_between”

Approach

To avoid the “combinatorial multiplication” effect, we have replaced the NTA analysis at this stage by a tailored method that maintains the original context in which the pairs of domain concepts associated to a binary signal concept appeared. To visualize these pairs, we introduce “combination nodes” between the signal concept and corresponding pairs of domain concepts. The underlying analysis process is similar to NTA in that it also uses a text window running over the text source (i.e., a preprocessed comment) to detect the co-occurrences in this context. We do not use the whole comment as a context, since some comments are quite long and take up several different issues. For the results presented below, we used a window size of seven words from the preprocessed text, meaning the signal concept and the domain concept(s) must share a context with at most five words between them.

In addition, we have refined and extended this approach by again using the lists of domain concepts and general concepts constructed for the NTA analysis together with a matching algorithm that uses regular expressions to find and highlight patterns of domain and signal concepts in the comments. This algorithm transforms the preprocessed text into a compact string representation by replacing each word with a single letter. We used six letters (G, D, S, T, M, and O) which represent general concepts (G), domain concepts (D), unary signal concepts (S), binary signal concepts (T), binary signal concepts that need to be in the middle of two domain concepts (M), and other (O). The patterns, we used a selection of three patterns for the matching on these string representations, are as shown in Table 4, and they can be broken down into two different schemas, for unary and binary signal concepts. The rules reflect different orders of occurrence in the text for these two types of signal concepts.

Table 4 Patterns used for digging up signal concepts from artifacts

These patterns were then used to highlight the signal and domain concept(s) in their original context as an additional way of visualizing possible problems in a human readable form.

Results

For the first approach, we generated a new network around the same signal concept “difference_ between” to illustrate the usefulness of our approach and specifically the introduction of the new combination nodes. Figure 4 only shows the “cloud” of concept pairs around the concept difference_between and uses combination nodes to separate the corresponding pairs.

Fig. 4
figure 4

Network including blue combination nodes hovering around “difference_between”. (Color figure online)

The red node represents the signal concept that this network focuses on while the purple nodes are the new combination. The size of these combination nodes represents their occurrence in all of the analyzed texts. This means nodes will be larger the more often the combination of the signal concept and both domain concepts occurred in the specified text window. Blue nodes show all the domain concepts that were mentioned along with the signal concept “difference between” in the data we analyzed.

The second approach resulted in a list of comments with co-occurring signal and domain concept(s), containing highlighting for the referenced concepts. An excerpt of this list is shown in Table 5, illustrating the usefulness of this approach. It puts our preprocessed text with highlighting next to the original comment to indicate the detection of a signal and domain concept but also to show the original context both as a means of feedback for us by showing the validity of our approach, as well as a possible source of information in a learning analytics context.

Table 5 Highlighting of signal and domain concepts next to their original context

Ontology extraction from Wikipedia

Idea

While the results described above were promising, the NTA workflow relied on automatic extraction, but required a manual classification of domain concepts. For a fully automatic approach, we employed the idea of extracting an ontology based on external sources. We think the approach of automatically extracting an ontology from available resources might be a viable solution. YAGO uses Wikipedia and WordNet (Suchanek et al. 2007), but focuses on describing persons. Therefore, the included taxonomy is based on entities while we require a domain vocabulary instead. While it will be interesting to explore the possibility of extracting a subset of YAGO that suits our needs, instead we decided to implement a very basic approach to evaluate the feasibility of ontology extraction from Wikipedia before evaluating YAGO’s suitability.

We decided on using Wikipedia and its structure of categories for the same STEM disciplines as before, specifically: biology, chemistry, and physics, to extract a domain vocabulary, we could use in our content analysis.

Method

We used a web crawling technique to extract all categories for the three disciplines mentioned and declared the name of the pages related to these categories as words for our new domain concept dictionary. To keep the possibility of expanding this domain dictionary into a domain ontology, we decided to mirror the structure of categories and subcategories in a taxonomy and declared these classes for a domain ontology. The page names we extracted were then treated as instances of these classes, representing the words in our dictionary.

For the results described below, we used the top level category and pages, and then used their subcategories as a seed for a second iteration and did the same twice more. This means we used the category “Biology” and three levels of subcategories and their related pages as the source for our basic ontology.

Results

The extracted ontology contains more than 180.000 page names represented as instances. Using a different set of comment data, again extracted from videos stored on Khan Academy and YouTube, we compared a similar analysis as described before on a total of 5,152 comments. Here, semantic richness is no longer calculated using an interactively (or “manually”) defined domain vocabulary but using the automatically extracted much larger domain ontology instead. As for signal concepts, we do not see how to automate the selection process, so we still rely on the same list used before.

The results for semantic richness are shown in Table 6, while Fig. 5 shows the newly calculated network of signal concepts with the combination nodes are sized according to their occurence value.

Table 6 New results for our semantic richness calculations
Fig. 5
figure 5

New (reduced) network with scaled combination nodes around “difference_between”

Since we do not have positive list or dictionary of all general concepts, we consider all words that are not either stop words or represent domain concepts to be general concepts. This new signal concept network was too complex, so it had to be reduced for the visualization to be human readable. We have removed combination nodes (pairs) with less than five occurrences.

Table 7 shows an excerpt of a ranked list of domain concept combinations around “difference_between” found in the comments of physics and chemistry videos.

Table 7 Occurence values for combination nodes

Conclusion

In our first study, we aimed to find evidence that NTA is a useful instrument to identify and analyze students’ conceptual understanding, while learning a specific science topic. Similar to a student’s self-created concept map, (semi-)automatically generated text networks provide information about the learners’ conceptions of the domain. As theoretically expected, we could corroborate that hubs found in the ensuing concept networks indeed represent ontological root categories, e.g., the concept of “mole” that was the main topic of a role reversal lesson and consequently reaches a high value for total degree. Furthermore, given betweenness centrality values support the theory that nodes high in this measure indicate bridges between different ontological areas. Overall, we could conclude that the analysis of a network’s structural features provides important information on ontology development. This led us to a second study, in which we investigated the impact of differentiating between various categories.

While investigating general concepts of comments in a discussion around publicly available educational videos, we could identify an ontological subcategory of signal concepts among frequent general terms high in degree. These words (such as “explanation” or “difference_between”) indicate a certain type of concepts, which in conjunction with domain concepts can help to identify problems of comprehension. By means of extending the NTA by a tailored analysis, we receive indicators for drawing conclusions as to which (pre-)knowledge the students might miss or which concepts are particularly difficult to distinguish (such as “osmosis” and “diffusion”).

As a next step on our research agenda, we will particularly look at the progression of concept networks over time. Particularly deviations from “normal” progression following the preferential attachment principle (i.e., disappearing hubs or new clusters of high connectivity) will be analyzed to enable a better understanding for the evolution of the students’ conceptual models.