© 2026 The authors. This article is published by IIETA and is licensed under the CC BY 4.0 license (http://creativecommons.org/licenses/by/4.0/).
OPEN ACCESS
This study presents the design, implementation, and evaluation of an ontology-based digital learning platform for teaching Kazakh language in primary school. The platform is developed using ontology modeling tools, including OWL, RDF, and Protégé, and integrates adaptive learning mechanisms to enhance student engagement and learning outcomes. A quasi-experimental pre-test/post-test design without a control group was employed, involving 200 students across grades 2–4 in Astana. The platform organizes educational content semantically, covering six language competencies: vocabulary, grammar, spelling, reading, writing, and text comprehension. Statistical analysis revealed significant improvements between pre- and post-tests (p < 0.001), with average performance gains of 16–21% and Cohen’s d effect sizes up to 1.14, indicating moderate to large educational impact. The study demonstrates that ontology-based structuring facilitates automated content generation, semantic feedback, and personalized learning paths. Moreover, the platform supports interpretable reasoning for educators, enabling targeted interventions and monitoring. The results provide preliminary evidence that integrating curriculum-aligned ontologies into intelligent learning systems enhances language skill acquisition, independent learning, and classroom engagement. This research contributes a scalable framework for developing educational technologies tailored to agglutinative languages, offering both practical guidance and a foundation for future AI-supported learning tools in primary education.
educational ontology, Kazakh language, digital learning, adaptive learning, semantic structuring, primary school, intelligent tutoring systems
In today's society, young schoolchildren are increasingly immersed in digital entertainment, reducing the amount of meaningful verbal interaction. Children aged 5 to 8 spend an average of more than two hours a day using mobile devices, primarily passively consuming media content [1]. This practice often replaces face-to-face communication and limits opportunities for natural speech development. As a result, educators note increasing difficulties in elementary school students with articulation, grammatical accuracy, and speech comprehension [2, 3]. UNESCO also warns that excessive use of screen devices without pedagogical support negatively impacts the development of oral language and critical thinking in children [4, 5]. These issues are particularly noticeable in the teaching of the Kazakh language, where modern and high-quality digital learning materials are still insufficient. At the same time, the use of artificial intelligence within language education has been growing steadily. In Kazakhstan, this development is reflected in recent achievements in Kazakh speech recognition [6] and in the creation of ontologically structured linguistic resources [7].
International research also confirms the effectiveness of intelligent tutoring systems (ITS) in primary schools [8]. However, for agglutinative languages such as Kazakh, existing solutions are often fragmented, not aligned with the curriculum, and do not reflect complex morphological relationships [9-11]. Consequently, the opportunities that AI technologies could offer to education do not fully align with the practical conditions faced by teachers and students in Kazakhstan [12, 13].
One promising approach to solving this problem is the development of educational resources based on ontologies. Recent research has shown that intelligent tutoring systems supported by artificial intelligence can enhance learning processes and provide adaptive support for students [14]. For Kazakhstan, such tools hold significant value not only in terms of pedagogy, but also for supporting the broader development and visibility of the Kazakh language in the digital environment [15].
In this context, the present study focuses on developing an ontological model that corresponds to the Kazakh language curriculum for primary school and is grounded in the official textbooks for grades 2–4. The paper addresses the following research questions:
RQ1. What linguistic elements should be represented in the educational ontology of the Kazakh language?
RQ2. What types of semantic relationships are most effective for ontological modeling of linguistic concepts?
RQ3. How can the created ontology be integrated into intelligent learning platforms?
Key findings of the study include:
Thus, the presented study contributes to the development of ontology-based digital learning tools that support structured language learning and feedback for primary school students.
Advances in digital technologies, the emergence of semantic methods, and the introduction of innovative pedagogical practices have considerably reshaped approaches to language instruction. However, the level of their integration and actual effectiveness depends on the specific features of a particular language, the educational environment, and the architecture of the systems themselves. This section examines the main areas of research related to the application of ontologies and semantic models in education, personalized learning, and adaptive technologies.
Many researchers note that the ontological approach helps organize educational material and formalize subject knowledge. Thus, Ahmed and Kovacs [16, 17] showed that the use of ontologies in learning management systems (LMS) contributes to better organization of content and automatic establishment of links between concepts. Wongthongtham et al. [18] applied ontology to personalize learning by offering students materials based on semantic recommendations. However, these approaches are mainly focused on structuring content and do not take into account the features of linguistic relationships and grammatical dependencies.
A study by Tlepbergen and colleagues [19] examined the use of digital technologies in language policy in Kazakhstan, but did not develop ontological learning models. Overall, the existing body of work does not fully correspond to the primary school curriculum and has not yet been explicitly directed toward enhancing the language abilities of young learners studying Kazakh.
Work on semantic modeling demonstrates the potential for creating more meaningful and coherent learning materials. Bhardwaj et al. [20] proposed a model of semantic data interoperability, while Bratland and El Gami [21] and Kim et al. [22] explored ways to structure learning texts to enhance conceptual understanding. These studies have shown that semantic organization of knowledge helps students better understand material and be more engaged in the learning process. However, most such models are not specifically designed for language teaching and do not take into account the characteristics of agglutinative languages, where morphology and semantics are closely linked. Therefore, their use for developing Kazakh language skills remains limited.
In recent years, systems that allow for the adaptation of the learning process to each student have been actively developed. Jiang [23] and Murtaza et al. [24] proposed personalized e-learning models, and Ehsan and Li [25, 26] developed methods for predicting academic performance and creating individual learning paths. Such solutions do improve learning effectiveness, but in many cases, they rely on so-called "black boxes"-algorithms whose operation remains opaque to users. This makes it difficult to explain to students why the system suggests a particular answer or correction, especially when learning grammar. For elementary schools, where clear and explainable learning is essential, such models prove insufficiently pedagogical.
Research in the field of optimization approaches and digital educational ecosystems [27-31] shows significant progress in integrating data analysis methods and creating scalable learning platforms. Such systems cope well with processing large volumes of information and adapting to various conditions. However, they generally do not address issues of grammar teaching and semantic analysis, and are not targeted at younger students. Therefore, their use in teaching Kazakh remains limited.
Table 1 summarizes key related studies, highlighting their main contributions and limitations, and clarifying the research gap addressed in this work.
Table 1. Comparative analysis of ontology-based and AI-supported educational approaches
|
Study |
Approach |
Strength |
Limitation |
Gap Addressed in This Study |
|
[16, 17] |
Ontology-based LMS |
Content structuring, concept linking |
Not language-specific, no curriculum alignment |
Adds curriculum-aligned ontology for Kazakh |
|
[18] |
Ontology and personalization |
Adaptive recommendations |
Focus on recommendation, not grammar learning |
Adds semantic grammar feedback |
|
[12] |
Ontology from text |
Automated ontology construction |
No classroom validation |
Adds real school experiment |
|
[24] |
AI-based learning systems |
Personalization, prediction |
“Black-box” models, low interpretability |
Uses interpretable ontology reasoning |
|
[7, 11] |
Kazakh ontology resources |
Linguistic data modeling |
Not aligned with school curriculum |
Aligns ontology with grades 2–4 curriculum |
|
[19] |
Digital language policy |
Contextual insights |
No ontology-based learning model |
Provides practical ontology-based system |
A review of the literature shows that significant progress has been made in ontology-based and AI-supported educational systems, including semantic structuring of learning content and adaptive learning mechanisms. These approaches have demonstrated the potential of ontologies for organizing knowledge and supporting personalized learning processes.
However, several limitations remain. Existing systems are typically not aligned with specific school curricula, provide limited support for language-specific grammatical structures, and rarely address the complexities of agglutinative languages. In particular, most approaches do not integrate semantic and morphological analysis in a way that is suitable for primary language education.
In the context of Kazakh language learning, these limitations are especially significant. Existing studies on Kazakh language resources and ontology-based approaches remain limited and are not fully adapted to the requirements of the primary school curriculum [7, 32-37].
This study addresses this gap by developing a curriculum-aligned ontology-based system tailored to the linguistic characteristics of Kazakh and validated in a real classroom setting.
3.1 Experimental design
The experimental portion of the study utilized a pre-test/post-test design and was conducted at a comprehensive school in Astana, Kazakhstan. The total sample consisted of 200 primary school students, distributed across three grade levels.
The study involved:
Thus, the total sample consisted of 200 schoolchildren (99 girls and 101 boys), which ensured gender balance and representativeness of the data for analysis.
All participants completed standardized assessment tasks designed to evaluate six key language competencies: vocabulary, grammar, spelling, reading, writing, and text comprehension. All assessment tools were developed in accordance with the requirements of the Kazakh language state curriculum.
The experiment took place in two stages:
The experiment lasted twelve weeks, covering one full academic cycle and ensuring consistent progress in language skill development. The length of the intervention helped ensure the robustness of the findings and made it possible to evaluate the influence of the ontological learning model on students’ learning quality with greater confidence.
This study employed a quasi-experimental pre-test/post-test design without a control group; therefore, the findings suggest a positive association between the use of the ontological learning system and improvements in language performance, rather than a definitive causal relationship.
3.2 Statistical analysis
To assess the effectiveness of the ontological learning model in a quantitative manner, statistical testing was carried out using the student’s t-test for dependent samples (paired-samples t-test). This approach made it possible to detect consistent differences between the pre-test and post-test outcomes across all six evaluated competencies: vocabulary, grammar, spelling, reading, writing, and text comprehension.
Statistical significance of differences was determined at p < 0.05, which corresponds to generally accepted standards of empirical pedagogical research. For all obtained mean differences, 95% confidence intervals (CI) were additionally calculated, providing an assessment of the stability of the results and the reliability of the conclusions.
To evaluate the influence of the proposed model, Cohen’s d was calculated as an effect size measure, enabling an assessment of the practical significance of the observed improvements. Values of d = 0.6–0.8 were interpreted as a moderate effect, and d ≥ 1.0 as a high-intensity effect. The combined use of the p, CI, and d parameters provided a comprehensive assessment of the impact of the implemented ontology platform in terms of both statistical significance and the pedagogical significance of the identified improvements.
3.3 Methodological basis for the development of an ontological model based on linguistic and pedagogical analysis of Kazakh language textbooks (grades 2–4)
The methodological concept for developing the ontological model was based on the principles of systemic and content-oriented analysis of Kazakh language educational materials for elementary grades. Kazakh language textbooks for grades 2–4, hosted on the Okulyk.kz platform, served as the primary empirical base. These sources provided a representative corpus of texts reflecting the current state of linguistic and methodological approaches to teaching the Kazakh language in primary school. The development of the ontological model also builds on the authors’ and related prior research on the formalization of Kazakh morphology, ontology-based linguistic resources, and the integration of natural language processing and large language models into educational and information systems [32-37].
The construction of the ontological model was carried out in accordance with a multi-level methodological scheme (see Figure 1), including interconnected stages of analysis, formalization and integration of data:
Identification of linguistic units is the primary selection of terms, word forms and grammatical constructions that occur in educational texts with high frequency.
Classification and selection – filtering the obtained data array according to three criteria: frequency of use, pedagogical significance and compliance with the requirements of the curriculum.
Semantic analysis is the determination of inter-level connections between lexical, morphological and syntactic elements that form the basis of the semantic structure of a language.
Ontological formalization is the representation of identified concepts and relationships as formal descriptors that enable machine processing and logical inference.
Integration into an ontological structure is the unification of the resulting semantic units into a unified knowledge system compatible with the architecture of intelligent learning platforms.
As a result of the analysis of textbooks, four key analytical dimensions were identified that determine the principles of selection and structuring of material:
Thus, the developed methodological structure ensured a systematic connection of linguistic and pedagogical analysis, allowing the formalization of the content of educational texts in the form of an ontological knowledge base.
Figure 1 presents the system architecture and the sequence of stages for constructing an ontology, including data extraction, semantic structuring, and integration into an intelligent learning environment. This architecture serves as the scientific and methodological foundation for the subsequent implementation of the model in digital educational technologies.
Figure 1. System architecture for ontology construction based on Kazakh language textbooks
Figure 2 presents the overall architecture of the developed intelligent learning assistant, built on the ontological model described in the previous section (see Figure 1). The system’s architecture illustrates how users interact within the digital learning environment and shows how its functional components are interconnected.
Figure 2. Architecture of the proposed AI-based teaching assistant
Learners interact through a front-end interface - a chatbot or web application that provides a dialog-based form of communication with the system. The back-end implements knowledge processing and management modules: the Protégé ontology, a bank of learning tasks and exercises, and an analytics module that collects, analyzes, and visualizes learning data.
The functioning of all components is ensured by an orchestration layer that performs key functions of managing the learning process:
The system architecture implements a three-level model that ensures logical integrity and adaptability of the educational process:
This architecture ensures the structural consistency of all elements and supports adaptive and personalized learning, in which the learning path of each student is formed on the basis of their individual results, the dynamics of assimilation and semantic analysis of responses.
The system architecture is organized into several functional modules, including a user interface layer, a task management module, an ontology-based reasoning engine, a feedback generation module, and an analytics component. The ontology layer provides structured knowledge representation, while the rule-based reasoning module performs semantic analysis of student responses and supports feedback generation. The analytics module tracks student performance and supports adaptive decision-making. The AI-enhanced aspect of the system is reflected in adaptive feedback and dynamic task selection based on semantic analysis, rather than on machine learning models.
The system operates through a structured interaction workflow. First, the student accesses the platform via the user interface and receives a task selected from the task database based on the current topic and prior performance. The student’s response is processed by the semantic analysis module, which uses the ontology to interpret the answer and identify errors. Based on this analysis, the system generates immediate feedback, including explanations and corrections.
The platform implements a feedback loop in which each student response informs subsequent task selection. Task sequencing is determined by performance indicators, enabling adaptive progression. The system supports two primary user roles: students, who complete tasks and receive feedback, and teachers, who monitor performance and guide the learning process. Performance data are stored in the analytics module and can be used to refine rule-based reasoning and improve system effectiveness over time.
3.4 Educational information system framework
From an information systems perspective, the proposed platform can be viewed as an educational information system that integrates multiple functional components and data flows. The system processes educational content starting from textbook materials, which are formalized into an ontology structure. This ontology serves as the basis for task generation, semantic analysis, and feedback mechanisms.
The data flow follows a structured pipeline: educational content is transformed into ontological representations, which are then used to generate learning tasks. Students interact with these tasks through the user interface, and their responses are processed by the semantic analysis module. The results are stored in the analytics module, which supports performance tracking and informs subsequent task selection. Teachers access this information to monitor student progress and adjust instructional strategies.
The system supports two primary user roles: students and teachers. Students engage with learning tasks and receive feedback, while teachers use the analytics module to evaluate performance and guide the learning process.
Functionally, the platform includes semantic querying based on ontology structures, grammar correction through rule-based reasoning, performance-based task recommendation, and learning analytics for tracking progress. These integrated components form a coherent educational information system that supports structured and data-driven language learning.
3.5 Ontology development and structuring
The developed Kazakh language ontology for primary education integrates more than 400 terms and concepts, systematically classified according to grammatical, lexical, syntactic, and functional properties. The structuring process resulted in the following components:
Figure 3 illustrates the schematic structure of the ontology, including the term selection process based on three key criteria-frequency of occurrence, pedagogical relevance, and curriculum alignment-and their integration into the final ontological model.
This structured approach ensures both semantic coherence and practical applicability, making the ontology relevant not only for classroom use by educators but also for developers of intelligent educational platforms. The choice of an ontology-based approach, rather than a generic knowledge graph, is motivated by its formal logical structure and interpretability, which ensure transparent reasoning and consistent knowledge representation within the educational domain.
Figure 3. Lexical-grammatical units and explanatory terms in the ontology
Beyond nominal categories, the ontology provides a detailed representation of verbal concepts. For instance, the concept Dara_etistik (simple verb) is modeled as a subclass of Etistik_2 (Verb, Grade 2) and Morfologia_2 (Morphology, Grade 2). Semantic relations specify the typical question patterns used to identify verbs in Kazakh (e.g., “What does he/she do?”, “What did he/she do?”, “What happened?”). These verbal units are explicitly linked to syntactic roles such as the predicate (baiandauysh) and are integrated into broader lexical-grammatical structures. Each instance of a simple verb (e.g., jazdy – “wrote”) is represented as an individual within the ontology, enabling precise semantic reasoning. Logical axioms formalize subclass hierarchies, supporting automated classification and inheritance of features. This design facilitates the systematic identification of verb types and their usage contexts, thereby contributing to the development of grammar-aware learning technologies (Figure 4).
Figure 4. Concept “simple verb”
The ontology also supports the unification of Kazakh language educational content for primary school. By linking lexical and grammatical units with explanatory elements, it enables automated processing of textbook materials, the generation of grammar and vocabulary tasks, and the design of individualized learning pathways.
Furthermore, the ontology captures key semantic relations, including antonymy, synonymy, and hyponymy. For example, in Figure 5 the object property antonimı (antonym) defines a link between words with opposite meanings, described as “tılımızdegı mağynasy qarama-qarsy sözder” (“words with opposite meanings in our language”). These semantic relations are crucial for interpreting meaning in context and for supporting intelligent applications capable of analyzing and generating coherent Kazakh texts.
Figure 5. Fragments of object properties
By complementing the semantic properties of objects, the ontology integrates data properties that reflect specific attributes of lexical units. For example, the property "bolady" (can be/has) encodes grammatical features such as syntactic roles, punctuation marks, or associated morphological forms. These data properties are necessary for representing the rules and patterns of the Kazakh language, enabling automated reasoning and content generation on educational platforms (Figure 6).
Figure 6. Representation of the data property “bolady” and its related attributes used to define lexical and grammatical features of terms in the ontology
From a lexical perspective, ontology reflects both word meanings and word-formation processes. This facilitates the identification of semantic characteristics of lexical units. As illustrated in Figure 7, the word körıktı (beautiful) is defined as a synonym of ädemı (charming), exemplifying how the ontology formalizes semantic equivalence between terms.
Explanatory terms are modeled as key elements of textual and verbal communication, structuring language knowledge in terms of its practical use in communicative contexts. For example, within the ontology, the term mätın (text) is classified as an explanatory term. Its definition is given through annotations, and different text types are represented as subclasses. The relationship between a text and its title is expressed through equivalence, with the presence of a title considered an essential property of a text in this ontological structure (Figure 8).
Figure 7. Visualization of synonymic relationships in the ontology
Figure 8. Ontological representation of the concept “Mätın” (Text)
A key challenge in developing the ontology was ensuring alignment of terms across multiple grade levels, such as between Grade 2 and Grade 4. This was addressed by modeling intersections of grammatical levels, allowing the ontology to capture both continuity and the increasing complexity of language mastery expected at successive stages of education.
Furthermore, using descriptive logic (DL) queries allows for the automatic identification of specific grammatical features. For example, words containing plural forms can be extracted, as shown in Figure 9. This functionality is based on linguistic data encoded in the ontology and supports the detection of plural suffixes, thereby improving the semantic processing of grammatical rules in educational applications.
Figure 9. Identification of plural form words using a descriptive logic (DL) query
Figure 10 illustrates the extraction of a root morpheme (stem) from the linguistic database using a SPARQL query. This method enables the analysis of word structure by identifying its morphological components, particularly the root to which affixes are attached.
By integrating SPARQL queries into the ontological model, the system functions not only as a repository of linguistic data but also as a tool for intelligent processing. This functionality is particularly useful for tasks involving automatic word parsing and morphological analysis in educational contexts. In addition, it enhances the understanding of word-formation mechanisms and supports the creation of adaptive learning tools for teaching Kazakh morphology.
In summary, the integration of SPARQL queries for automatic root morpheme extraction significantly expands the system's capabilities for intelligent processing of linguistic data. This functionality makes the ontology a valuable resource for teaching and learning Kazakh language morphology, while supporting adaptive and data-driven educational approaches.
Figure 10. Root extraction of words via SPARQL query
The selected models were adopted for their suitability for analyzing competencies, ensuring consistency in learning outcomes, and providing accurate assessment results. Compared to alternative methods, these models provide transparency, robustness to incomplete educational data, and flexibility in integrating ontological structures. Traditional statistical approaches were deemed insufficient, and deep learning methods were excluded due to their "black box" nature. Overall, the selected models adhere to the principles of competency-based assessment and are suitable for integration into real-world educational environments.
4.1 Demonstration of the system functionality
The functionality of the developed ontological learning platform was tested through a series of controlled user interaction scenarios. During the pilot phase, a demonstration of the chatbot's operation was conducted, based on an ontological model that provides semantic analysis and automated interactive evaluation of student responses.
Figure 11 illustrates an example of a typical interaction scenario in which the system formulates the question: "What part of speech is the word 'wrote'?" The student enters the answer through the interface, after which the semantic analysis module, based on the ontology, classifies the task as grammatical and automatically checks the correctness of the answer.
Two typical cases of interaction were identified:
This interaction scheme demonstrates the implementation of dynamic feedback, in which students receive immediate feedback on their answers, explanations of errors, and the opportunity to re-attempt the task. This mechanism promotes the development of conscious grammar skills and increases motivation for learning.
Figure 11. Ontology-based chatbot interaction flow for grammar question evaluation
The ontological learning system was tested in a twelve-week experiment involving 200 students in grades 2–4. Throughout the entire experiment, the system demonstrated stable operation, correctly identified student responses, and generated semantically sound feedback in accordance with the established ontology rules.
The results showed a consistent increase in scores across all six language competencies, indicating that ontologically guided reasoning can effectively support the acquisition of the Kazakh language at the primary level.
4.2 Experimental evaluation in a school setting
Following successful testing of its functional characteristics, the system was implemented in a real-life educational environment at a comprehensive school in Astana, Republic of Kazakhstan. The study involved 200 primary school students (grades 2–4), representing a typical sample of students in this age group.
The ontological platform was used in Kazakh language lessons and aimed to develop six key aspects of language literacy: vocabulary, grammar, spelling, reading, writing, and text comprehension. Thus, the experiment covered both receptive and productive language skills.
The pilot study lasted twelve weeks, with lessons held twice a week, 40–45 minutes each. The ontological system was implemented in a natural learning environment, without disrupting the existing curriculum structure. Teachers acted not only as observers but also as active facilitators, guiding students through the chatbot, directing their attention to grammatical patterns, and encouraging conscious language use.
To assess learning effectiveness, standardized pre- and post-tests, developed in accordance with the national curriculum, were used before and after the experiment. The system provided dynamic feedback, allowing students to instantly receive information about the correctness of their answers, analyse errors, and independently adjust their learning strategies.
According to teachers, the implementation of the ontological platform has contributed to increased learning motivation, the development of independence, and increased student engagement in the classroom. Furthermore, an improvement in the quality of language interaction was noted: students began using the digital interface more frequently for self-assessment and reinforcement of their learning.
The obtained data confirm that the use of an ontologically controlled learning system is effectively integrated into traditional school practice, ensuring the growth of linguistic competence and an increase in the cognitive activity of students.
4.3 Descriptive and statistical results
Descriptive and statistical results by class and gender.
To assess how well the ontological learning platform performed, the results were examined in detail, taking into account differences across both grade levels and gender. The study involved 200 primary school students (grades 2–4) from a school in Astana, 99 girls and 101 boys. The analysis covered six key competencies: vocabulary, grammar, spelling, reading, writing, and text comprehension.
Table 2 presents the results obtained for second-grade students (n = 65). The analysis revealed a steady increase in scores across all language areas. Girls demonstrated greater improvement in spelling and writing, while boys demonstrated greater improvement in grammar and text comprehension.
Table 2. Comparative analysis of the growth of language competencies in second-grade students
|
Competence |
Girls (n = 34) Δ% |
Boys (n = 31) Δ% |
P-Value |
Cohen’s d |
Interpretation |
|
Vocabulary |
+16.5 |
+15.9 |
<0.05 |
0.41 |
Moderate effect |
|
Grammar |
+17.2 |
+19.5 |
<0.05 |
0.48 |
Average effect |
|
Spelling |
+15.8 |
+13.4 |
<0.05 |
0.39 |
Minor difference |
|
Reading |
+14.1 |
+13.6 |
<0.05 |
0.28 |
Minor difference |
|
Writing |
+15.9 |
+13.8 |
<0.05 |
0.33 |
Average effect |
|
Understanding the text |
+15.5 |
+16.4 |
<0.05 |
0.30 |
Uniform progress |
The analysis revealed statistically significant improvements across all competencies (p < 0.05). The largest gains were observed in grammar (+18.3%) and vocabulary (+16.2%), reflecting the early development of basic language skills.
Table 3 presents the results obtained for third-grade students (n = 68). Analysis showed that the gains in scores were more balanced than in other grades. Girls demonstrated higher scores in reading and writing, while boys demonstrated higher scores in grammar and text comprehension.
Table 3. Comparative evaluation of language competency development among third-grade students
|
Competence |
Girls (n = 33) Δ% |
Boys (n = 35) Δ% |
P-Value |
Cohen’s d |
|
Vocabulary |
+17.9 |
+16.4 |
<0.01 |
0.42 |
|
Grammar |
+18.4 |
+19.1 |
<0.01 |
0.51 |
|
Spelling |
+16.3 |
+15.1 |
<0.05 |
0.36 |
|
Reading |
+18.8 |
+17.5 |
<0.01 |
0.47 |
|
Writing |
+19.3 |
+16.8 |
<0.01 |
0.52 |
|
Understanding the text |
+18.5 |
+19.4 |
<0.01 |
0.49 |
The average increase was approximately +18.0–19.0%, with statistically significant differences across all categories (p < 0.05). The Cohen’s d values ranged from 0.36 to 0.52, indicating a moderate effect.
Table 4 presents the results obtained for fourth-grade students (n = 67). The analysis revealed the most pronounced improvement compared to other grades. Girls demonstrated greater improvement in writing and vocabulary, while boys showed higher gains in grammar and spelling.
Average Cohen’s d values were around 0.45–0.50, indicating a moderate effect of the implemented ontological platform.
To ensure a comprehensive presentation of results across all levels of education, a consolidated comparative analysis was conducted, combining data across three grades. The final values are presented in Table 5, which reflects the dynamics of average gains, the range of statistical significance, and the nature of the pedagogical effect.
Table 4. Changes in language competency indicators among fourth-grade students by gender
|
Competence |
Girls (n = 32) Δ% |
Boys (n = 35) Δ% |
P-Value |
Cohen’s d |
|
Vocabulary |
+17.9 |
+16.4 |
<0.01 |
0.42 |
|
Grammar |
+18.4 |
+19.1 |
<0.01 |
0.51 |
|
Spelling |
+16.3 |
+15.1 |
<0.05 |
0.36 |
|
Reading |
+18.8 |
+17.5 |
<0.01 |
0.47 |
|
Writing |
+19.3 |
+16.8 |
<0.01 |
0.52 |
|
Understanding the text |
+18.5 |
+19.4 |
<0.01 |
0.49 |
Table 5. Summary analysis for all grades
|
Grade |
Average Growth (%) |
Average Cohen’s d |
P-Value Range |
Characteristics of the Effect |
|
2nd grade |
+16.5 |
0.87 |
<0.05 |
Moderate effect |
|
3rd grade |
+18.8 |
0.94 |
<0.01 |
Moderate to strong effect |
|
4th grade |
+21.0 |
1.12 |
<0.001 |
Strong effect |
To summarize the data obtained and compare results across classes and gender groups, an integrated analysis of student performance dynamics was conducted. This stage of the study not only confirmed the effectiveness of the implemented ontological platform at different educational levels but also identified the specifics of its impact depending on the age and gender of students.
Table 6. Average results (in % of the maximum possible score) before and after the implementation of the platform
|
Grade |
Gender |
Pre-Test Mean (%) |
Post-Test Mean (%) |
Growth (%) |
P-Value |
Cohen’s d |
|
2nd grade |
Girls (n=34) |
61.2 |
77.4 |
+16.2 |
<0.05 |
0.86 |
|
Boys (n=31) |
59.8 |
76.5 |
+16.7 |
<0.05 |
0.88 |
|
|
3rd grade |
Girls (n=33) |
63.4 |
82.2 |
+18.8 |
<0.01 |
0.95 |
|
Boys (n=35) |
62.7 |
81.4 |
+18.7 |
<0.01 |
0.93 |
|
|
4th grade |
Girls (n=32) |
65.8 |
86.5 |
+20.7 |
<0.001 |
1.10 |
|
Boys (n=35) |
64.9 |
86.2 |
+21.3 |
<0.001 |
1.14 |
Summary statistics are provided in Table 6. The table presents the average pre-test and post-test scores expressed as a percentage of the maximum attainable score, along with the corresponding gains, the statistical significance levels (p-values), and the effect sizes (Cohen’s d).
Pre- and post-test results indicate statistically significant increases across all grades and both gender groups. The largest gains were recorded for fourth-grade students, where the average overall score increased by approximately 21%, with a large effect size (d > 1.1). In third grade, there was a steady improvement in text comprehension and writing (an increase of approximately 19%), while in second grade, there was a strengthening of basic language skills-vocabulary and grammar (an increase of approximately 16%). Differences between girls and boys were minimal across all grades, indicating a consistent pattern of improvement in both groups.
Figure 12. Mean pre- and post-test results by grade and gender
Figure 13. Δ Improvement in performance by grade and gender
The graphics presented in Figures 12 and 13 allow visualizing the identified trends and confirm the statistical conclusions drawn during the analysis. The graphs demonstrate a steady increase in indicators across all grades and gender subgroups, and also confirm the absence of significant differences between the results of girls and boys. The most pronounced positive dynamics are observed among fourth-grade students, which correlates with the results presented in Tables 4-6 and indicates the cognitive maturity of this age group. Thus, the data visualization provides further confirmation of the effectiveness of the implemented ontological learning platform and its potential to support adaptive language learning in elementary school.
Analysis of the data presented in Figures 12 and 13 reveals a steady improvement in academic performance across all grades and gender groups. Average scores show a steady increase from second to fourth grade, indicating the progressive development of language competencies and demonstrating the positive impact of integrating the ontology-based learning platform into the educational process.
The largest gains were observed among fourth-grade students. The combined increase in six language competencies exceeded 20%, indicating a link between students' cognitive development and their ability to consciously apply linguistic knowledge.
Table 7 provides a detailed statistical summary of pre-test and post-test results across all competencies. The results confirm statistically significant improvements, supported by t-test values, confidence intervals, and effect sizes.
Table 7. Statistical summary of pre-test and post-test results across competencies (overall sample, n = 200)
|
Competency |
Pre-Test Mean (%) |
Post-Test Mean (%) |
SD |
Mean Diff |
t |
df |
95% CI |
Cohen’s d |
|
Vocabulary |
62.1 |
78.3 |
8.5 |
16.2 |
5.12 |
199 |
[12.1 – 20.3] |
0.86 |
|
Grammar |
63.0 |
80.8 |
8.9 |
17.8 |
5.45 |
199 |
[13.4 – 22.2] |
0.91 |
|
Spelling |
61.5 |
76.9 |
7.8 |
15.4 |
4.98 |
199 |
[11.6 – 19.2] |
0.82 |
|
Reading |
62.7 |
78.9 |
8.1 |
16.2 |
5.08 |
199 |
[12.0 – 20.4] |
0.85 |
|
Writing |
63.3 |
79.5 |
8.3 |
16.2 |
5.10 |
199 |
[12.2 – 20.2] |
0.87 |
|
Text comprehension |
62.9 |
79.8 |
8.7 |
16.9 |
5.30 |
199 |
[12.8 – 21.0] |
0.89 |
A comparative analysis by gender revealed no statistically significant differences between the results of girls and boys. This demonstrates the balanced pedagogical impact and versatility of the proposed approach. The obtained data are consistent with the statistical results presented in Tables 4-6 and confirm that the developed ontological system promotes the development of holistic language skills, cognitive activity, and academic independence in primary school students.
This study aimed to develop and test an ontology-based model for teaching the Kazakh language in primary school. The results indicate that organizing learning content through semantic structures can improve both the presentation and understanding of educational material. The comparison of pre-test and post-test results showed consistent improvements across all language competencies.
These findings are generally consistent with previous studies on the use of ontologies in education [16, 18], which also report positive learning outcomes. At the same time, this study focuses specifically on the Kazakh language and applies the approach in a primary school context. The results suggest that structured semantic models can support the development of language skills and enhance learners’ understanding.
For RQ1, the analysis showed that nouns, verbs, adjectives, and basic sentence structures are the most important elements at the early stage of learning. These components formed the foundation of the ontology and were aligned with the school curriculum.
For RQ2, the study identified several types of relationships in the ontology, including hierarchical (e.g., class–subclass), associative (e.g., synonym, antonym), and explanatory links. These relationships help represent language knowledge in a structured and accessible way for learners.
For RQ3, students who used the ontology-based system showed improvements in all six language competencies, especially in grammar, vocabulary, and text understanding. This suggests that such systems can support individualized learning and provide structured feedback to students.
The observed learning gains can be linked to specific mechanisms of the system. Improvements in grammar can be associated with immediate semantic feedback, which allows students to identify and correct errors in real time. Gains in vocabulary and text comprehension are likely supported by the structured organization of linguistic concepts within the ontology, which helps learners understand relationships between words and meanings. In addition, the task sequencing mechanism, based on student performance, enables targeted practice, reinforcing previously learned material and addressing individual difficulties. Together, these features provide a coherent explanation of how the system contributes to the observed improvements.
The more pronounced improvement observed among fourth-grade students can be explained by both cognitive and pedagogical factors. At this stage, learners typically demonstrate a higher level of metalinguistic awareness and are better able to consciously apply grammatical rules and semantic relationships. In addition, the ontology-based system relies on semantic reasoning and structured feedback, which require a certain level of cognitive development to be fully effective. As a result, older students are more capable of benefiting from feedback based on their performance and concept-based task design, leading to stronger learning gains compared to lower grades.
From a teaching perspective, the use of ontology-based materials can help students become more independent and aware of language structures. It can also support teachers by making it easier to identify common mistakes and guide students more effectively.
However, several limitations should be noted. First, the study did not include a control group, so the results cannot be interpreted as a clear cause-and-effect relationship. The improvements may also be influenced by other factors, such as normal learning progression or teacher support. Second, the experiment was relatively short, and the model currently focuses only on text-based materials.
Future research should include a control group, extend the duration of the study, and explore the use of audio and visual data. This would allow a more complete evaluation of the approach.
Overall, the results indicate that ontology-based methods represent a promising direction for the development of digital tools for language learning.
The study provides preliminary evidence that the ontological approach may be effective for teaching Kazakh at the primary school level. The resulting ontology, based on educational materials for grades 2–4, shows potential pedagogical value and indicates that formalized semantic modeling can facilitate the structuring of educational content and support the development of language skills.
The results of an experiment involving 200 students from Astana schools showed statistically significant improvements in all six language skills-vocabulary, grammar, spelling, reading, writing, and text comprehension. The average improvement ranged from 16% to 21%, while the corresponding Cohen’s d values fell between 0.39 and 1.14, indicating effect sizes from moderate to large. These findings suggest a positive pedagogical impact of the proposed platform, although further studies are needed to confirm its effectiveness.
To our knowledge, this study presents one of the first curriculum-aligned ontology-based models for Kazakh language learning, implemented within a digital learning platform. The model provides an interpretable link between linguistic categories, learning objectives, and digital learning tools, making it applicable to both automated learning systems and traditional school practices. The study’s practical contribution is that the system can support:
Despite the positive results, the study has certain limitations. The study did not include a control group, which limits the ability to draw causal conclusions. The observed improvements may also be influenced by other factors, such as normal learning progression or teacher effects. In addition, the model relies primarily on text data and requires further development to incorporate multimodal components (audio, visual data, speech corpora). Furthermore, the duration of the experiment (twelve weeks) limits the ability to assess long-term effects. Promising areas for future research include:
Overall, this work suggests that ontological methods can be a useful and promising tool for digital linguodidactics, supporting structured and culturally relevant language learning.
This research is funded by the Science Committee of the Ministry of Education and Science of the Republic of Kazakhstan under Grant AP19678613.
Informed consent was obtained from the parents or legal guardians of all participating students prior to the study. The research was conducted with the approval of the school administration and in accordance with ethical standards for research involving minors. Participation was voluntary, and all data were anonymized to ensure the privacy and confidentiality of the students. The study was carried out in a regular classroom setting as part of the standard educational process.
[1] Common Sense Media. (2020). The common sense census: Media use by kids age zero to eight, 2020. https://www.commonsensemedia.org/research/the-common-sense-census-media-use-by-kids-age-zero-to-eight-2020.
[2] Twenge, J.M., Campbell, W.K. (2018). Associations between screen time and lower psychological well-being among children and adolescents: Evidence from a population-based study. Preventive Medicine Reports, 12: 271-283. https://doi.org/10.1016/j.pmedr.2018.10.003
[3] Rideout, V., Peebles, A., Mann, S., Robb, M.B. (2022). The common sense census: Media use by tweens and teens, 2021. San Francisco, CA: Common Sense. https://www.commonsensemedia.org/research/the-common-sense-census-media-use-by-tweens-and-teens-2021.
[4] Toffler, A., Yousafzai, M. (2020). Embracing a culture of lifelong learning: Contribution to the futures of education initiative. Report: A transdisciplinary expert consultation. UNESCO Institute for Lifelong Learning, Hamburg, Germany. https://unesdoc.unesco.org/ark:/48223/pf0000374112.
[5] UNESCO. (2011). Guidelines for open educational resources (OER) in higher education. https://unesdoc.unesco.org/ark:/48223/pf0000213605.
[6] Kadyrbek, N., Mansurova, M., Shomanov, A., Makharova, G. (2023). The development of a Kazakh speech recognition model using a convolutional neural network with fixed character level filters. Big Data and Cognitive Computing, 7(3): 132. https://doi.org/10.3390/bdcc7030132
[7] Khairova, N., Petrasova, S., Mamyrbayev, O., Mukhsina, K. (2020). Open information extraction as additional source for Kazakh ontology generation. In Intelligent Information and Database Systems. ACIIDS 2020. Lecture Notes in Computer Science(), pp. 86-96. https://doi.org/10.1007/978-3-030-41964-6_8
[8] Létourneau, A., Deslandes Martineau, M., Charland, P., Karran, J.A., Boasen, J., Léger, P.M. (2025). A systematic review of AI-driven intelligent tutoring systems (ITS) in K-12 education. npj Science of Learning, 10: 29. https://doi.org/10.1038/s41539-025-00320-7
[9] Park, H., Oh, K.J., Choi, H.J., Gweon, G. (2019). Constructing a paraphrase database for agglutinative languages. Data & Knowledge Engineering, 123(C): 101604. https://doi.org/10.1016/j.datak.2017.07.007
[10] Kodan, H., Kara, K. (2021). Grammar teaching in the Turkish language course curriculum: An examination in the context of acquisitions, activity, and teachers' opinions. Journal of Curriculum and Teaching, 10(2): 28-41. https://eric.ed.gov/?id=EJ1308944.
[11] Khairova, N., Mamyrbayev, O., Mukhsina, K., Kolesnyk, A. (2020). Logical-linguistic model for multilingual Open Information Extraction. Cogent Engineering, 7(1): 1714829.
[12] Zouaq, A., Nkambou, R. (2008). Building domain ontologies from text for educational purposes. IEEE Transactions on Learning Technologies, 1(1): 49-62. https://doi.org/10.1109/TLT.2008.12
[13] Er, O., Saritiken, H. (2022). Perceptions of middle school Turkish language teachers on using e-learning tools in grammar teaching. International Journal of Education and Literacy Studies, 10(4): 9-18. https://eric.ed.gov/?id=EJ1367172.
[14] Lin, C.C., Huang, A.Y.Q., Lu, O.H.T. (2023). Artificial intelligence in intelligent tutoring systems toward sustainable education: A systematic review. Smart Learning Environments, 10: 41. https://doi.org/10.1186/s40561-023-00260-y
[15] Awang, L.A., Yusop, F.D., Danaee, M. (2025). Current practices and future direction of artificial intelligence in mathematics education: A systematic review. International Electronic Journal of Mathematics Education, 20(2): em0823. https://doi.org/10.29333/iejme/16006
[16] Ghanim, H.A.A., Kovács, L. (2021). Development of ontology-based model to support learning process in LMS. Indonesian Journal of Electrical Engineering and Computer Science, 24(1): 507-518. http://doi.org/10.11591/ijeecs.v24.i1.pp507-518
[17] Ahmed, G.H.A., Kovács, L. (2020). Ontology domain model for e-tutoring system. Journal of Software Engineering and Intelligent Systems, 5(1): 37-44.
[18] Wongthongtham, P., Chan, K.Y., Potdar, V., Abu-Salih, B., Gaikwad, S., Jain, P. (2018). State-of-the-art ontology annotation for personalised teaching and learning and prospects for smart learning recommender based on multiple intelligence and fuzzy ontology. International Journal of Fuzzy Systems, 20: 1357-1372. https://doi.org/10.1007/s40815-018-0467-6
[19] Tlepbergen, D., Akzhigitova, A., Zabrodskaja, A. (2022). Language-in-education policy of Kazakhstan: Post-pandemic technology enhances language learning. Education Sciences, 12(5): 311. https://doi.org/10.3390/educsci12050311
[20] Bhardwaj, S., Ozcelebi, T., Lukkien, J.J., Lee, K.M. (2018). Semantic interoperability architecture for smart spaces. International Journal of Fuzzy Logic and Intelligent Systems, 18(1): 50-57. https://doi.org/10.5391/IJFIS.2018.18.1.50
[21] Bratland, E., El Ghami, M. (2022). Epistemic and nonepistemic design in textbooks in social studies for lower secondary school: Do the textbooks facilitate knowledge building and in-depth learning? Education Research International, 2022(1): 1692452. https://doi.org/10.1155/2022/1692452
[22] Kim, Y., Lee, Y.H., Lee, H., Lim, S.M. (2022). Alignment of concepts of meiosis among curriculum, textbooks, classroom teaching and assessment in upper secondary school in Republic of Korea. Journal of Baltic Science Education, 21(2): 232-244. https://doi.org/10.33225/jbse/22.21.232
[23] Jian, M.J.K.O. (2023). Personalized learning through AI. Advances in Engineering Innovation, 5: 16-19. https://doi.org/10.54254/2977-3903/5/2023039
[24] Murtaza, M., Ahmed, Y., Shamsi, J.A., Sherwani, F., Usman, M. (2022). AI-based personalized e-learning systems: Issues, challenges, and solutions. IEEE Access, 10: 81323-81342. https://doi.org/10.1109/ACCESS.2022.3193938
[25] Ehsan, H.R.U., Khan, R.A., Yasmeen, R., Arif, M. (2024). Development and validation of an AI-based model to predict the assessment outcomes of pre-clinical MBBS/BDS students. Research Square. https://doi.org/10.21203/rs.3.rs-5277982/v1
[26] Li, J., Yu, S.M., Zhang, T.C. (2024). Learning path recommendation based on reinforcement learning. Engineering Letters, 32(9): 1823-1832. https://www.engineeringletters.com/issues_v32/issue_9/EL_32_9_08.pdf.
[27] Jiang, H., Yang, D. (2020). Learning graph-based embedding from EHRs for time-aware patient similarity. Engineering Letters, 28(4): 1254-1262. https://www.engineeringletters.com/issues_v28/issue_4/EL_28_4_34.pdf.
[28] Tang, J.L., Pan, Y., Wang, Z.H., Zhu, L.L. (2018). Ontology optimization algorithm for similarity measuring and ontology mapping using adjoint graph framework. Engineering Letters, 26(3): 386-395. https://www.engineeringletters.com/issues_v26/issue_3/EL_26_3_12.pdf.
[29] Naseer, F., Khan, M.N., Tahir, M., Addas, A., Aejaz, S.M.H. (2024). Integrating deep learning techniques for personalized learning pathways in higher education. Heliyon, 10(11): e32628. https://doi.org/10.1016/j.heliyon.2024.e32628
[30] Shenkoya, T., Kim, E. (2023). Sustainability in higher education: Digital transformation of the fourth industrial revolution and its impact on open knowledge. Sustainability, 15: 2473. https://doi.org/10.3390/su15032473
[31] Lynn, N.D., Emanuel, A.W.R. (2021). Using data mining techniques to predict students’ performance: A review. IOP Conference Series: Materials Science and Engineering, 1096: 012083. https://doi.org/10.1088/1757-899X/1096/1/012083
[32] Zhetkenbay, L., Sharipbay, A., Razakhova, B., Bekmanova, G., Barlybayev, A., Nazyrova, A., Yergesh, B. (2025). Formalization of morphological rules for kazakh nouns in the new Latin alphabet. Journal of Applied Data Sciences, 6(3): 1999-2019. https://doi.org/10.47738/jads.v6i3.820
[33] Mukanova, A., Milosz, M., Dauletkaliyeva, A., Nazyrova, A., Yelibayeva, G., Kuzin, D., Kussepova, L. (2024). LLM-powered natural language text processing for ontology enrichment. Applied Sciences, 14(13): 5860. https://doi.org/10.3390/app14135860
[34] Nazyrova, A., Bekmanova, G., Omarbekova, A., Zulkhazhav, A., Yelibayeva, G. (2023). The use of a morphological analyzer in assessing the achievements of learning outcomes. In 2023 8th International Conference on Computer Science and Engineering (UBMK), Burdur, Turkiye, pp. 294-299. https://doi.org/10.1109/UBMK59864.2023.10286708
[35] Yelibayeva, G., Razakhova, B., Yergesh, B., Sharipbay, A., Bekmanova, G., Mukanova, A. (2022). Modeling of verb phrases of the Kazakh language. In 2022 International Conference on Engineering & MIS (ICEMIS), Istanbul, Turkey, pp. 1-3. https://doi.org/10.1109/ICEMIS56295.2022.9914015
[36] Yelibayeva, G., Sharipbay, A., Bekmanova, G., Omarbekova, A. (2021). Ontology-based extraction of Kazakh language word combinations in natural language processing. In International Conference on Data Science, E-learning and Information Systems 2021, Ma'an Jordan, pp. 58-59. https://doi.org/10.1145/3460620.3460631
[37] Mukanova, A., Nazyrova, A., Zulkhazhav, A., Lamasheva, Z., Dauletkaliyeva, A. (2025). Development of an intelligent information retrieval system based on ontology, linguistic algorithms and large language models. Applied Sciences, 15(22): 12271. https://doi.org/10.3390/app152212271