Most research on virtual reality in education looks at short bursts: a 20-minute intervention here, a one-hour session there. These snapshots tell us whether students find VR engaging (they usually do) and whether they remember content immediately afterward (they usually do). But what happens when VR becomes a regular part of a course, week after week, for an entire semester?
A new study published in Frontiers in Virtual Reality (Wu, Klassen & White, 2026) answers that question with a semester-long ethnographic study of VR in undergraduate neuroanatomy. The findings are illuminating, and for anyone building or purchasing educational technology, they carry an important message: having impressive technology is not enough.
What the Researchers Actually Did
Unlike typical VR studies that measure pre-test and post-test scores after a brief session, Wu and colleagues embedded themselves in an undergraduate neuroanatomy course at a California university for an entire semester. Students used VR headsets regularly in their laboratory sessions to explore 3D brain structures, rotating them, zooming in, examining neural pathways from angles impossible with physical models or 2D diagrams.
The researchers used two theoretical frameworks to analyse what they observed. The first, TPACK (Technological Pedagogical Content Knowledge), examines what teachers know and how they integrate technology, pedagogy, and subject matter. The second, EVT (Expectancy-Value Theory), looks at student motivation: do students expect to succeed? Do they see value in the task? What costs (effort, frustration, confusion) do they experience?
The combination is clever. TPACK tells us about the teacher side. EVT tells us about the student side. Together, they reveal whether VR implementation actually works in practice.
The Instructor Understood VR's Potential. But Not How to Teach With It.
Here is where things get interesting. The instructor in this study had strong Technological Content Knowledge. They understood why VR was good for neuroanatomy: three-dimensional visualisation of complex brain structures, the ability to manipulate and rotate models, seeing spatial relationships that flat diagrams cannot convey. The what of VR was clear.
But the instructor's Technological Pedagogical Knowledge was weak. They understood what VR could show. They did not understand how to teach with it. There was no structured guidance for students on how to use the VR environment productively. There were no scaffolded learning activities that built from simple to complex. The assumption seemed to be that the technology itself would do the teaching.
This gap, strong TCK but weak TPK, appears frequently in educational technology adoption. Enthusiastic teachers see the potential of a new tool. They understand its capabilities. But translating those capabilities into effective pedagogy requires a different kind of knowledge, and that knowledge is often missing.
Students Felt Lost, Frustrated, and Unsure Why They Were Doing This
The student experience reflected the instructor's TPK gap. Using the EVT framework, the researchers found that students experienced:
- Low expectancy beliefs: Students were unsure whether they could succeed with VR or what success even looked like
- Unclear utility value: Why were they using VR instead of traditional methods? How would this help them as future healthcare professionals?
- High "cost": The effort required to learn the VR system, the cognitive load of navigating an unfamiliar interface, and the frustration when things did not work as expected
Some students did find the VR engaging. The intrinsic interest value was there, at least initially. But that novelty-driven interest could not overcome the accumulated frustrations. When students do not know why they are doing something, when success criteria are vague, and when the interface creates more confusion than clarity, even impressive technology fails to deliver learning.
The researchers describe a misalignment between instructor assumptions and student reality. The instructor assumed students would intuitively know how to learn from VR. Students needed much more guidance than they received.
Technology Without Pedagogy Is Just Expensive Hardware
This study joins a growing body of evidence that educational technology succeeds or fails based on its pedagogical design, not its technical capabilities. You can have the most sophisticated VR hardware in the world. If instructors do not know how to teach with it, and if students do not receive adequate scaffolding, the technology becomes an expensive distraction rather than a learning tool.
The researchers propose a TPACK-EVT integration framework that connects instructor knowledge to student motivation. When instructors have strong Technological Pedagogical Knowledge, they design activities that give students clear expectations (raising expectancy beliefs), explicit connections to career goals (raising utility value), and appropriate support structures (reducing perceived cost). The two frameworks are not independent. What teachers know shapes how students experience learning technology.
How WhimsyLabs Addresses the TPK Gap
We read research like this and think: this is exactly why we designed WhimsyLabs the way we did. We knew from the start that giving schools virtual lab software was not enough. The software itself had to embed the pedagogy.
Our AI tutor, WhimsyCat, exists specifically to address the TPK gap. Teachers using WhimsyLabs do not need to figure out how to teach with virtual labs because the scaffolding is built into the experience. WhimsyCat provides real-time guidance to students: explaining what they are doing and why, prompting them when they get stuck, asking questions that develop scientific thinking, and adjusting to individual pace and confusion levels.
This directly addresses the EVT concerns the study identified:
- Expectancy beliefs: WhimsyCat tells students what they are working toward and confirms when they are on track. Success criteria are explicit, not assumed.
- Utility value: Experiments connect to real-world applications. Students see why proper pipette technique matters. They understand how titration relates to pharmaceutical testing or environmental monitoring.
- Cost reduction: The interface is browser-based, runs on Chromebooks, and requires no VR headset learning curve. When students struggle, WhimsyCat notices and helps immediately rather than letting frustration compound.
We also address the "semester-long" problem the study highlights. WhimsyLabs is designed for sustained use, not one-off demos. Our process-based assessment captures how students work across multiple sessions, tracking technique development and experimental thinking over time. Teachers get dashboards showing learning trajectories, not just single-session snapshots.
Pedagogy First, Technology Second
The Wu study reinforces something we believe strongly: technology should serve pedagogy, not the other way around. It is tempting for schools to buy impressive hardware and assume learning will follow. It rarely works that way.
When evaluating any educational technology, ask these questions:
- Does the product require teachers to develop new pedagogical knowledge? Or does it embed that pedagogy in the experience?
- Does it provide scaffolding for students? Or does it assume students will figure things out?
- Does it reduce cognitive load and frustration? Or does it add new barriers?
- Can it be used sustainably across a term or year? Or is it a novelty that wears off?
VR and other immersive technologies have genuine potential for science education. Three-dimensional visualisation of molecular structures, anatomical systems, and physics concepts offers advantages that flat screens cannot match. But that potential only becomes actual learning when the technology is wrapped in thoughtful pedagogical design.
From Research to Practice
Studies like this one are valuable because they move beyond "does VR work?" to "how and why does VR work or fail?" The short-term intervention studies have their place, but they miss the complexities that emerge over sustained use. Students can tolerate confusion for 30 minutes. When confusion persists for weeks, it undermines motivation and learning.
For educators and administrators considering virtual lab software, this research offers a clear warning: do not assume that sophisticated technology automatically produces sophisticated learning. Ask vendors how their products support teachers pedagogically. Ask how they scaffold student experience. Ask what happens when students get stuck or confused.
At WhimsyLabs, we designed for exactly these challenges. WhimsyCat is not decoration. It is the pedagogical scaffolding that the Wu study shows is essential. Process-based assessment is not marketing. It is how you sustain meaningful learning across a full academic term rather than just a flashy demonstration.
Technology can transform science education. But only when it comes with the pedagogy to match.
Related Articles
- The OECD AI Learning Paradox: Why GenAI Fails Students (And How Virtual Labs Succeed)
- The Critics Are Right: Most EdTech Is Useless
- AI Science Tutors: What Actually Works in the Classroom
References
- Wu, X., Klassen, A., & White, T. (2026). Understanding extended VR use in undergraduate STEM education: a focused ethnographic case study of neuroanatomy instruction. Frontiers in Virtual Reality. https://doi.org/10.3389/frvir.2026.1783834
- Mishra, P., & Koehler, M. J. (2006). Technological pedagogical content knowledge: A framework for teacher knowledge. Teachers College Record, 108(6), 1017-1054.
- Eccles, J. S., & Wigfield, A. (2002). Motivational beliefs, values, and goals. Annual Review of Psychology, 53(1), 109-132.
