Machine Learning Advances Gene Therapy: Doctoral Research Transforms Treatment Strategies


Key Takeaways

  • Kelvin Idanwekhai, a chemistry doctoral student at UNC‑Chapel Hill, is applying machine‑learning algorithms to streamline the purification of therapeutic viruses used in gene therapy.
  • Traditional design‑of‑experiment (DOE) methods become impractical when more than a handful of variables (pH, salt concentration, temperature, flow rate, etc.) must be optimized simultaneously.
  • By letting the algorithm learn which experimental conditions are most promising, Idanwekhai reduced the search from ~900,000 possible experiments to just 30 trials while still finding optimal settings.
  • Applied to three distinct viruses, the approach raised viral yields from 70 % to 99 %, cut impurities, and preserved biological activity after only three optimization rounds.
  • The model also provides mechanistic insight—for example, revealing that pH exerts the strongest influence on yield—enabling faster, data‑driven decision‑making.
  • Practical hurdles remain: most lab instruments do not automatically export data, forcing manual extraction that consumed months of work.
  • Looking forward, Idanwekhai envisions integrating reinforcement learning and large language models to let AI read scientific literature, suggest experiments, and close the loop between instruments and computation.
  • A user‑friendly, no‑code software tool is being developed so that bench scientists can harness these AI‑driven optimizations without programming expertise.
  • Under the mentorship of Professor Alexander Tropsha, Idanwekhai enjoys the intellectual freedom to pursue high‑risk, high‑reward ideas that could reshape bioprocessing and gene‑therapy manufacturing.

Machine Learning Replaces Trial‑and‑Error in Virus Purification
Kelvin Idanwekhai’s work began with a simple observation: optimizing the production of therapeutic viruses involves juggling dozens of parameters—pH, salt concentration, temperature, flow rate, and more—each of which can dramatically affect yield and purity. “In chemistry and bioprocessing, people often rely on design-of-experiment methods,” said Idanwekhai. “Those work fine when you’re dealing with a small number of parameters. But when you’re trying to adjust 10 or more things at once… it becomes impossible to test everything manually.” Traditional DOE matrices explode in size, making exhaustive screening impractical for academic labs with limited time and resources.
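To make that combinatorial explosion concrete, here is a back-of-the-envelope calculation. The 10-parameter, 4-level grid is purely illustrative, not Idanwekhai's actual design space:

```python
from math import prod

# Illustrative full-factorial grid: 10 process parameters (pH, salt,
# temperature, flow rate, ...) screened at just 4 levels each.
levels = [4] * 10
total_runs = prod(levels)   # 4**10 runs to test every combination
```

Even at four levels per parameter, an exhaustive sweep exceeds a million experiments, which is why DOE matrices become impractical past a handful of variables.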

Idanwekhai turned to machine learning as a way to intelligently prune that vast experimental space. Instead of testing every combination, his algorithm “learns” which experiments are most likely to succeed based on early results, then focuses subsequent trials on those promising conditions. Over successive iterations, the model refines its predictions, homing in on the optimal set of parameters with far fewer runs.
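The learn-then-focus loop can be sketched as a toy surrogate-guided search. Everything below is an illustrative assumption: the yield function, the parameter ranges, and the crude nearest-neighbour surrogate all stand in for the real assays and model, which the article does not specify. Only the budget mirrors the story: 5 seed runs plus 25 guided picks, 30 experiments in total.

```python
import itertools
import random

# Hypothetical yield function standing in for a real wet-lab assay; the
# optimizer never sees its formula, only the measurements it returns.
def run_experiment(ph, salt, temp):
    return -((ph - 5.0) ** 2 + (salt - 0.3) ** 2 + (temp - 25) ** 2 / 100)

# Full candidate grid (540 combinations here; the real space was far larger).
grid = list(itertools.product(
    [x / 2 for x in range(8, 17)],    # pH 4.0-8.0
    [x / 10 for x in range(1, 11)],   # salt 0.1-1.0 M
    range(15, 41, 5),                 # temperature 15-40 degC
))

random.seed(0)
observed = {}                          # condition -> measured yield
for cond in random.sample(grid, 5):    # seed with 5 random runs
    observed[cond] = run_experiment(*cond)

def predict(cond):
    # Crude 1-nearest-neighbour surrogate: assume a candidate behaves like
    # the closest condition already tested.
    nearest = min(observed, key=lambda c: sum((a - b) ** 2
                                              for a, b in zip(c, cond)))
    return observed[nearest]

for _ in range(25):                    # 25 guided picks -> 30 runs total
    cand = max((c for c in grid if c not in observed), key=predict)
    observed[cand] = run_experiment(*cand)

best = max(observed, key=observed.get)
```

A production system would replace the nearest-neighbour surrogate with a probabilistic model such as a Gaussian process, so that uncertainty, not just the predicted mean, steers the next pick.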

From Hundreds of Thousands to Just Thirty Trials
To demonstrate the power of this approach, Idanwekhai applied it to a problem with a theoretical search space of roughly 900,000 distinct experiments. “With new tools, we can explore a search space that might include 900,000 possible experiments, and we can find an optimal set of parameters we need in just 30,” he explained. The algorithm achieved this by iteratively selecting experiments that maximized expected information gain, effectively performing a smart, directed search rather than a brute‑force sweep.
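One common way to formalize “maximized expected information gain” is an acquisition function. The upper confidence bound (UCB) shown below is one standard choice, related in spirit to information-gain criteria; the article does not state which criterion was actually used, and the candidates and numbers here are invented for illustration.

```python
# Upper-confidence-bound (UCB) acquisition: score = predicted yield plus a
# bonus proportional to the model's uncertainty about that condition.
def ucb(mean, std, beta=2.0):
    return mean + beta * std

candidates = {
    "pH 5.0": (0.90, 0.02),   # well explored, high predicted yield
    "pH 6.5": (0.80, 0.15),   # little nearby data, so highly uncertain
    "pH 8.0": (0.60, 0.05),   # explored and unpromising
}
scores = {name: ucb(mean, std) for name, (mean, std) in candidates.items()}
next_run = max(scores, key=scores.get)   # the uncertain pH 6.5 wins
```

The uncertainty bonus is what turns a brute-force sweep into a directed search: the algorithm spends its limited budget where it stands to learn the most, not only where the current prediction is highest.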

The technique was not merely theoretical; it was put to work purifying three different viruses that serve as vectors for gene therapy. Each virus has unique biophysical properties, yet the same machine‑learning framework accommodated all three. After only three rounds of optimization, the team observed a dramatic rise in viral yield—from 70 % up to 99 %—while simultaneously reducing impurity levels and preserving the viruses’ biological activity. Such improvements translate directly into lower manufacturing costs and higher doses available for patients.

Gaining Mechanistic Insight from the Model
Beyond boosting productivity, the algorithm offered a window into which variables truly mattered. “You can look inside the model and see, for example, that pH has the biggest effect on yield,” said Idanwekhai. This interpretability is a crucial advantage over black‑box optimization: researchers can confirm that the model’s recommendations align with biochemical intuition, and they can focus future effort on the most influential parameters. In this case, pH emerged as the dominant lever, guiding the team to tighten controls around acidity while allowing less critical factors to vary more freely.
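A standard, model-agnostic way to extract this kind of insight is permutation importance: shuffle one input column and measure how far the model's predictions drift. The toy model and its coefficients below are assumptions for illustration, not the fitted model from the study.

```python
import random

# Toy "trained model": yield depends strongly on pH, weakly on salt and
# temperature. Coefficients are illustrative, not fitted values.
def model(ph, salt, temp):
    return 0.8 * ph + 0.1 * salt + 0.05 * temp

random.seed(1)
data = [(random.uniform(4, 8),        # pH
         random.uniform(0.1, 1.0),    # salt (M)
         random.uniform(15, 40))      # temperature (degC)
        for _ in range(200)]
baseline = [model(*row) for row in data]

def importance(col):
    # Permutation importance: shuffle one input column and measure the
    # average drift of the model's predictions.
    shuffled = [row[col] for row in data]
    random.shuffle(shuffled)
    perturbed = [list(row) for row in data]
    for row, value in zip(perturbed, shuffled):
        row[col] = value
    return sum(abs(model(*row) - base)
               for row, base in zip(perturbed, baseline)) / len(data)

scores = {name: importance(i) for i, name in enumerate(["pH", "salt", "temp"])}
```

In this toy setup, shuffling pH degrades predictions far more than shuffling salt or temperature, mirroring the kind of “pH is the dominant lever” conclusion the team drew from their real model.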

Practical Barriers: Data Silos in the Laboratory
Despite the promise of the approach, Idanwekhai encountered real‑world obstacles that slowed progress. Most laboratory equipment does not automatically export data in a format that machine‑learning pipelines can ingest. “A lot of experimental data are stuck in the instruments themselves,” he noted. “I had to go to the machines and extract the data manually. It took months.” This manual data‑wrangling step erodes some of the speed gains promised by AI and highlights a need for greater standardization and connectivity in lab hardware.

Idanwekhai hopes that future instruments will incorporate simple, plug‑and‑play interfaces—such as USB or Wi‑Fi‑enabled data streams—so that experimental results flow seamlessly into computational models. Achieving a true “closed loop,” where the AI suggests an experiment, the instrument runs it, and the outcome is immediately fed back to further refine predictions, would dramatically accelerate the optimization cycle.
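The closed loop itself can be sketched in a few lines. Both `instrument_run` and `suggest` are hypothetical placeholders, since no real instrument exposes such an API today, which is precisely the article's point:

```python
# Stand-in "instrument": pretend yield peaks at pH 5.5. A real driver would
# talk to hardware over a standardized interface.
def instrument_run(ph):
    return 1.0 - abs(ph - 5.5) / 10

history = []   # (pH tried, yield measured)

def suggest(history):
    # Naive hill-climb: start low, then step up from the best pH seen so far.
    if not history:
        return 4.0
    best_ph, _ = max(history, key=lambda h: h[1])
    return best_ph + 0.5

for _ in range(5):                 # five fully automated suggest-run-learn cycles
    ph = suggest(history)
    history.append((ph, instrument_run(ph)))

best_ph, best_yield = max(history, key=lambda h: h[1])
```

The loop structure, not the naive hill-climb, is the point: once results flow back automatically, each cycle takes as long as the experiment itself, with no months of manual data extraction in between.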

Future Directions: Reinforcement Learning, LLMs, and No‑Code Tools
Looking ahead, Idanwekhai is excited about integrating more advanced AI techniques. Reinforcement learning, which trains agents to make sequences of decisions by rewarding successful outcomes, could enable the system to navigate multi‑step purification protocols autonomously. Large language models (LLMs)—the technology behind modern chatbots—could read scientific literature, extract prior experimental conditions, and suggest hypotheses that humans might overlook. The lab has already started developing an AI platform designed to autonomously search for and optimize potential drug molecules, a sign of a broader vision that extends beyond viral purification.

Critically, the team has also built a user‑friendly software tool that lets bench scientists apply the same machine‑learning strategies without writing code or needing to know how to build computer models. This democratization aims to put powerful optimization capabilities into the hands of everyday researchers, narrowing the gap between discovery and scalable production.

Mentorship and Academic Freedom Drive Innovation
Idanwekhai credits much of his progress to the supportive environment fostered by his advisor, Professor Alexander Tropsha, under whose guidance he says he has found the ideal setting to innovate. “Dr. Tropsha really values independence,” he said. “He tells us, ‘Your Ph.D. is not my Ph.D.’ That freedom lets me chase ideas I believe in.” Such mentorship encourages students to pursue unconventional ideas—like applying AI to bioprocessing—without the pressure to conform to established norms, fostering the kind of breakthrough that could reshape an entire field.

Implications for Gene Therapy and Beyond
The successful application of machine‑learning to virus purification holds immediate promise for gene‑therapy manufacturing, where cost and scalability remain major barriers to widespread clinical use. Higher yields and purer products mean lower doses are needed to achieve therapeutic effect, reducing both expense and potential side‑effects. Moreover, the insights gained—such as the overriding importance of pH—can inform the design of more robust purification platforms that are less sensitive to minor fluctuations, improving batch‑to‑batch consistency.

Beyond gene therapy, the methodology could be adapted to other biologics—vaccines, monoclonal antibodies, enzyme therapeutics—where downstream processing involves similarly complex, multi‑parameter optimization. By converting trial‑and‑error into data‑driven decision‑making, Idanwekhai’s work exemplifies how AI can transform the life‑sciences laboratory from a place of educated guesses into a precision engine of discovery.

In sum, Kelvin Idanwekhai’s research illustrates a paradigm shift: the future of bioprocessing may not be confined to the bench but will increasingly reside in algorithms that learn, predict, and guide experiments. As lab instruments become more interconnected and AI tools more accessible, the vision of a fully automated, intelligent optimization loop moves closer to reality—one that could accelerate the delivery of life‑saving genetic medicines to patients worldwide.

