TI - Human versus machine: Can generative AI anticipate insect biological control outcomes?
AU - Wyckhuys, K.A.G.
AU - Akutse, K.S.
AU - Amalin, D.M.
AU - Araj, S-E.
AU - Beltran, M.J.B.
AU - Fekih, I.B.
AU - Calatayud, P-A.
AU - Cicero, L.
AU - Cokola, M.C.
AU - Colmenarez, Y.C.
AU - Dessauvages, K.
AU - Dubois, T.
AU - Durocher-Granger, L.
AU - Fernández-Triana, J.L.
AU - Francis, F.
AU - Haddi, K.
AU - Harrison, R.D.
AU - Haseeb, M.
AU - Iwanicki, N.S.A.
AU - Jaber, L.R.
AU - Khamis, F.M.
AU - Legaspi, J.C.
AU - Lomeli-Flores, R,J.
AU - Lyu, B.
AU - Montoya-Lerma, J.
AU - Nurkomar, I.
AU - O’Hara, J.E.
AU - Perier, J.D.
AU - Ramírez-Romero, R.
AU - Sanchez-Garcia, F.J.
AU - Robinson-Baker, A.M.S.
AU - Silveira, L.C.P.
AU - Simeon, L.
AU - Solter, L.F.
AU - Santos-Amaya, O.F.
AU - de Souza Tavares, W.
AU - Trabanino, R.
AU - Valicente, F.H.
AU - Vásquez, C.
AU - Wang, Z.
AU - Zang, L.S.
AU - Zhang, W.
AU - Zimba, K.J.Wu
AU - Wu, K.
AU - GC, Yubak D.
AB - Generative artificial intelligence (AI) could transform evidence synthesis and revolutionize the global scientific enterprise, yet its agricultural applications are understudied. Here, we systematically assess the performance of three web-grounded AI engines (ChatGPT, ScholarAI and DeepSeek) in synthesizing the global literature on biological control of the fall armyworm Spodoptera frugiperda, and benchmark their outputs against a recent, near-exhaustive human review. Though all engines rapidly screened vast literature corpora, they exhibited shortcomings in factual accuracy, reporting reliability and data consistency. In machine-run syntheses, natural enemy prevalence and performance data often diverged from published records while the level of agreement in enumerating top-performing taxa was evenly low. Meanwhile, internal consistency between laboratory and field-level parasitism data for ScholarAI and DeepSeek was similar to that in human-run reviews. All models tended towards faulty data extrapolation, hallucination and data fabrication, and a sporadic exclusion of key species. While autonomous, machine-only efforts accurately capture coarse-grained patterns in natural enemy identity, abundance, and impacts, they carry limited utility for (living) evidence syntheses or rigorous decision-support. Yet, handled with prudence and due human oversight, machine power might eventually revitalize underfunded disciplines and advance nature-friendly farming.
PY - 2026
UR - https://www.cifor-icraf.org/knowledge/publication/46163/
DO - https://doi.org/10.1016/j.compag.2025.111317
KW - agricultural applications, agroecology, artificial intelligence, biological control, fall armyworm, large language models, machine learning, pest control, pesticides, sustainable agriculture, systematic reviews
ER -