2026
arXiv preprint arXiv:2605.03998
Research archive
Peer-reviewed work, applied machine learning, clinical research, and teaching materials — grouped by topic below.
Grouped by topic. Each paper links to DOI or PDF when available.
27 total
Adversarial testing, red-teaming, and guardrail evaluation for frontier language models.
2026
arXiv preprint arXiv:2605.03998
2026
arXiv preprint arXiv:2605.03179
Young, Richard J.
A 1,554-prompt consensus-labeled benchmark distinguishing prompts that elicit executable malicious code from prompts that probe defensive security knowledge, enabling sharper evaluation of LLM misuse-resistance without conflating the two failure modes.
2026
arXiv preprint arXiv:2603.26410
Young, Richard J.
Open-weight reasoning models often surface relevant information inside their thinking tokens that does not propagate to the final answer. This work measures that divergence at scale and characterizes when chain-of-thought traces stop being a faithful explanation of the model's output.
2026
arXiv preprint arXiv:2603.22582
Young, Richard J.
An empirical evaluation of chain-of-thought faithfulness across modern reasoning models, testing whether the stated rationale actually reflects the computation that produced the answer, and quantifying systematic failure modes.
2026
arXiv preprint arXiv:2603.20172
Young, Richard J.
Faithfulness scores for chain-of-thought reasoning depend heavily on the classifier used to judge them. This work shows how classifier sensitivity drives apparent disagreements across faithfulness studies and proposes more robust evaluation protocols.
2025
arXiv preprint arXiv:2512.13655
Young, Richard J.
A cross-architecture comparison of abliteration techniques used to remove refusal behavior from open-weight LLMs, examining trade-offs in capability retention, residual safety, and how each method generalizes across model families.
2025
arXiv preprint arXiv:2512.07059
Young, Richard J.
This study evaluates large language model vulnerability to sophisticated multi-turn attacks using the TEMPEST framework across ten frontier models from eight vendors, testing 1,000 harmful behaviors with over 97,000 API queries. Six models showed 96-100% attack success rate (ASR), while four demonstrated greater resilience with ASR between 42-78%. Enabling extended reasoning reduced ASR from 97% to 42%. The research concludes that safety alignment quality varies substantially across vendors, model scale does not ensure adversarial robustness, and deliberative inference represents a promising defense approach. Current alignment techniques remain fundamentally vulnerable to adaptive multi-turn attacks regardless of model scale, though thinking modes offer potential mitigation strategies.
2025
arXiv preprint arXiv:2511.22047
Young, Richard J.
This research examined ten publicly accessible guardrail models from major organizations including Meta, Google, IBM, NVIDIA, Alibaba, and Allen AI, testing them against 1,445 prompts across 21 attack categories. While Qwen3Guard-8B demonstrated the strongest overall accuracy at 85.3%, all models substantially underperformed when encountering novel attacks compared to benchmark prompts. Qwen3Guard dropped from 91.0% to 33.8% (a 57.2 percentage point gap), whereas Granite-Guardian-3.2-5B exhibited superior generalization with only a 6.5% performance decline. The research also identified a previously unknown failure mode where certain guardrail models generated harmful responses when prompted with a helpful mode jailbreak technique. Generalization capacity, rather than benchmark accuracy, represents the more meaningful evaluation criterion for assessing safety guardrail effectiveness.
2025
Journal of Gambling Studies
Ghaharian, Kasra, Soligo, Maximilliano, Young, Richard, Golab, Lukasz, Kraus, Shane W., Wells, Samantha
Co-authored peer-reviewed work with gambling treatment professionals evaluating LLM responses to problem gambling questions, aimed at ensuring LLM-based education and support tools reduce harm rather than inadvertently encouraging risky behavior.
2024
OSF Preprints
Ghaharian, Kasra, Kraus, Shane W., Young, Richard, Golab, Lukasz, Wells, Samantha, Soligo, Maximilliano
A preprint evaluating LLM responses to a set of problem-gambling clinical vignettes, assessing harm avoidance, accuracy, and alignment with treatment-professional expectations.
Clinical applications across cardiology, medical imaging, privacy, and PHI risk.
2026
Home Health Care Management \& Practice
Gada, Eshan, Pangburn, Paula, Sahr, Christopher, Jarvis, Mary S., Young, Richard J.
An analysis of clinical and cost outcomes for Medicare Advantage members following discharge to post-acute care, identifying patterns that predict readmission and total cost of care.
2025
arXiv preprint arXiv:2511.19739
Young, Richard J., Matthews, Alice M.
This study evaluated ten transformer-based embedding models fine-tuned for cardiology using Low-Rank Adaptation on over 106,000 cardiology text pairs from medical textbooks. Encoder-only architectures, particularly BioLinkBERT, achieve superior domain-specific performance (separation score: 0.510) while using fewer computational resources than larger models. The study challenges the notion that bigger language models automatically produce better clinical embeddings and offers practical recommendations for developing medical NLP systems. All models, code, and datasets are publicly available.
2025
arXiv preprint arXiv:2511.18272
Young, Richard J.
This research examines how vision-language models used for document scanning in healthcare handle sensitive patient information. Testing seven different masking approaches on DeepSeek-OCR using 100 synthetic medical billing statements, all strategies achieve only 42.9% protection against health information exposure. While long-form identifiers like patient names and addresses are successfully blocked, shorter structured data such as medical record and social security numbers leak through. This is attributed to the language model's contextual reasoning rather than inadequate visual masking. Combining vision masking with language-level processing reaches 88.6% effectiveness, recommending future work focus on decoder-level adjustments for HIPAA compliance in medical document handling.
2025
Algorithms
Young, Richard J., Matthews, Alice, Poston, Brach
A benchmark of five LLMs and a multi-LLM ensemble for extracting structured protocol fields from ClinicalTrials.gov, applied to aging-related tDCS trials. The pipeline doubled eligible-trial retrieval to 83 studies and reached near-perfect agreement on core stimulation parameters (kappa approximately 0.94; ICC approximately 0.95-0.96), reducing manual screening burden.
2025
Journal of Gambling Studies
Ghaharian, Kasra, Peterson, Jay, Young, Richard
An open-banking analysis of gambling transaction patterns among UK consumers, characterizing behaviors associated with progression from recreational to problem gambling using real transaction-level data.
2024
OSF Preprints
Ghaharian, Kasra, Peterson, Jay, Young, Richard
Preprint version of the open-banking gambling analysis later published in Journal of Gambling Studies, characterizing gambling behavior patterns in UK transaction data.
2024
OSF
Ghaharian, Kasra, Azizsoltani, Mohammad, Puranik, Pranav A., Young, Richard
A research dataset and methodology report documenting the UNLV IGI open-banking gambling research program, including data structure, governance, and analytic approach.
2024
American Journal of Managed Care
Gada, Eshan, Pangburn, Paula, Sahr, Christopher, Schaben, Carol, Young, Richard
An analysis of Medicare Advantage members showing that unfulfilled home-health referrals are associated with significantly higher mortality, highlighting a measurable care-coordination gap with patient-safety implications.
2022
Alzheimer's \& Dementia
Zhuang, Xiaowei, Young, Richard, Tillett, Ryan, Cordes, Dietmar, Oh, Edwin
A multimodal AI approach combining genomic features with brain imaging biomarkers to improve diagnostic accuracy for Alzheimer's disease relative to single-modality baselines.
Language models, embedding methods, and domain-adaptation techniques.
2025
arXiv preprint arXiv:2510.18892
Young, Richard J., Gillins, Brandon, Matthews, Alice M.
Despite widespread deployment of Large Language Models, systematic evaluation of instruction-following capabilities remains challenging. While comprehensive benchmarks exist, focused assessments that quickly diagnose specific instruction adherence patterns are valuable. As newer models may be trained on existing benchmarks, novel evaluation approaches are needed to assess genuine capabilities rather than memorized performance. This paper presents a streamlined evaluation framework using twenty carefully designed prompts to assess LLM instruction-following across diverse task categories. We demonstrate this framework through a large-scale empirical study conducted on October 14, 2025, testing 256 verified working models from 331 available via OpenRouter. To ensure methodological rigor and prevent selection bias, we first verified each model's basic functionality before inclusion. Unlike large-scale benchmarks requiring extensive computational resources, our approach offers a practical diagnostic tool researchers and practitioners can readily apply. Our methodology builds upon verifiable instructions while introducing a compact test suite balancing comprehensiveness with efficiency. Each prompt targets distinct aspects of instruction following, including format compliance, content constraints, logical sequencing, and multi-step task execution. We evaluate models from major providers (OpenAI, Anthropic, Google, Meta, Mistral) and emerging implementations (Qwen, DeepSeek, community models), providing comparative performance analysis. Our findings reveal consistent failure modes and identify specific instruction types posing particular challenges. This work contributes both a practical evaluation tool and one of the most comprehensive empirical analyses of instruction-following capabilities across the contemporary LLM landscape.
Clinical neuroscience, tDCS, brain-computer interfaces, and cognitive enhancement.
2025
Brain Sciences
Wilkins, Erik W., Young, Richard J., Davidson, Ryan, Krider, Robert, Alhwayek, Ghaith, Park, Jin A., Parikh, Arjun C., Riley, Zachary A., Poston, Brach
An investigation of how transcranial alternating current stimulation applied unilaterally influences excitability in the unstimulated contralateral primary motor cortex, with implications for bilateral neuromodulation protocols.
2024
Brain Sciences
Wilkins, Erik W., Young, Richard J., Houston, Daniel, Kawana, Eric, Mora, Eduardo, Sunkara, Manish, Riley, Zachary A., Poston, Brach
This study tests whether transcranial direct current stimulation of the dominant hemisphere produces measurable changes in the non-dominant hemisphere, finding no significant cross-hemispheric effects on excitability during or after stimulation.
2023
Brain Sciences
De Guzman, Kelly A., Young, Richard J., Contini, Vincent, Clinton, Ethan, Hitchcock, Amelia, Riley, Zachary A., Poston, Brach
This study investigates how transcranial alternating current stimulation applied to motor cortex influences fatigue resistance during sustained voluntary muscle contractions.
2018
Alzheimer's \& Dementia: Translational Research \& Clinical Interventions
Sharma, Sandeep, Young, Richard J., Chen, Jingchun, Chen, Xuemei, Oh, Edwin C., Schiller, Martin R.
Analysis showing that short functional protein motifs (minimotifs) are systematically disrupted across neurodegenerative disorders, suggesting a shared molecular vulnerability and a target class for therapeutic study.
Cross-cutting machine learning methods, evaluation, causality, and benchmarks.
2018
Nucleic Acids Research
Lyon, Kenneth F., Cai, Xinge, Young, Richard J., Mamun, Abdullah-Al, Rajasekaran, Sanguthevar, Schiller, Martin R.
The fourth release of the Minimotif Miner database, expanding annotated peptide minimotifs to the million-entry scale and adding new search and visualization tooling for functional motif discovery.
2015
Nucleic Acids Research
Lyon, Kenneth F., Strong, Christy L., Schooler, Sara G., Young, Richard J., Roy, Nikhil, Ozar, Bill, Bachmeier, Michael, Rajasekaran, Sanguthevar, Schiller, Martin R.
Analysis of 1,092 human genomes showing that short functional protein motifs (minimotifs) display patterns of natural variability consistent with being targets of evolutionary selection.