<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Blog on Victoria Dyster, PhD</title><link>http://victoriadyster.com/blog/</link><description>Recent content in Blog on Victoria Dyster, PhD</description><generator>Hugo -- gohugo.io</generator><language>en</language><copyright>© 2026 Victoria Dyster, PhD</copyright><lastBuildDate>Mon, 10 Mar 2025 00:00:00 +0000</lastBuildDate><atom:link href="http://victoriadyster.com/blog/index.xml" rel="self" type="application/rss+xml"/><item><title>Using Learning Curves to Know Whether More Data Will Help</title><link>http://victoriadyster.com/blog/learning-curves-sample-size/</link><pubDate>Mon, 10 Mar 2025 00:00:00 +0000</pubDate><guid>http://victoriadyster.com/blog/learning-curves-sample-size/</guid><description>Learning curves won&amp;rsquo;t tell you exactly how many samples to collect, but they will tell you whether collecting more is worth it at all. In domains where each sample costs real money, that&amp;rsquo;s the question that actually matters.</description></item><item><title>When Random Features Work Just as Well</title><link>http://victoriadyster.com/blog/when-random-features-work-just-as-well/</link><pubDate>Mon, 20 Jan 2025 00:00:00 +0000</pubDate><guid>http://victoriadyster.com/blog/when-random-features-work-just-as-well/</guid><description>On the counterintuitive finding that randomly selecting features from high-dimensional genomic data often matches the performance of careful feature engineering and why that makes mathematical sense.</description></item><item><title>Glorified Excel: The Dashboard Feature Request You Should Push Back On</title><link>http://victoriadyster.com/blog/glorified-excel/</link><pubDate>Fri, 01 Sep 2023 00:00:00 +0000</pubDate><guid>http://victoriadyster.com/blog/glorified-excel/</guid><description>In biotech, the most common dashboard feature request is a filterable, sortable table. The fastest solution is usually a CSV download and the spreadsheet software people already know.</description></item><item><title>FormulAI: Using LangChain to Generate Skincare Formulations</title><link>http://victoriadyster.com/blog/formulai-langchain-formulations/</link><pubDate>Sun, 11 Jun 2023 00:00:00 +0000</pubDate><guid>http://victoriadyster.com/blog/formulai-langchain-formulations/</guid><description>A hackathon project that chains LLM calls with a product ingredient database and Wikipedia to generate skincare formulations — ingredients, assembly protocols, and allergen warnings.</description></item><item><title>How to Impress Someone with a Bioinformatics Pipeline</title><link>http://victoriadyster.com/blog/how-to-impress-someone-with-a-bioinformatics-pipeline/</link><pubDate>Sun, 02 Apr 2023 00:00:00 +0000</pubDate><guid>http://victoriadyster.com/blog/how-to-impress-someone-with-a-bioinformatics-pipeline/</guid><description>What makes a good bioinformatics pipeline? A short, non-technical take on the things that matter — from user requirements to reproducibility to knowing when good enough is good enough.</description></item></channel></rss>