Data science resume

Daniel Short

Data Scientist | NLP, ML & Deployment

Data scientist focused on NLP, anomaly detection, and deployment-minded Python work spanning semantic retrieval, fine-tuning, reusable corpora, and model evaluation.

95%
Faster workflow delivery
10x
Serial tracking expansion
1M+
Consumers reached in surveys

Experience

  • Built reusable domain corpora for citation-based chatbot and semantic-search prototypes over destination content, reporting archives, and traveler-facing information.
  • Analyzed AI search, GEO, and organic-discovery trends to guide LLM-readable content structure, prompt-grounding criteria, and evaluation workflows.
  • Launched 66 surveys and screeners to over 1 million consumers, using response data to refine audience hypotheses and testing inputs for segmentation.
  • Automated recurring reporting pipelines so destination content and performance data could be reused for retrieval, evaluation, prototyping, and iteration.
  • Re-platformed R workflows into a one-click Python app, cutting delivery time by 95% for recurring QA, handoff, and labeling cycles.
  • Built decision-tree models that expanded serial-number tracking by 10x and flagged anomalies with 98% precision for review prioritization and exception follow-up.
  • Deployed an autoencoder model that improved data quality for downstream analytics, model inputs, and exception handling workflows.
  • Designed dashboards that boosted theft reporting by 57.6% and increased prevention 180% through clearer pattern monitoring and case prioritization.
  • Reduced inventory loss by 24% through analytics-driven investigations, root-cause tracking, and tooling improvements.

Education

Selected projects

  • Smart Sentence Retriever

    Built embeddings-based semantic search with AWS Lambda ranking, relevance tuning, evaluation, and browser demo delivery.

  • Chatbot (LoRA + RAG)

    Fine-tuned a Mistral chatbot with retrieval and cited answers over destination content for grounded QA and controlled response style.

  • Handwriting Legibility Scoring

    Fine-tuned a PyTorch CNN and built a custom evaluation set for legibility scoring, error analysis, and threshold tuning.

  • Shape Classifier Demo

    Built a TensorFlow classifier for handwritten shape recognition with browser inference testing, demo delivery, and deployment-ready interaction.

View the data science portfolio