HereSay Voice AI Classifications Dataset

Version 2026-Q2-v2 · Released by HereSay · ODC-BY 1.0

10 aggregate classification datasets computed from 54 anonymized voice AI conversations (2,334 turns). Includes OpenAI's Asking/Doing/Expressing trichotomy, voice-specific facets, per-turn sentiment (VADER), question-type and pronoun distributions, conversation-arc transitions, and time-of-day patterns. Raw conversation text is not included.

License: ODC-BY 1.0 — Free to use, free to redistribute, attribution required. When you use this dataset in research, journalism, or commercial work, include:
"Contains information from the HereSay Voice AI Classifications Dataset (2026-Q2-v2) by HereSay (heresay.live), which is made available under the ODC Attribution License (ODC-BY 1.0)."

You need a free HereSay account to download. This is so you see the license at download time.

Sign in or create a free account

What's inside the ZIP

Read the blog post with headline findings →