Data collection case study: Natural conversations in specific dialects

Recording two people casually playing a game might not sound like a challenge, but when a big 5 global technology service provider needed over 1000 conversations between unique pairs of speakers of very specific regional dialects delivered in just 2 months, figuring out how to facilitate recordings became the name of the game. Sigma.AI pivoted quickly and built a custom, automated game coordination tool to efficiently match pairs in a short period of time.

1160

Recorded conversations of unique pairs of regional dialect speakers

25%

Project management time saved through custom tooling

2

Months turnaround time for dataset delivery

Challenge

  • Record 1000+ natural, fluid conversations between two participants
  • Source participants who are native speakers of highly specific regional dialects, currently living at least 10 years in that region
  • Many participants needed, of diverse ages and genders
  • Client provided a set of games as prompts, e.g. describing an image or a word
  • Pairs of participants could not repeat — each of the conversations needed to be unique and between different people

Solution

  • Tapped into existing pool of 25,000+ annotators and linguists to quickly source participants for specific regional dialects
  • Trained participants for the project including video tutorials integrated into the user interface
  • Advised client on optimal distribution of not only gender and age but also vocal qualities and speed of speech within dataset
  • Unified several user interfaces into one to help foster more natural conversations
  • Optimized on-the-ground participant workflows
  • Created a custom tool for participant matching to maximize matching unique pairs according to their availability

Project story

For a natural understanding project, a major global technology player faced a daunting data collection task: capture over 1,000 authentic, flowing conversations, all within an incredibly tight two-month deadline.

Their approach consisted of providing speakers with games to play as prompts, where participants would have to describe an image or a word to their conversation partner. This would encourage natural, free-flowing conversations, not limited to a script.

Finding the optimal balance of diverse speakers

The first challenge was sourcing speakers of very specific dialects — they needed not just native speakers, but those with at least 10 years of residency in the dialect’s region, and a broad representation of ages and genders.

Sigma was able to tap into its existing pool of over 25,000 annotators and linguists to quickly source speakers of the requested dialects with the required local experience.

But there were still a few challenges to solve. Given such a tight deadline, how could the dataset include the right balance of ages and genders needed to train the algorithm for all possible circumstances? Sigma collaborated with the client to identify the optimal distribution, maximizing speaker diversity while acknowledging real-world constraints. Their speech and language AI expertise also allowed Sigma to advise on incorporating diverse vocal pitch, timbre, and speaking speeds.

Project and product teams collaborate to deliver at scale

Right from the start, Sigma understood that efficient workflows were essential for delivering the project at scale and on time. This required a concerted effort from project managers and product development to optimize tools and processes. Sigma’s commitment to customization — of tools, teams, and processes — ensured they could adapt to the project’s specific demands and satisfy the client.

Sigma’s proactive approach to project management, with constant feedback loops with the client and participants, allowed them to anticipate and address potential issues before they escalated. For instance, seeing that speakers were struggling with multiple user interfaces, the product development team quickly integrated them into a single, seamless platform. Project managers then created and embedded video tutorials within the interface. These adaptations resulted in smoother participation, quicker recording setup, and more natural conversations.

Automated pair matching saves 25% of project management time

Two key challenges emerged in the data collection process. First, the logistics of scheduling gameplay between a large number of participants with different availabilities, while ensuring unique pairings for each 45-minute conversation. Second, maintaining participant motivation for these relatively short sessions. 

The most essential piece of the optimization puzzle was automating the process of matching unique pairs of players together. The product team created a custom interface to match participants’ availability and automatically send them calendar invitations, optimizing the most labor-intensive part of the project management process and saving 25% of the total project management time. 

This also solved the question of motivation – participants could set their own schedules, lose no time on overhead and coordination, and could be sure that their next conversation would happen as soon as possible. It also greatly improved the speakers’ satisfaction with the project and continued participation, because their game partners were more likely to show up consistently and on time thanks to the efficient, automated meeting planner.

A major technology services client needs 2000 hours of video in 24 languages transcribed by humans — and wants to launch all 24 teams at once.
A client in consumer hardware needed highly sensitive user data annotated. Sigma.AI designed, implemented & operated secure facilities for 400+ annotators.
Need pixel-perfect image annotation? Learn how Sigma's human-in-the-loop team helped a robotics client achieve 1-pixel tolerance labeling.
Pixel-perfect image annotation for product recognition: A case study
EN