
Indian Agent to US Customer Speech Dataset in English for Finance 50Hrs

Name: Indian Agent to US Customer Speech Dataset in English for Finance
Creator: Macgence
License: https://data.macgence.com/terms-and-conditions

This audio dataset contains call center conversations in the finance domain between English speakers from India and the US, with detailed metadata and accurate transcriptions.

Speech Recognition

English

Industry: Banking & Finance
Duration: 50 Hours
Individuals: 2

Description

About This OTS Dataset

This Off-The-Shelf (OTS) dataset is a collection of audio recordings of conversations between US customers and Indian agents in the financial sector. It is curated to improve speech recognition and conversational AI models for the specific dynamics of these agent-customer interactions.

Metadata Availability: Insights into Participant Details

Each participant is accompanied by detailed metadata, including age, gender, country, state, dialect, domain, topic, call type, and outcome. These fields make it possible to filter, balance, and stratify the data during model development.
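As a rough illustration of how this metadata might be used, the sketch below filters records by field value and counts call outcomes. The field names follow the list above, but the list-of-dictionaries layout, the `call_id` key, and all values are assumptions made for the example, not the dataset's documented format.

```python
from collections import Counter

# Hypothetical records: the field names follow the metadata listed above,
# but the values and the list-of-dicts layout are illustrative assumptions.
calls = [
    {"call_id": "c1", "age": 29, "gender": "female", "country": "India",
     "topic": "loan", "call_type": "inbound", "outcome": "resolved"},
    {"call_id": "c2", "age": 41, "gender": "male", "country": "US",
     "topic": "credit card", "call_type": "inbound", "outcome": "escalated"},
    {"call_id": "c3", "age": 35, "gender": "female", "country": "US",
     "topic": "loan", "call_type": "outbound", "outcome": "resolved"},
]


def filter_calls(records, **criteria):
    """Keep records whose fields equal every given criterion."""
    return [r for r in records
            if all(r.get(key) == value for key, value in criteria.items())]


loan_calls = filter_calls(calls, topic="loan")
outcomes = Counter(r["outcome"] for r in calls)

print([r["call_id"] for r in loan_calls])
print(outcomes)
```

The same helper accepts several criteria at once, for example `filter_calls(calls, topic="loan", call_type="outbound")`.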

Audio Recording Details

Audio Duration: 50 hours
Format Utilized: WAV for optimal quality
Customizable Sample Rate: The sample rate can be adjusted to project needs; bit depth is a standard 16-bit for clear, high-quality audio
Diverse Recording Environments: Conducted in various real-world settings, offering a comprehensive portrayal of call center interactions
Standard Recording Equipment: Recorded on standard call center devices to capture authentic conversations between Indian agents and US customers

These technical specifications ensure compatibility and optimal performance for various AI development applications within the financial sector.
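A quick way to confirm these properties on a delivered file is to read the WAV header. The sketch below uses Python's standard `wave` module. Since no sample file is available here, it builds a one-second silent clip in memory; the 8000 Hz rate and mono channel are demonstration values only, not specifications of this dataset.

```python
import io
import wave


def describe_wav(source):
    """Return basic properties of a WAV file (path or binary file object)."""
    with wave.open(source, "rb") as wav:
        frames = wav.getnframes()
        rate = wav.getframerate()
        return {
            "channels": wav.getnchannels(),
            "sample_rate_hz": rate,
            "bit_depth": wav.getsampwidth() * 8,  # bytes per sample -> bits
            "duration_s": frames / rate,
        }


def make_silent_wav(seconds=1.0, rate=8000):
    """Build an in-memory 16-bit mono WAV, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    buffer.seek(0)
    return buffer


info = describe_wav(make_silent_wav())
print(info)
```

For a real delivery, `describe_wav` can be called with a file path instead of the in-memory buffer, and the reported bit depth checked against the 16-bit standard stated above.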

Insights into Audio Data

The dataset comprises 50 hours of high-quality audio recordings covering various topics within the financial domain. Created in collaboration with English speakers from India and the United States, it captures realistic agent-customer interactions across accents, dialects, and demographics.

Dataset Transcription Details

Manual verbatim transcriptions in JSON format accompany each audio file, capturing speaker-wise dialogue with time-coded segmentation and non-speech labels. These transcriptions speed up the development of finance-related call center conversational AI and automatic speech recognition (ASR) models.
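To show how such a transcription might be consumed, the sketch below parses a small JSON document and totals speech time per speaker, skipping non-speech segments. The schema is hypothetical: the keys `audio_file`, `segments`, `speaker`, `start`, `end`, `text`, and `non_speech` are assumptions chosen for the example, and the actual files may be structured differently.

```python
import json

# Hypothetical transcript: the key names and values are assumptions,
# not the dataset's documented schema.
SAMPLE = """
{
  "audio_file": "call_0001.wav",
  "segments": [
    {"speaker": "agent", "start": 0.0, "end": 3.2,
     "text": "Thank you for calling, how can I help?"},
    {"speaker": "customer", "start": 3.4, "end": 6.1,
     "text": "I have a question about my statement."},
    {"speaker": "customer", "start": 6.1, "end": 6.9,
     "text": "[noise]", "non_speech": true}
  ]
}
"""


def speech_seconds_by_speaker(transcript):
    """Sum segment durations per speaker, ignoring non-speech segments."""
    totals = {}
    for segment in transcript["segments"]:
        if segment.get("non_speech"):
            continue
        duration = segment["end"] - segment["start"]
        totals[segment["speaker"]] = totals.get(segment["speaker"], 0.0) + duration
    return totals


transcript = json.loads(SAMPLE)
totals = speech_seconds_by_speaker(transcript)
print({speaker: round(seconds, 1) for speaker, seconds in totals.items()})
```

Time-coded, speaker-labeled segments like these are what allow an audio file to be cut into utterance-level training pairs for ASR.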

License

Curated exclusively by Macgence, this financial call center audio dataset is available for commercial use by AI developers in the finance sector.

Updates and Customization

The dataset is updated regularly with new audio captured in varied real-world scenarios to keep it relevant. Customization options include adjusted sample rates and bespoke transcriptions tailored to your criteria.

Why Macgence Stands Out

At Macgence, we're more than just a data provider. We offer tailored solutions to meet your specific needs in AI development. Here's why we believe Macgence is the right partner for you:

Tailored Solutions: Your project is unique, and we understand that. We'll customize everything to align precisely with your objectives.
Versatile Data: Our dataset spans a broad spectrum of applications within the finance sector, encompassing speech recognition, natural language processing, and beyond.
Ongoing Support: We're committed to providing continuous assistance throughout your project lifecycle. Our dataset is regularly refreshed with new recordings, and our team remains readily available to offer guidance and support whenever needed.
Transparent Licensing: Utilize our dataset for commercial purposes with confidence. Our transparent and straightforward licensing terms ensure clarity and peace of mind for your organization.
Comprehensive Assistance: Beyond supplying data, we offer supplementary services to support your project, including sourcing additional data, careful labeling, and tailoring datasets to your project specifications.

Choose Macgence for your AI development needs and unlock the full potential of our tailored solutions and expertise.

ASR

Conversational AI

Chatbot

Language Modelling

TTS

Speech Analytics
