ISO/IEC 19794-13:2018 Information technology — Biometric data interchange formats — Part 13: Voice data

Standard number: ISO/IEC 19794-13:2018

Chinese title: 信息技术 生物统计数据互换格式 第13部分:声音数据

English title: Information technology — Biometric data interchange formats — Part 13: Voice data

Publication date: 2018-03

Scope

ISO/IEC 19794-13:2018 specifies a data interchange format that can be used for storing, recording, and transmitting digitized acoustic human voice data (speech) assumed to be from a single speaker recorded in a single session. This format is designed specifically to support a wide variety of Speaker Identification and Verification (SIV) applications, both text-dependent and text-independent, with minimal assumptions made regarding the voice data capture conditions or the collection environment. Other uses for the data encapsulated in this format, such as automated speech recognition (ASR), may be possible, but are not addressed in this document. This document also does not address handling of data that has been processed to the feature or voice model levels. No application-specific requirements, equipment, or features are addressed in this document. This document supports the optional inclusion of non-standardized extended data. This document allows both the original data captured and digitally processed (enhanced) voice data to be exchanged. A description of any processing of the original source input is intended to be included in the metadata associated with the voice representations (VRs). This document does not address data streaming.
Provisions that stored and transmitted biometric data be time-stamped and that cryptographic techniques be used to protect their authenticity, integrity and confidentiality are out of the scope of this document.
Information formatted in accordance with this document can be recorded on machine-readable media or can be transmitted by data communication between systems.
A general content-oriented subclause describing the voice data interchange format is followed by a subclause addressing an XML schema definition.
ISO/IEC 19794-13:2018 includes vocabulary in common use by the speech and speaker recognition community, as well as terminology from other ISO standards.
