
<!DOCTYPE HTML>
<html class="oxh met-web" >
<head>
<meta charset="utf-8">
<meta name="renderer" content="webkit">
<meta http-equiv="X-UA-Compatible" content="IE=edge,chrome=1">
<meta name="viewport" content="width=device-width,initial-scale=1.0,maximum-scale=1.0,minimum-scale=1.0,user-scalable=0,minimal-ui">
<meta name="format-detection" content="telephone=no">
<title>语音特征提取: 看懂梅尔语谱图(Mel-spectrogram)、梅尔倒频系数(MFCCs)的原理-有问必答-西数科技-科技创造无限可能</title>
<meta name="description" content="语音特征提取: 看懂梅尔语谱图(Mel-spectrogram)、梅尔倒频系数(MFCCs)的原理">
<meta name="keywords" content="司法鉴定">
<meta name="generator" content="MetInfo V7.9" data-variable="../|cn|cn|metv75|2|142|28" data-user_name="">
<link href="../favicon.ico?1702101608" rel="shortcut icon" type="image/x-icon">
<link rel="stylesheet" type="text/css" href="../public/web/css/basic.css?1713669838">
<link rel="stylesheet" type="text/css" href="../templates/metv75/cache/metinfo.css?1713670248">
<link rel="canonical" href="http://www.jsos.cn/news/news28.htm">
<style>
body{
    background-color: !important;font-family: !important;}
h1,h2,h3,h4,h5,h6{font-family: !important;}
</style>
<script>(function(){var t=navigator.userAgent;(t.indexOf("rv:11")>=0||t.indexOf("MSIE 10")>=0)&&document.write("<script src=\"../public/plugins/html5shiv/html5shiv.min.js\"><\/script>")})();</script>
</head>
<!--[if lte IE 9]>
<div class="text-xs-center m-b-0 bg-blue-grey-100 alert">
    <button type="button" class="close" aria-label="Close" data-dismiss="alert">
        <span aria-hidden="true">×</span>
    </button>
    你正在使用一个过时的浏览器。请升级你的浏览器，以提高您的体验。</div>
<![endif]-->
<body >
<header class='met-head' m-id='met_head' m-type="head_nav">
    <nav class="navbar navbar-default box-shadow-none met-nav">
        <div class="container">
            <div class="row">
                <div class='met-nav-btn'>
                                                                <h3 hidden>西数科技-科技创造无限可能</h3>
                                        <div class="navbar-header pull-xs-left">
                        <a href="../" class="met-logo vertical-align block pull-xs-left" title="司法鉴定，质量鉴定">
                            <div class="vertical-align-middle">
                                                                      <img src="../upload/202307/1688714971.jpg" alt="司法鉴定，质量鉴定" class="mblogo" />
                                    <img src="../upload/202308/1692439831.jpg" alt="司法鉴定，质量鉴定" class="pclogo" />
                                                                </div>
                        </a>
                    </div>
                    <button type="button" class="navbar-toggler hamburger hamburger-close collapsed p-x-5 p-y-0 met-nav-toggler" data-target="#met-nav-collapse" data-toggle="collapse">
                        <span class="sr-only"></span>
                        <span class="hamburger-bar"></span>
                    </button>
                                      </div>
                <div class="navbar-collapse-toolbar pull-md-right p-0 collapse" id='met-head-user-collapse'>
                                      </div>
                <div class="navbar-collapse-toolbar pull-md-right p-0 collapse" id="met-nav-collapse">
                    <ul class="nav navbar-nav navlist">
                        <li class='nav-item'>
                            <a href="../" title="网站首页" class="nav-link
                                                          ">网站首页</a>
                        </li>
                                                                          <li class="nav-item dropdown m-l-10">
                                                          <a
                                href="../product/index.htm"
                                title="产品中心"
                                target='_self'                                class="nav-link dropdown-toggle "
                                data-toggle="dropdown" data-hover="dropdown"
                            >
                                                        <span style=''>产品中心</span></a>
                                                              <div class="dropdown-menu dropdown-menu-right animate animate-reverse">
                                                                                                                                <a href="../product/index.htm" target='_self'  title="全部"
                                    class='dropdown-item nav-parent hidden-xl-up '>全部</a>
                                                                                                                                                              <a href="../product/product_136_1.htm" target='_self' title="数据库取证" class='dropdown-item '><span style=''>数据库取证</span></a>
                                                                                                                                  <a href="../product/product_137_1.htm" target='_blank' title="芯片取证" class='dropdown-item '><span style=''>芯片取证</span></a>
                                                                                                                                  <a href="../product/product_139_1.htm" target='_self' title="视频取证" class='dropdown-item '><span style=''>视频取证</span></a>
                                                                                                                                  <a href="../product/product_177_1.htm" target='_self' title="手机取证" class='dropdown-item '><span style=''>手机取证</span></a>
                                                                                                                                  <a href="../product/product_176_1.htm" target='_self' title="数据恢复" class='dropdown-item '><span style=''>数据恢复</span></a>
                                                                                        </div>
                        </li>
                                                                                                  <li class="nav-item dropdown m-l-10">
                                                          <a
                                href="../news/index.htm"
                                title="常见问题"
                                target='_self'                                class="nav-link dropdown-toggle active"
                                data-toggle="dropdown" data-hover="dropdown"
                            >
                                                        <span style=''>常见问题</span></a>
                                                              <div class="dropdown-menu dropdown-menu-right animate animate-reverse">
                                                                                                                                <a href="../news/index.htm" target='_self'  title="全部"
                                    class='dropdown-item nav-parent hidden-xl-up active'>全部</a>
                                                                                                                                                              <a href="../news/news_140_1.htm" target='_self' title="最新发布" class='dropdown-item '><span style=''>最新发布</span></a>
                                                                                                                                  <div class="dropdown-submenu">
                                        <a href="../news/news_141_1.htm" target='_self' class="dropdown-item "><span style=''>使用教程</span></a>
                                        <div class="dropdown-menu animate animate-reverse">
                                                                                            <a href="../news/news_184_1.htm" target='_self' class="dropdown-item " ><span style=''>FE资源模型</span></a>
                                                                                            <a href="../news/news_185_1.htm" target='_self' class="dropdown-item " ><span style=''>FE技术手册</span></a>
                                                                                    </div>
                                    </div>
                                                                                                                                  <a href="../news/news_142_1.htm" target='_self' title="有问必答" class='dropdown-item active'><span style=''>有问必答</span></a>
                                                                                        </div>
                        </li>
                                                                                                  <li class="nav-item dropdown m-l-10">
                                                          <a
                                href="../download/index.htm"
                                title="下载中心"
                                target='_self'                                class="nav-link dropdown-toggle "
                                data-toggle="dropdown" data-hover="dropdown"
                            >
                                                        <span style=''>下载中心</span></a>
                                                              <div class="dropdown-menu dropdown-menu-right animate animate-reverse">
                                                                                                                                <a href="../download/index.htm" target='_self'  title="全部"
                                    class='dropdown-item nav-parent hidden-xl-up '>全部</a>
                                                                                                                                                              <a href="../download/download_173_1.htm" target='_self' title="软件下载" class='dropdown-item '><span style=''>软件下载</span></a>
                                                                                                                                  <a href="../download/download_166_1.htm" target='_self' title="产品说明书" class='dropdown-item '><span style=''>产品说明书</span></a>
                                                                                                                                  <a href="../download/download_168_1.htm" target='_self' title="培训资料" class='dropdown-item '><span style=''>培训资料</span></a>
                                                                                        </div>
                        </li>
                                                                                                  <li class='nav-item m-l-10'>
                            <a href="../about/about179.htm" target='_self' title="公司简介" class="nav-link "><span style=''>公司简介</span></a>
                        </li>
                                                                    </ul>
                    <div class="metlang m-l-15 pull-md-right">
                                                                                                                <div class="met-langlist vertical-align" m-type="lang" m-id="lang">
                                <div class="inline-block dropdown">
                                                                                                                                                                        <span data-toggle="dropdown" class="met_navbtn dropdown-toggle">
                                                                                                                                      <img src="../public/images/flag/cn.gif" alt="简体中文" width="20">
                                                                                        <span>简体中文</span>
                                          </span>                                    
                                                                          
                                                                          
                                                                                                              <ul class="dropdown-menu dropdown-menu-left mb-animate-reverse animate animate-reverse" id="met-langlist-dropdown" role="menu">
                                                                                                                                                                            
                                        <a href="http://www.jsos.cn/index.php?lang=en" title="English"   target="_blank" class='dropdown-item'>
                                                                                              <img src="../public/images/flag/en.gif" alt="English" width="20">
                                                                                        English                                        </a>

                                                                                                                          
                                        <a href="http://www.jsos.cn/index.php?lang=ru" title="俄语"    class='dropdown-item'>
                                                                                              <img src="../public/images/flag/ru.gif" alt="俄语" width="20">
                                                                                        俄语                                        </a>

                                                                                                                    </ul>
                                                                    </div>
                            </div>
                                            </div>
                </div>
            </div>
        </div>
    </nav>
</header>

    <div class="met-banner carousel slide" id="exampleCarouselDefault" data-ride="carousel" m-id='banner'  m-type='banner'>
          <ol class="carousel-indicators carousel-indicators-fall">
                    <li data-slide-to="0" data-target="#exampleCarouselDefault" class="  active"></li>
                    <li data-slide-to="1" data-target="#exampleCarouselDefault" class="  "></li>
                    <li data-slide-to="2" data-target="#exampleCarouselDefault" class="  "></li>
                    <li data-slide-to="3" data-target="#exampleCarouselDefault" class="  "></li>
            </ol>
                  <a class="left carousel-control" href="#exampleCarouselDefault" role="button" data-slide="prev">
            <span class="icon" aria-hidden="true"><</span>
            <span class="sr-only">Previous</span>
        </a>
        <a class="right carousel-control" href="#exampleCarouselDefault" role="button" data-slide="next">
            <span class="icon" aria-hidden="true">></span>
            <span class="sr-only">Next</span>
        </a>
        <div class="carousel-inner   " role="listbox">
                    <div class="carousel-item   active">
                                      <img class="w-full mobile_img" src="../upload/202308/1692415652.jpg" srcset='../upload/202308/1692415652.jpg 767w,../upload/202308/1692415652.jpg' sizes="(max-width: 767px) 767px" alt="智能化Oracle数据库修复取证软件,一键解析" pch="0" adh="0" iph="0">
                    <img class="w-full pc_img" src="../upload/202308/1692415652.jpg" srcset='../upload/202308/1692415652.jpg 767w,../upload/202308/1692415652.jpg' sizes="(max-width: 767px) 767px" alt="智能化Oracle数据库修复取证软件,一键解析" pch="0" adh="0" iph="0">
                                                          <div class="met-banner-text pc-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4'>
                                <div>
                                    <div>
                                                                              <a href="http://jsos.cn/product/product122.htm" title="支持DBF/DMP数据直接解析提取,自动生成Oracle数据库" class="all-imgmask"   ></a>
                                                                                                              <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#23e020;font-size: px;">智能化Oracle数据库修复取证软件,一键解析</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#e85207;font-size: px;'>支持DBF/DMP数据直接解析提取,自动生成Oracle数据库</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product122.htm" title="软件下载试用"   target="_blank" class="btn slick-btn   " infoset="|#c90a0a|#043cd6|#0e12f0|#07d1a5||">软件下载试用</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                                                      <div class="met-banner-text mobile-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4 '>
                                <div>
                                    <div>
                                                                              <a href="http://jsos.cn/product/product122.htm" title="支持DBF/DMP数据直接解析提取,自动生成Oracle数据库" class="all-imgmask"   ></a>
                                                                                                              <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#23e020;font-size: px;">智能化Oracle数据库修复取证软件,一键解析</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#e85207;font-size: px;'>支持DBF/DMP数据直接解析提取,自动生成Oracle数据库</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product122.htm" title="软件下载试用"   target="_blank" class="btn slick-btn   " infoset="|#c90a0a|#043cd6|#0e12f0|#07d1a5||">软件下载试用</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                            </div>
                    <div class="carousel-item   ">
                                      <img class="w-full mobile_img" src="../upload/202308/1692415971.jpg" srcset='../upload/202308/1692415971.jpg 767w,../upload/202308/1692415971.jpg' sizes="(max-width: 767px) 767px" alt="SQL Server数据库提取与修复" pch="0" adh="0" iph="0">
                    <img class="w-full pc_img" src="../upload/202308/1692415971.jpg" srcset='../upload/202308/1692415971.jpg 767w,../upload/202308/1692415971.jpg' sizes="(max-width: 767px) 767px" alt="SQL Server数据库提取与修复" pch="0" adh="0" iph="0">
                                                          <div class="met-banner-text pc-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4'>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#14e018;font-size: px;">SQL Server数据库提取与修复</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#e6103f;font-size: px;'>支持MDF/BAK/NDF等格式数据库的碎片提取与损坏修复及加密解密</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product104.htm" title="了解数据恢复设备"   target="_blank" class="btn slick-btn   " infoset="|#0c36ab|#f20c0c|#c99f9e|#820e0e||">了解数据恢复设备</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                                                      <div class="met-banner-text mobile-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4 '>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#14e018;font-size: px;">SQL Server数据库提取与修复</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#e6103f;font-size: px;'>支持MDF/BAK/NDF等格式数据库的碎片提取与损坏修复及加密解密</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product104.htm" title="了解数据恢复设备"   target="_blank" class="btn slick-btn   " infoset="|#0c36ab|#f20c0c|#c99f9e|#820e0e||">了解数据恢复设备</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                            </div>
                    <div class="carousel-item   ">
                                      <img class="w-full mobile_img" src="../upload/202308/1692415626.jpg" srcset='../upload/202308/1692415626.jpg 767w,../upload/202308/1692415626.jpg' sizes="(max-width: 767px) 767px" alt="芯片级物理数据读取" pch="0" adh="0" iph="0">
                    <img class="w-full pc_img" src="../upload/202308/1692415626.jpg" srcset='../upload/202308/1692415626.jpg 767w,../upload/202308/1692415626.jpg' sizes="(max-width: 767px) 767px" alt="芯片级物理数据读取" pch="0" adh="0" iph="0">
                                                          <div class="met-banner-text pc-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4'>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#14ba4c;font-size: px;">芯片级物理数据读取</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#de0b39;font-size: px;'>SD卡、TF卡、U盘、SSD芯片级数据提取分析方案</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product105.htm" title="了解更多"    class="btn slick-btn   " infoset="|#163ff7|#1f9484|#349477|#820e0e||">了解更多</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                                                      <div class="met-banner-text mobile-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4 '>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#14ba4c;font-size: px;">芯片级物理数据读取</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:#de0b39;font-size: px;'>SD卡、TF卡、U盘、SSD芯片级数据提取分析方案</p>
                                                                                                                        <a href="http://www.jsos.cn/product/product105.htm" title="了解更多"    class="btn slick-btn   " infoset="|#163ff7|#1f9484|#349477|#820e0e||">了解更多</a>
                                                                        </div>
                                </div>
                            </div>
                        </div>
                    </div>
                            </div>
                    <div class="carousel-item   ">
                                      <img class="w-full mobile_img" src="../upload/202308/1692415027.jpg" srcset='../upload/202308/1692415027.jpg 767w,../upload/202308/1692415027.jpg' sizes="(max-width: 767px) 767px" alt="数据提取，数据恢复，视频取证" pch="0" adh="0" iph="0">
                    <img class="w-full pc_img" src="../upload/202308/1692414995.jpg" srcset='../upload/202308/1692414995.jpg 767w,../upload/202308/1692414995.jpg' sizes="(max-width: 767px) 767px" alt="数据提取，数据恢复，视频取证" pch="0" adh="0" iph="0">
                                                          <div class="met-banner-text pc-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4'>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#0bde32;font-size: px;">数据提取，数据恢复，视频取证</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:;font-size: px;'>视频侦察、硬盘取证、数据库取证产品与方案</p>
                                                                                                                    </div>
                                </div>
                            </div>
                        </div>
                    </div>
                                                      <div class="met-banner-text mobile-content" met-imgmask>
                        <div class='container'>
                            <div class='met-banner-text-con p-4 '>
                                <div>
                                    <div>
                                                                                                                <h3 class="animation-slide-top animation-delay-300 font-weight-500" style="color:#0bde32;font-size: px;">数据提取，数据恢复，视频取证</h3>
                                                                                                              <p class="animation-slide-bottom animation-delay-600" style='color:;font-size: px;'>视频侦察、硬盘取证、数据库取证产品与方案</p>
                                                                                                                    </div>
                                </div>
                            </div>
                        </div>
                    </div>
                            </div>
            </div>
</div>
                  <section class="met-crumbs hidden-sm-down" m-id='met_position' m-type='nocontent'>
    <div class="container">
        <div class="row">
            <div class="border-bottom clearfix">
                <ol class="breadcrumb m-b-0 subcolumn-crumbs breadcrumb-arrow">
                    <li class='breadcrumb-item'>
                        你的位置                    </li>
                    <li class='breadcrumb-item'>
                        <a href="../" title="网站首页" class='icon wb-home'>网站首页</a>
                    </li>
                                                                          <li class='breadcrumb-item'>
                            <a href="../news/index.htm" title="常见问题" class=''>常见问题</a>
                        </li>
                                                                                      <li class='breadcrumb-item'>
                            <a href="../news/news_142_1.htm" title="有问必答" class=''>有问必答</a>
                        </li>
                                                                                                  </ol>
            </div>
        </div>
    </div>
</section>
    <main class="met-shownews animsition">
    <div class="container">
        <div class="row">
        <div class="clearfix">

            <div class="col-md-9 met-shownews-body" m-id="noset">
                <div class="row">

                    <section class="details-title border-bottom1">
						<h1 class='m-0'>语音特征提取: 看懂梅尔语谱图(Mel-spectrogram)、梅尔倒频系数(MFCCs)的原理</h1>
						<div class="info font-weight-300">
							<span>2023-08-24 16:10:44</span>
                              							 <span>MetInfo</span>
                                                              							<span>
    								<i class="icon wb-eye m-r-5" aria-hidden="true"></i>
    								<script type='text/javascript' class='met_hits' data-hits='78' src="../hits/?lang=cn&type=news&vid=28&list=0"></script>    							</span>
                            						</div>
					</section>
					<section class="met-editor clearfix">
						<div class="tit" style="margin: 0px; padding: 0px; text-size-adjust: none; height: 40px; line-height: 40px; text-align: center; font-weight: bold; border-bottom: 1px solid rgb(204, 204, 204); overflow: hidden; color: rgb(160, 160, 160); font-family: &quot;Microsoft YaHei&quot;, &quot;LiHei Pro&quot;; font-size: 12px; text-wrap: wrap;">语音特征提取: 看懂梅尔语谱图(Mel-spectrogram)、梅尔倒频系数(MFCCs)的原理</div><div class="related" style="margin: 0px; padding: 6px 0px; text-size-adjust: none; text-align: center; color: rgb(160, 160, 160); font-family: &quot;Microsoft YaHei&quot;, &quot;LiHei Pro&quot;; font-size: 12px; text-wrap: wrap;"><p style="margin-top: 0px; margin-bottom: 0px; padding: 0px; text-size-adjust: none;"><br/></p></div><div class="text" style="margin: 0px; padding: 10px; text-size-adjust: none; line-height: 23px; color: rgb(160, 160, 160); font-family: &quot;Microsoft YaHei&quot;, &quot;LiHei Pro&quot;; font-size: 12px; text-wrap: wrap;">1. 什么是梅尔语谱图和梅尔倒频系数？<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>机器学习的第一步都是要提取出相应的特征(feature)，如果输入数据是图片，例如28*28的图片，那么只需要把每个像素(pixel)作为特征，对应的像素值大小(代表颜色的强度)作为特征值即可。那么在音频、语音信号处理领域，我们需要将信号转换成对应的语谱图(spectrogram)，将语谱图上的数据作为信号的特征。语谱图的横轴x为时间，纵轴y为频率，(x,y)对应的数值代表在时间x时频率y的幅值。通常的语谱图其频率是线性分布的，但是人耳对频率的感受是对数的(logarithmic)，即对低频段的变化敏感，对高频段的变化迟钝，所以线性分布的语谱图显然在特征提取上会出现“特征不够有用的情况”，因此梅尔语谱图应运而生。梅尔语谱图的纵轴频率和原频率经过如下公式互换:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>其中f代表原本的频率，m代表转换后的梅尔频率，显然，当f很大时，m的变化趋于平缓。而梅尔倒频系数(MFCCs)是在得到梅尔语谱图之后进行余弦变换(DCT,一种类似于傅里叶变换的线性变换），然后取其中一部分系数即可。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2. 梅尔语谱图具体是如何获得的？<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>梅尔语谱图分为以下几个步骤。以一段音乐文件为例，详细展示每一步的原理和对应的Python实现。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.1 获取音频信号<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>python可以用librosa库来读取音频文件，但是对于MP3文件，它会自动调用audio_read函数，所以如果是MP3文件，务必保证将ffmpeg.exe的路径添加到系统环境变量中，不然audio_read函数会出错。这里我们首先读取音频文件，并作出0-20秒的波形。现在的音乐文件采样率通常是44.1kHz。用y和sr分别表示信号和采样率。代码和图形如下：<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>import librosa<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>import numpy as np<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>import matplotlib.pyplot as plt<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>from matplotlib import rcParams<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>import matplotlib.ticker as ticker<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#这是一个画图函数，方便后续作图<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>def personal_plot(x,y):<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.figure(dpi=200,figsize=(12,6))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; rcParams[&#39;font.family&#39;]=&#39;Comic Sans MS&#39;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.plot(x,y)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.xlim(x[0],x[-1])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.xlabel(&#39;time/s&#39;,fontsize=20)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.ylabel(&#39;Amplitude&#39;,fontsize=20)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.xticks(fontsize=16)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.yticks(fontsize=16)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; plt.grid()<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#注意如果文件名不加路径，则文件必须存在于python的工作目录中<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>y,sr = librosa.load(&#39;笑颜.mp3&#39;,sr=None)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#这里只获取0-20秒的部分，这里也可以在上一步的load函数中令duration=20来实现<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>tmax,tmin = 20,0<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>t = np.linspace(tmin,tmax,(tmax-tmin)*sr)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>personal_plot(t,y[tmin*sr:tmax*sr])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.2 信号预加重(pre-emphasis)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>通常来讲语音/音频信号的高频分量强度较小，低频分量强度较大，信号预加重就是让信号通过一个高通滤波器，让信号的高低频分量的强度不至于相差太多。在时域中，对信号x[n]作如下操作：<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>α通常取一个很接近1的值，typical value为0.97或0.95. 从时域公式来看，可能有部分人不懂为啥这是一个高通滤波器，我们从z变换的角度看一下滤波器的transfer function：<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>可以看出滤波器有一个极点0，和一个零点α。当频率为0时，z=1, 放大系数为(1-α）。当频率渐渐增大，放大系数不断变大，当频率到pi时，放大系数为(1+α)。离散域中，[0,pi]对应连续域中的[0, fs/2](单位Hz)。其中fs为采样率，在我们这里是44.1kHz。因此当频率到22000Hz时，放大系数为(1+α)。下面用两段代码和对应的图像给出一个直观感受:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>alpha = 0.97<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>emphasized_y = np.append(y[tmin*sr],y[tmin*sr+1:tmax*sr]-alpha*y[tmin*sr:tmax*sr-1])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>n = int((tmax-tmin)*sr) #信号一共的sample数量<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#未经过预加重的信号频谱<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.figure(dpi=300,figsize=(7,4))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>freq = sr/n*np.linspace(0,n/2,int(n/2)+1)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.plot(freq,np.absolute(np.fft.rfft(y[tmin*sr:tmax*sr],n)**2)/n)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.xlim(0,5000)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.xlabel(&#39;Frequency/Hz&#39;,fontsize=14)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#预加重之后的信号频谱<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.figure(dpi=300,figsize=(7,4))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.plot(freq,np.absolute(np.fft.rfft(emphasized_y,n)**2)/n)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.xlim(0,5000)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.xlabel(&#39;Frequency/Hz&#39;,fontsize=14)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>这两段代码里用了函数librosa.fft.rfft(y,n)，rfft表示经过fft变换之后只取其中一半（因为另一半对应负频率，没有用处）, y对应信号，n对应要做多少点的FFT。我们这里的信号有44.1k*20=882000个点，所以对应的FFT 也做882000点的FFT，每一个点所对应的实际频率是该点的索引值*fs/n，这是咋得出来的？因为第882000个点应该对应(约等于)fs(或者离散域中的2pi)，所以前面的点根据线性关系一一对应即可。这里只展示0-5000Hz，可以看出，经过预加重之后的信号高频分量明显和低频分量的差距没那么大了。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>这样预加重的好处有什么?原文提到了三点:(1)就是我们刚刚提到的平衡一下高频和低频 (2)避免FFT中的数值问题(也就是高频值太小出现在分母的时候可能会出问题) (3)或许可以提高SNR。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.3 分帧(framing)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>预处理完信号之后，要把原信号按时间分成若干个小块，一块就叫一帧(frame)。为啥要做这一步？因为原信号覆盖的时间太长，用它整个来做FFT，我们只能得到信号频率和强度的关系，而失去了时间信息。我们想要得到频率随时间变化的关系，所以将原信号分成若干帧，对每一帧作FFT（又称为短时FFT，因为我们只取了一小段时间)，然后将得到的结果按照时间顺序拼接起来。这就是语谱图(spectrogram)的原理。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>下面定义几个变量:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_size: 每一帧的长度。通常取20-40ms。太长会使时间上的分辨率(time resolution)较小，太小会加重运算成本。这里取25ms.<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_length: 每一帧对应的sample数量。等于fs*frame_size。我们这里是44.1k*0.025=1102.<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_stride: 相邻两帧的间隔。通常间隔必须小于每一帧的长度，即两帧之间要有重叠，否则我们可能会实去两帧边界附近的信息。做特征提取的时候，我们是绝不希望实去有用信息的。 这里取10ms，即有60%的重叠。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_step: 相邻两帧的sample数量。这里是441.<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_num: 整个信号所需要的帧数。一般希望所需要的帧数是个整数值，所以这里要对信号补0(zero padding)让信号的长度正好能分成整数帧。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>具体代码如下:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_size, frame_stride = 0.025,0.01<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_length, frame_step = int(round(sr*frame_size)),int(round(sr*frame_stride))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>signal_length = (tmax-tmin)*sr<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frame_num = int(np.ceil((signal_length-frame_length)/frame_step))+1 #向上舍入<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>pad_frame = (frame_num-1)*frame_step+frame_length-signal_length #不足的部分补零<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>pad_y = np.append(emphasized_y,np.zeros(pad_frame))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>signal_len = signal_length+pad_frame<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.4 加窗(window)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>分帧完毕之后，对每一帧加一个窗函数，以获得较好的旁瓣下降幅度。通常使用hamming window。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>为啥要加窗？要注意，即使我们什么都不加，在分帧的这个过程中也相当于给信号加了矩形窗，学过离散滤波器设计的人应该知道，矩形窗的频谱有很大的旁瓣，时域中将窗函数和原函数相乘，相当于频域的卷积，矩形窗函数和原函数卷积之后，由于旁瓣很大，会造成原信号和加窗之后的对应部分的频谱相差很大，这就是频谱泄露。hamming window有较小的旁瓣，造成的spectral leakage也就较小。代码实现如下：首先定义indices变量，这个变量生成每帧所对应的sample的索引。np.tile函数可以使得array从行或者列扩展。然后定义frames，对应信号在每一帧的值。frames共有1999行，1102列，分别对应一共有1999帧和每一帧有1102个sample。将得到的frames和hamming window直接相乘即可，注意这里不是矩阵乘法。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>indices = np.tile(np.arange(0, frame_length), (frame_num, 1)) + np.tile(<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; np.arange(0, frame_num * frame_step, frame_step), (frame_length, 1)).T<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frames = pad_y[indices] #frame的每一行代表每一帧的sample值<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>frames *= np.hamming(frame_length) #加hamming window 注意这里不是矩阵乘法<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.5 获取功率谱<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>我们在2.4中已经获得了frames变量，其每一行对应每一帧，所以我们分别对每一行做FFT。由于每一行是1102个点的信号，所以可以选择1024点FFT（FFT点数比原信号点数少会降低频率分辨率frequency resolution，但这里相差很小，所以可以忽略）。将得到的FFT变换取其magnitude，并进行平方再除以对应的FFT点数，即可得到功率谱。到这一步我们其实已经得到了spectrogram, 只需要用plt.imshow画出其dB值对应的热力图即可，代码和结果如下:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>NFFT = 1024 #frame_length=1102，所以用1024足够了<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>mag_frames = np.absolute(np.fft.rfft(frames,NFFT))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>pow_frames = mag_frames**2/NFFT<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.figure(dpi=300,figsize=(12,6))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.imshow(20*np.log10(pow_frames[40:].T),cmap=plt.cm.jet,aspect=&#39;auto&#39;)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.yticks([0,128,256,384,512],np.array([0,128,256,384,512])*sr/NFFT)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.6 梅尔滤波器组(Mel-filter banks)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>较后一步是将梅尔滤波器运用到上一步得到的pow_frames上。所谓梅尔滤波器组是一个等高的三角滤波器组，每个滤波器的起始点在上一个滤波器的中点处。其对应的频率在梅尔尺度上是线性的，因此称之为梅尔滤波器组。每个滤波器对应的频率可以将较大频率（下图中是4000，我们这里是22.05k)用上文中提到的公式转换成梅尔频率，在梅尔尺度上线性分成若干个频段，再转换回实际频率尺度即可。实际操作时，将每个滤波器分别和功率谱pow_frames进行点乘，获得的结果即为该频带上的能量(energy)。这里我们的pow_frames是一个(1999,513)的矩阵(这里可能有人疑问513咋来的？我们刚刚做的不是1024点FFT吗？这里注意因为我们用了rfft，只用了非负的那一半频率，所以是1024/2+1个点），梅尔滤波器fbank是一个(mel_N, 513)的矩阵，其中mel_N代表对应的梅尔滤波器个数，这个值不能太大，因为这里我们一共只有513个点，如果mel_N取得太大，会导致前面几个滤波器的长度都是0 (因为低频的梅尔滤波器特别窄)。我们只要将这两个矩阵相乘pow_frames*fbank.T即可得到mel-spectrogram，结果是一个(1999, 40)的矩阵，每一行是一帧，每一列代表对应的梅尔频带的能量。具体梅尔滤波器的图例和计算公式以及对应代码如下:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>其中m代表滤波器的序号，f(m-1)和f(m)、f(m+1)分别对应第m个滤波器的起始点、中间点和结束点。大家一定要注意的一点是，这里的f(m)对应的值不是频率值，而是对应的sample的索引！比如，我们这里较大频率是22050 Hz, 所以22050Hz对应的是第513个sample，即频率f所对应的值是f/fs*NFFT。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>代码中有一段np.where(condition,a,b)，这个函数的功能是检索b中的元素，当condition满足的时候，输出a否则，输出b中的原元素。这一步的操作是为了将其中的全部0值以一个很小的非负值代替，否则在计算dB的时候，log中出现0会出错。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#下面定义mel filter<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>mel_N = 40 #滤波器数量,这个数字若要提高，则NFFT也要相应提高<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>mel_low, mel_high = 0, (2595*np.log10(1+(sr/2)/700))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>mel_freq = np.linspace(mel_low,mel_high,mel_N+2)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>hz_freq = (700 * (10**(mel_freq / 2595) - 1))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>bins = np.floor((NFFT)*hz_freq/sr) #将频率转换成对应的sample位置<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>fbank = np.zeros((mel_N,int(NFFT/2+1))) #每一行储存一个梅尔滤波器的数据<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>for m in range(1, mel_N + 1):<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; f_m_minus = int(bins[m - 1]) &nbsp; # left<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; f_m = int(bins[m]) &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; &nbsp; # center<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; f_m_plus = int(bins[m + 1]) &nbsp; &nbsp;# right<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; for k in range(f_m_minus, f_m):<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; &nbsp; &nbsp; fbank[m - 1, k] = (k - bins[m - 1]) / (bins[m] - bins[m - 1])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; for k in range(f_m, f_m_plus):<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp; &nbsp; &nbsp; &nbsp; fbank[m - 1, k] = (bins[m + 1] - k) / (bins[m + 1] - bins[m])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>filter_banks = np.matmul(pow_frames, fbank.T)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>filter_banks = np.where(filter_banks == 0, np.finfo(float).eps, filter_banks) &nbsp;# np.finfo(float)是较小正值<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>filter_banks = 20 * np.log10(filter_banks) &nbsp;# dB<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>#filter_banks -= np.mean(filter_banks,axis=1).reshape(-1,1)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.figure(dpi=300,figsize=(12,6))<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.imshow(filter_banks[40:].T, cmap=plt.cm.jet,aspect=&#39;auto&#39;)<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>plt.yticks([0,10,20,30,39],[0,1200,3800,9900,22000])<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>较后，得到的mel-spectrogram如下:<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>2.7 Mel-spectogram feature<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>机器学习的时候，每一个音频段即可用对应的mel-spectogram表示，每一帧对应的某个频段即为一个feature。因此我们一共获得了1999*40个feature和对应的值。实际操作中，每个音频要采用同样的长度，这样我们的feature数量才是相同的。通常还要进行归一化，即每一帧的每个元素要减去该帧的平均值，以保证每一帧的均值均为0.<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>3. MFCCs原理<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>得到了梅尔语谱图，想得到MFCCs就很简单了。首先，为啥要用MFCCs? 因为2中得到的梅尔谱系数是互相关的，在一些机器学习算法中可能会出问题，因为有些算法假设数据不存在互相关性。因此，可以用DCT变换来压缩梅尔谱，得到一组不相关的系数。DCT在图像压缩领域很常见，大家可以自己查阅相关资料其原理。在语音识别中，得到的梅尔倒频系数只保存前2-13个，剩下的不用，因为研究表明其他系数代表了系数中高阶的变化，在ASR中没啥用。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>当然，更深层次的原因是MFCC是倒谱系数，所谓倒谱系数，就是对log之后的梅尔谱系数进行DCT变换，其实相当于将实际上是频域的信号当成时域信号强行进行频域变换，得到的是频域信号在伪频域的幅频相应，前2-13个系数代表的是包络，因为他们在伪频域上是低频信号，所以在前面，后面的系数是伪频域的高频信号，代表的是spectral details，在语音识别的时候，对我们帮助更大的是包络，因为包含了formants等信息。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>4. 总结<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>总的来说，过去在HMM、GMM等模型用的比较火的时候，多将MFCC用于特征提取，因为当时的机器学习算法有相应的不足。如今较热门的是以神经网络为代表的深度学习算法，神经网络内部复杂，在训练的过程中可以在网络内部将互相关的问题弱化，也因此DCT变换显得有些多余，何况还会提高计算量，而且DCT作为一种线性变换，有可能会导致损失信号中一些非线性信息。因此，如今Mel-spectogram用的更多。<br style="margin: 0px; padding: 0px; text-size-adjust: none;"/><br style="margin: 0px; padding: 0px; text-size-adjust: none;"/>&nbsp;</div><p><br/></p>                    </section>
                                                                                                    					        <div class='met-page p-y-30 border-top1'>
            <div class="container p-t-30 ">
                <ul class="pagination block blocks-2 text-xs-center text-sm-left">
                    <li class='page-item m-b-0 '>
                        <a href='../news/news65.htm' title="数据恢复的办理分为甲乙两个等级" class='page-link text-truncate'>
                            上一篇                            <span aria-hidden="true" class='hidden-xs-down'>: 数据恢复的办理分为甲乙两个等级</span>
                        </a>
                    </li>
                    <li class='page-item m-b-0 '>
                        <a href='../news/news41.htm' title="从电子数据存证的角度，浅谈怎样有效规避网络购物风险" class='page-link pull-xs-right text-truncate'>
                            下一篇                            <span aria-hidden="true" class='hidden-xs-down'>: 从电子数据存证的角度，浅谈怎样有效规避网络购物风险</span>
                        </a>
                    </li>
                </ul>
            </div>
        </div>


                </div>
            </div>


            <!-- sidebar met_83_1 -->
            <div class="col-md-3">
                <div class="row">

                    
                    <aside class="met-sidebar panel panel-body m-b-0" boxmh-h m-id='news_bar' m-type='nocontent'>
                        <div class="sidebar-search" data-placeholder="search">
                                            <form method="get" class="page-search-form" role="search" action="../news/index.php?lang=cn" m-id="search_column" m-type="nocontent">
            <input type="hidden" name="lang" value="cn" />
            <input type="hidden" name="stype" value="0" />
            <input type="hidden" name="search" value="search" /><input type="hidden" name="class1" value="128" />
            <div class="input-search input-search-dark">
                <button type="submit" class="input-search-btn"><i class="icon wb-search" aria-hidden="true"></i></button>
                <input
                type="text"
                class="form-control input-lg"
                name="content"
                value=""
                placeholder="收费标准"
                >
            </div>
        </form>                        </div>

                                                      <ul class="sidebar-column list-icons">
                                                                <li>
                                    <a href="../news/index.htm" title="常见问题" class="  " target='_self'>常见问题</a>
                                </li>
                                                                <li>
                                                                          <a href="../news/news_140_1.htm" title="最新发布" class=''>最新发布</a>
                                                                    </li>
                                                                <li>
                                                                          <a href="javascript:;" title="使用教程" class='' target='_self' data-toggle="collapse" data-target=".sidebar-column3-1">使用教程<i class="wb-chevron-right-mini"></i></a>
                                    <div class="sidebar-column3-1 collapse" aria-expanded="false">
                                        <ul class="m-t-5 p-l-20">
                                            <li><a href="../news/news_141_1.htm" target='_self' title="全部" class="">全部</a></li>
                                                                                        <li><a href="../news/news_184_1.htm" target='_self' title="FE资源模型" class=''>FE资源模型</a></li>
                                                                                        <li><a href="../news/news_185_1.htm" target='_self' title="FE技术手册" class=''>FE技术手册</a></li>
                                                                                    </ul>
                                    </div>
                                                                    </li>
                                                                <li>
                                                                          <a href="../news/news_142_1.htm" title="有问必答" class='active'>有问必答</a>
                                                                    </li>
                                                                                            </ul>
                                                                              <div class="sidebar-news-list recommend">
                                <h3 class='font-size-16 m-0'>为您推荐</h3>
                                <ul class="list-group list-group-bordered m-t-10 m-b-0">
                                                                                                            <li class="list-group-item">
                                                                                  <a class="imga" href="../img/img73.htm" title="电子数据鉴定" target=_blank>
                                                <img src="../upload/thumb_src/800_500/1692430067.jpg" alt="电子数据鉴定" style="max-width:100%">
                                            </a>
                                                                                        <a href="../img/img73.htm" title="电子数据鉴定" target=_blank>电子数据鉴定</a>
                                        </li>
                                                                            <li class="list-group-item">
                                                                                  <a class="imga" href="../img/img74.htm" title="存在性鉴定" target=_blank>
                                                <img src="../upload/thumb_src/800_500/1692430012.jpg" alt="存在性鉴定" style="max-width:100%">
                                            </a>
                                                                                        <a href="../img/img74.htm" title="存在性鉴定" target=_blank>存在性鉴定</a>
                                        </li>
                                                                            <li class="list-group-item">
                                                                                  <a class="imga" href="../img/img76.htm" title="同一性鉴定" target=_blank>
                                                <img src="../upload/thumb_src/800_500/1692430459.jpg" alt="同一性鉴定" style="max-width:100%">
                                            </a>
                                                                                        <a href="../img/img76.htm" title="同一性鉴定" target=_blank>同一性鉴定</a>
                                        </li>
                                                                    </ul>
                            </div>
                                            </aside>
                    

                </div>
            </div>



        </div>

        </div>
    </div>
</main>
<footer class='met-foot-info border-top1' m-id='met_foot' m-type="foot">
    <div class="met-footnav text-xs-center p-b-20" m-id='noset' m-type='foot_nav'>
    <div class="container">
        <div class="row mob-masonry">
            <!-- 栏目调用 -->
            <div class="col-lg-6 col-xs-12 left_lanmu">
                <div class="row">
                                                              <div class="col-lg-3 col-md-3 col-xs-6 list masonry-item foot-nav">
                        <h4 class='font-size-20 m-t-0'>
                            <a href="../product/index.htm" target='_self' title="产品中心">产品中心</a>
                        </h4>
                                                  <ul class='ulstyle m-b-0'>
                                                        <li>
                                <a href="../product/product_136_1.htm" target='_self' title="数据库取证">数据库取证</a>
                            </li>
                                                        <li>
                                <a href="../product/product_137_1.htm" target='_blank' title="芯片取证">芯片取证</a>
                            </li>
                                                        <li>
                                <a href="../product/product_139_1.htm" target='_self' title="视频取证">视频取证</a>
                            </li>
                                                        <li>
                                <a href="../product/product_177_1.htm" target='_self' title="手机取证">手机取证</a>
                            </li>
                                                        <li>
                                <a href="../product/product_176_1.htm" target='_self' title="数据恢复">数据恢复</a>
                            </li>
                                                    </ul>
                                            </div>
                                                                                  <div class="col-lg-3 col-md-3 col-xs-6 list masonry-item foot-nav">
                        <h4 class='font-size-20 m-t-0'>
                            <a href="../img/index.htm" target='_self' title="司法鉴定">司法鉴定</a>
                        </h4>
                                                  <ul class='ulstyle m-b-0'>
                                                        <li>
                                <a href="../img/img_169_1.htm" target='_self' title="电子数据">电子数据</a>
                            </li>
                                                        <li>
                                <a href="../img/img_170_1.htm" target='_self' title="声纹鉴定">声纹鉴定</a>
                            </li>
                                                        <li>
                                <a href="../img/img_171_1.htm" target='_self' title="图像鉴定">图像鉴定</a>
                            </li>
                                                        <li>
                                <a href="../img/img_172_1.htm" target='_self' title="质量鉴定">质量鉴定</a>
                            </li>
                                                    </ul>
                                            </div>
                                                                                  <div class="col-lg-3 col-md-3 col-xs-6 list masonry-item foot-nav">
                        <h4 class='font-size-20 m-t-0'>
                            <a href="../news/index.htm" target='_self' title="常见问题">常见问题</a>
                        </h4>
                                                  <ul class='ulstyle m-b-0'>
                                                        <li>
                                <a href="../news/news_140_1.htm" target='_self' title="最新发布">最新发布</a>
                            </li>
                                                        <li>
                                <a href="../news/news_141_1.htm" target='_self' title="使用教程">使用教程</a>
                            </li>
                                                        <li>
                                <a href="../news/news_142_1.htm" target='_self' title="有问必答">有问必答</a>
                            </li>
                                                    </ul>
                                            </div>
                                                                                  <div class="col-lg-3 col-md-3 col-xs-6 list masonry-item foot-nav">
                        <h4 class='font-size-20 m-t-0'>
                            <a href="../download/index.htm" target='_self' title="下载中心">下载中心</a>
                        </h4>
                                                  <ul class='ulstyle m-b-0'>
                                                        <li>
                                <a href="../download/download_173_1.htm" target='_self' title="软件下载">软件下载</a>
                            </li>
                                                        <li>
                                <a href="../download/download_166_1.htm" target='_self' title="产品说明书">产品说明书</a>
                            </li>
                                                        <li>
                                <a href="../download/download_168_1.htm" target='_self' title="培训资料">培训资料</a>
                            </li>
                                                    </ul>
                                            </div>
                                                                                                                                                                                      </div>
            </div>
            <!-- 栏目调用 -->
            <!-- 关注我们二维码 -->
            <div class="col-lg-3 col-md-6 col-xs-12 info masonry-item" m-type="nocontent">
                <h4 class='font-size-20 m-t-0'>
                    关注我们                </h4>
                <div class="erweima row">
                    <div class="imgbox1 col-xs-6">
                        <div class="inline-block text-xs-center">
                            <img src='../upload/thumb_src/112_112/1691328998.jpg' alt='西数科技-科技创造无限可能'>
                            <p class="weixintext">关注公众号</p>
                        </div>
                    </div>
                    <div class="imgbox2 col-xs-6">
                        <div class="inline-block text-xs-center">
                            <img src='../upload/thumb_src/112_112/1691329317.jpg' alt='西数科技-科技创造无限可能'>
                            <p class="weixintext">微信客服</p>
                        </div>
                    </div>
                </div>
            </div>
            <!-- 关注我们二维码 -->
            <!-- 联系我们 -->
            <div class="col-lg-3 col-md-6 col-xs-12 info masonry-item font-size-20" m-id='met_contact' m-type="nocontent">
                                      <p class='font-size-20'>联系我们</p>
                                                      <p class="font-size-24">
                        <a href="tel:4006184118" title="4006184118">4006184118</a>
                    </p>
                                                      <p class="font-size-16 weekbox">
                        周一至周五 09：00-18：00                    </p>
                                                      <a class="p-r-5" id="met-weixin" data-plugin="webuiPopover" data-trigger="hover" data-animation="pop" data-placement='top' data-width='155' data-padding='0' data-content="<div class='text-xs-center'>
                        <img src='../upload/202308/1691328998.jpg' alt='西数科技-科技创造无限可能' width='150' height='150' id='met-weixin-img'></div>
                    ">
                        <i class="fa fa-weixin"></i>
                    </a>
                                                  <a
                                  href="http://crm2.qq.com/page/portalpage/wpa.php?uin={276570401&aty=0&a=0&curl=&ty=1"
                                rel="nofollow" target="_blank" class="p-r-5">
                    <i class="fa fa-qq"></i>
                </a>
                                                  <a href="https://weibo.com/metinfo" rel="nofollow" target="_blank" class="p-r-5">
                    <i class="fa fa-weibo"></i>
                </a>
                                                                                                    </div>
            <!-- 联系我们 -->
        </div>
    </div>
</div>
    <!--友情链接-->
              <div class="met-link text-xs-center p-y-10" m-id='noset' m-type='link'>
        <div class="container">
            <ul class="breadcrumb p-0 link-img m-0">
                <li class='breadcrumb-item'>友情链接 :</li>
                                            <li class='breadcrumb-item'>
                        <a href="http://www.wdsos.com" title="司法鉴定" rel="nofollow" target="_blank">
                                                              <span>西数科技</span>
                                                    </a>
                    </li>
                                    <li class='breadcrumb-item'>
                        <a href="https://www.moj.gov.cn/" title="司法" rel="nofollow" target="_blank">
                                                              <span>司法部</span>
                                                    </a>
                    </li>
                                    <li class='breadcrumb-item'>
                        <a href="https://gat.jiangsu.gov.cn/" title="电子数据,司法鉴定" rel="nofollow" target="_blank">
                                                              <span>江苏公安厅</span>
                                                    </a>
                    </li>
                                    <li class='breadcrumb-item'>
                        <a href="https://gaj.nanjing.gov.cn/" title="电子取证" rel="nofollow" target="_blank">
                                                              <span>南京公安局</span>
                                                    </a>
                    </li>
                                    <li class='breadcrumb-item'>
                        <a href="http://www.jsqa.org/" title="质量鉴定" rel="nofollow" target="_blank">
                                                              <span>江苏质量协会</span>
                                                    </a>
                    </li>
                                    <li class='breadcrumb-item'>
                        <a href="http://www.huaian.gov.cn/" title="网站模板" rel="nofollow" target="_blank">
                                                              <span>淮安市政府</span>
                                                    </a>
                    </li>
                            </ul>
        </div>
    </div>
        <!--友情链接-->
    <div class="copy p-y-10 border-top1">
        <div class="container text-xs-center">
                              <div class="met_footright">
                    <span>©版权所有 2000-2023</span>&nbsp;
                                              <span>西数科技（江苏）有限公司</span>&nbsp;
                                                                  <span>地址：中国南京玄武区珠江路435号601室 邮箱：wd@wdsos.com</span>
                                    </div>
                                          <div><p>联系电话：025-83608636 025-86883952</p><p><a href="https://beian.miit.gov.cn" target="_blank" title="工信部" textvalue="备案号：苏ICP备09074223号 苏公网安备：32010202010982号">备案号：苏ICP备09074223号 苏公网安备：32010202010982号</a></p></div>
                                      <div class="powered_by_metinfo">技术支持：<b><a rel=nofollow href=https://www.mituo.cn target=_blank title=米拓建站>米拓建站 7.9</a></b> </div>
                                                                                      </div>
        </div>
    </div>
</footer>
<div class="met-menu-list text-xs-center   " m-id="noset" m-type="menu">
    <div class="main">
                    <div style="background-color: #298dff;">
                <a href="http://www.jsos.cn" class="item"   target="_blank" style="color: #ffffff;">
                    <i class="icon fa-home"></i>
                    <span>首页</span>
                </a>
            </div>
                    <div style="background-color: #298dff;">
                <a href="../product/index.htm" class="item"   target="_blank" style="color: #ffffff;">
                    <i class="icon fa-product-hunt"></i>
                    <span>产品</span>
                </a>
            </div>
                    <div style="background-color: #298dff;">
                <a href="http://www.98825.com" class="item"   target="_blank" style="color: #ffffff;">
                    <i class="icon fa-th-large"></i>
                    <span>鉴定</span>
                </a>
            </div>
                    <div style="background-color: #298dff;">
                <a href="tel:../13813824669" class="item"   target="_blank" style="color: #ffffff;">
                    <i class="icon fa-address-book-o"></i>
                    <span>联系</span>
                </a>
            </div>
            </div>
</div>

<input type="hidden" name="met_lazyloadbg" value="">
<script src="../cache/lang_json_cn.js?1716775274"></script>
<script src="../public/web/js/basic.js?1713669838" data-js_url="../templates/metv75/cache/metinfo.js?1713670248" id="met-page-js"></script>
<div class="bdsharebuttonbox">
    <a href="#" class="bds_more" data-cmd="more"></a>
    <a href="#" class="bds_qzone" data-cmd="qzone" title="分享到QQ空间"></a>
    <a href="#" class="bds_tsina" data-cmd="tsina" title="分享到新浪微博"></a>
    <a href="#" class="bds_tqq" data-cmd="tqq" title="分享到腾讯微博"></a>
    <a href="#" class="bds_renren" data-cmd="renren" title="分享到人人网"></a>
    <a href="#" class="bds_weixin" data-cmd="weixin" title="分享到微信"></a>
</div>
<script>window._bd_share_config={"common":{"bdSnsKey":{},"bdText":"","bdMini":"2","bdMiniList":false,"bdPic":"","bdStyle":"0","bdSize":"16"},"share":{},"p_w_picpath":{"viewList":["qzone","tsina","tqq","renren","weixin"],"viewText":"分享到：","viewSize":"24"},"selectShare":{"bdContainerClass":null,"bdSelectMiniList":["qzone","tsina","tqq","renren","weixin"]}};with(document)0[(getElementsByTagName('head')[0]||body).appendChild(createElement('script')).src='http://bdimg.share.baidu.com/static/api/js/share.js?v=89860593.js?cdnversion='+~(-new Date()/36e5)];</script>
<textarea name="met_online_data" hidden>{"status":1,"html":"<style>\r\n.onlinebox_two{background-color: transparent;box-shadow: none;}\r\n.onlinebox_two .met-online-box {width: 54px; height: 54px; position: relative; margin-bottom: 1px;}\r\n.onlinebox_two .online-item  {position: absolute; width: 54px; height: 54px; top: 0; right: 0;transition: all.3s; opacity: .8; filter: Alpha(opacity = 80); color: #fff; font-size: 14px;line-height: 54px;overflow: hidden;}\r\n.onlinebox_two .online-item i{font-size: 20px; width: 54px;}\r\n.onlinebox .onlinebox-open {display: block; cursor: pointer; padding: 0 10px; font-size: 18px; line-height: 40px; color: #fff; }\r\n.onlinebox_two .onlineclose{display: none;}\r\n.met-online-modal .close{position: absolute;right: 5px;top: 0;opacity: 1;}\r\n#onlinebox .met-online-box:hover .online-item{width: 130px;opacity: 1;filter:Alpha(opacity=100);}\r\n#onlinebox .met-online-box:hover .online-item i{width: 30px;}\r\n@media (min-width: 768px){\r\n.onlinebox_two .onlinebox-open{display: none;}\r\n}\r\n@media (max-width: 767px){\r\n.onlinebox_two .onlinebox_two_list{display: none;position: relative;}\r\n.onlinebox_two .onlineclose{font-style: initial;color: #fff;opacity: .8;position: absolute; right: -5px; top: -15px; border-radius: 50%; width: 25px;\r\n    height: 25px;line-height:25px; display: block;font-family: arial;z-index: 10;}\r\n}\r\n<\/style>\r\n<div id='onlinebox'  class=\"onlinebox onlinebox_two hide\" m-type='online' m-id='online'>\r\n\t<div class=\"onlinebox-open text-xs-center\" id=\"onlinebox-open\" style=\"background:#1baadb;\"> <i class=\"fa fa-comments-o\"><\/i>\r\n\t<\/div>\r\n\t        \t  \t\t<div class=\"onlinebox_two_list\"> <i class=\"onlineclose font-size-20 text-xs-center\" style=\"background:#1baadb;\">x<\/i>\r\n\t\t\t        \t\t\t\t  \t\t\t\t\t<div class=\"met-online-box\">\r\n\t\t\t\t\t\t<a href=\"https:\/\/wpa.qq.com\/msgrd?v=3&uin=276570401&site=qq&menu=yes\" title=\"276570401\" class=\"online-item text-xs-center\" target=\"_blank\" style=\"background-color: #1baadb;\">\r\n\t\t\t\t\t\t\t<i class=\"icon fa-qq\"><\/i>\r\n\t\t\t\t\t\t\t<span>售前客服<\/span>\r\n\t\t\t\t\t\t<\/a>\r\n\t\t\t\t\t<\/div>\r\n\t\t\t\t\t\t\t\t\t\t\t  \t\t\t\t\t<div class=\"met-online-box\">\r\n\t\t\t\t\t\t<a href=\"tel:18651607829\" title=\"18651607829\" class=\"online-item text-xs-center\" target=\"_blank\" style=\"background-color: #1baadb;\">\r\n\t\t\t\t\t\t\t<i class=\"icon fa-phone-square\"><\/i>\r\n\t\t\t\t\t\t\t<span>服务热线<\/span>\r\n\t\t\t\t\t\t<\/a>\r\n\t\t\t\t\t<\/div>\r\n\t\t\t\t\t\t\t\t\t\t\t  \t\t\t\t\t<div class=\"met-online-box\">\r\n\t\t\t\t\t\t<a class=\"online-item text-xs-center met-weixin\" style=\"background-color: #1baadb;\" href=\"javascript:void(0)\" data-toggle=\"modal\" data-target=\"#met-weixin2\">\r\n\t\t\t\t\t\t\t<i class=\"icon fa-wechat\"><\/i>\r\n\t\t\t\t\t\t\t<span>微信客服<\/span>\r\n\t\t\t\t\t\t<\/a>\r\n\t\t\t\t\t<\/div>\r\n\t\t\t\t\t\t\t\t\t\t\t\t  \t\t\t\t\t<div class=\"met-online-box\">\r\n\t\t\t\t\t\t<a href=\"skype:jointnj?chat\" title=\"jointnj\" class=\"online-item text-xs-center\" target=\"_blank\" style=\"background-color: #1baadb;\">\r\n\t\t\t\t\t\t\t<i class=\"icon fa-skype\"><\/i>\r\n\t\t\t\t\t\t\t<span>skype<\/span>\r\n\t\t\t\t\t\t<\/a>\r\n\t\t\t\t\t<\/div>\r\n\t\t\t\t\t\t\t\t\t<\/div>\r\n\t<\/div>\r\n        \t  \t  \t  \t\t<div class=\"modal fade met-online-modal\" id=\"met-weixin2\">\r\n\t\t\t<div class=\"modal-dialog modal-center modal-sm\">\r\n\t\t\t\t<div class=\"modal-content\">\r\n\t\t\t\t\t<div class=\"modal-header text-xs-center\">\r\n\t\t\t\t\t\t<button type=\"button\" class=\"close\" data-dismiss=\"modal\" aria-hidden=\"true\">&times;<\/button>\r\n\t\t\t\t\t\t<img src=\"http:\/\/www.jsos.cn\/upload\/202308\/1692432262.jpg\" alt=\"微信客服\" style=\"max-width: 100%;\" \/>\r\n\t\t\t\t\t<\/div>\r\n\t\t\t\t<\/div>\r\n\t\t\t\t<!-- \/.modal-content -->\r\n\t\t\t<\/div>\r\n\t\t\t<!-- \/.modal -->\r\n\t\t<\/div>\r\n\t\t  <script>\r\n$(function(){\r\n \t$(\"#onlinebox-open\").click(function(){\r\n        $(\"#onlinebox\").find(\".onlinebox_two_list\").show();\r\n        $(this).hide();\r\n    });\r\n    $(\".onlineclose\").click(function(){\r\n      $(\"#onlinebox\").find(\".onlinebox_two_list\").hide();\r\n      $(\"#onlinebox-open\").show();\r\n    });\r\n});\r\n<\/script>","t":"1","x":"10","y":"400"}</textarea>
</body><a href="http://wap.962200.net.cn/sitemap.xml"</a>
</html>