我正在尝试在我正在开发的应用程序中实现心跳录音功能.
这样做的首选方法是使用iPhone的相机打开灯,让用户将手指放在镜头上,并检测视频输入中与用户心脏相对应的波动.
我发现了一个非常好的起点,这里有以下堆栈溢出问题
该问题提供了绘制心跳时间图的有用代码.
它显示了如何启动AVCaptureSession并打开相机的指示灯,如下所示:
session = [[AVCaptureSession alloc] init];
AVCaptureDevice* camera = [AVCaptureDevice defaultDeviceWithMediaType:AVMediaTypeVideo];
if([camera isTorchModeSupported:AVCaptureTorchModeOn]) {
    [camera lockForConfiguration:nil];
    camera.torchMode=AVCaptureTorchModeOn;
    //  camera.exposureMode=AVCaptureExposureModeLocked;
    [camera unlockForConfiguration];
}
// Create a AVCaptureInput with the camera device
NSError *error=nil;
AVCaptureInput* cameraInput = [[AVCaptureDeviceInput alloc] initWithDevice:camera error:&error];
if (cameraInput == nil) {
    NSLog(@"Error to create camera capture:%@",error);
}
// Set the output
AVCaptureVideoDataOutput* videoOutput = [[AVCaptureVideoDataOutput alloc] init];
// create a queue to run the capture on
dispatch_queue_t captureQueue=dispatch_queue_create("catpureQueue", NULL);
// setup …我正在尝试使用Python来检索实时音频输入的主要频率.目前我正在尝试使用音频流我的笔记本电脑的内置麦克风,但在测试以下代码时,我的结果非常糟糕.
    # Read from Mic Input and find the freq's
    import pyaudio
    import numpy as np
    import bge
    import wave
    chunk = 2048
    # use a Blackman window
    window = np.blackman(chunk)
    # open stream
    FORMAT = pyaudio.paInt16
    CHANNELS = 1
    RATE = 1920
    p = pyaudio.PyAudio()
    myStream = p.open(format = FORMAT, channels = CHANNELS, rate = RATE, input = True, frames_per_buffer = chunk)
    def AnalyseStream(cont):
        data = myStream.read(chunk)
        # unpack the data and times by the hamming window
        indata = …我想要实现的目标如下:我需要声音文件(.wav)的频率值进行分析.我知道很多程序会给出值的可视图(谱图),但我需要原始数据.我知道这可以通过FFT完成,并且应该在python中相当容易编写脚本但不确定如何完全执行它.因此,假设文件中的信号长度为.4s,那么我希望多次测量,为程序测量的每个时间点以及它找到的值(频率)(以及可能的功率(dB))提供输出作为数组.复杂的是我想分析鸟歌,它们经常有谐波或信号超出频率范围(例如1000-2000赫兹).我希望程序也能输出这些信息,因为这对于我想对数据做的分析非常重要:)
现在有一段看起来非常像我想要的代码,但我认为它并没有给我所有我想要的价值....(感谢Justin Peel将这个问题发布到另一个问题:))所以我聚集在一起我需要numpy和pyaudio但不幸的是我不熟悉python所以我希望Python专家可以帮助我吗?
源代码:
# Read in a WAV and find the freq's
import pyaudio
import wave
import numpy as np
chunk = 2048
# open up a wave
wf = wave.open('test-tones/440hz.wav', 'rb')
swidth = wf.getsampwidth()
RATE = wf.getframerate()
# use a Blackman window
window = np.blackman(chunk)
# open stream
p = pyaudio.PyAudio()
stream = p.open(format =
                p.get_format_from_width(wf.getsampwidth()),
                channels = wf.getnchannels(),
                rate = RATE,
                output = True)
# read some data
data = wf.readframes(chunk)
# play stream and find the …我正在编写代码来分析语音所唱的单个音频。我需要一种方法来分析音符的频率。目前,我正在使用PyAudio录制音频文件,该文件存储为.wav,然后立即播放。
import numpy as np
import pyaudio
import wave
# open up a wave
wf = wave.open('file.wav', 'rb')
swidth = wf.getsampwidth()
RATE = wf.getframerate()
# use a Blackman window
window = np.blackman(chunk)
# open stream
p = pyaudio.PyAudio()
stream = p.open(format =
                p.get_format_from_width(wf.getsampwidth()),
                channels = wf.getnchannels(),
                rate = RATE,
                output = True)
# read some data
data = wf.readframes(chunk)
print(len(data))
print(chunk*swidth)
# play stream and find the frequency of each chunk
while len(data) == chunk*swidth:
    # write data …