Browse Source
Update readme
Signed-off-by: Jael Gu <mengjia.gu@zilliz.com>
main
1 changed files with
5 additions and
6 deletions
-
README.md
|
@ -9,23 +9,22 @@ This operator uses reads the waveform of an audio file and then applies VGGish t |
|
|
## Interface |
|
|
## Interface |
|
|
|
|
|
|
|
|
```python |
|
|
```python |
|
|
__call__(self, audio_path: str) |
|
|
|
|
|
|
|
|
__call__(self, datas: List[NamedTuple('data', [('audio', 'ndarray'), ('sample_rate', 'int')])]) |
|
|
``` |
|
|
``` |
|
|
|
|
|
|
|
|
**Args:** |
|
|
**Args:** |
|
|
|
|
|
|
|
|
- audio_path: |
|
|
|
|
|
- the input audio path |
|
|
|
|
|
- supported types: str |
|
|
|
|
|
|
|
|
- datas: |
|
|
|
|
|
- a named tuple including audio data in numpy.ndarray and sample rate in integer |
|
|
|
|
|
|
|
|
**Returns:** |
|
|
**Returns:** |
|
|
|
|
|
|
|
|
The Operator returns a tuple Tuple[('embs', numpy.ndarray)] containing following fields: |
|
|
The Operator returns a tuple Tuple[('embs', numpy.ndarray)] containing following fields: |
|
|
|
|
|
|
|
|
- embs: |
|
|
|
|
|
|
|
|
- vec: |
|
|
- embeddings of the audio |
|
|
- embeddings of the audio |
|
|
- data type: `numpy.ndarray` |
|
|
- data type: `numpy.ndarray` |
|
|
- shape: (num_clips,128) |
|
|
|
|
|
|
|
|
- shape: (num_clips, 128) |
|
|
|
|
|
|
|
|
## Requirements |
|
|
## Requirements |
|
|
|
|
|
|
|
|