avinashkranjan
diff --git a/‎audio_detect/LICENSE
Lines changed: 21 additions & 0 deletions b/‎audio_detect/LICENSE
Lines changed: 21 additions & 0 deletions
diff --git a/‎audio_detect/README.md
Lines changed: 66 additions & 0 deletions b/‎audio_detect/README.md
Lines changed: 66 additions & 0 deletions
diff --git a/‎audio_detect/data/game_sound.zip
7.12 MB b/‎audio_detect/data/game_sound.zip
7.12 MB
diff --git a/‎audio_detect/data/test.wav
31.5 KB b/‎audio_detect/data/test.wav
31.5 KB
diff --git a/‎audio_detect/pic/all_API.png
74.6 KB b/‎audio_detect/pic/all_API.png
74.6 KB
diff --git a/‎audio_detect/pic/insert.png
91.4 KB b/‎audio_detect/pic/insert.png
91.4 KB
diff --git a/‎audio_detect/pic/search.png
96 KB b/‎audio_detect/pic/search.png
96 KB
diff --git a/‎audio_detect/webserver/audio/__init__.py b/‎audio_detect/webserver/audio/__init__.py
diff --git a/‎audio_detect/webserver/audio/common/__init__.py b/‎audio_detect/webserver/audio/common/__init__.py
diff --git a/‎audio_detect/webserver/audio/common/config.py
Lines changed: 17 additions & 0 deletions b/‎audio_detect/webserver/audio/common/config.py
Lines changed: 17 additions & 0 deletions
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2020 adolf69
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
@@ -0,0 +1,66 @@
+:exclamation::exclamation: **This repo will no longer be maintained, please visit https://github.com/milvus-io/bootcamp** :exclamation: :exclamation:
+
+# Audio search system with Milvus
+
+This project uses [PANNs](https://github.com/qiuqiangkong/audioset_tagging_cnn)(Large-Scale Pretrained Audio Neural Networks) for Audio Pattern Recognition to perform audio tagging and sound event detection, finally obtaining audio embeddings. Then this project uses [Milvus](https://milvus.io/docs/v0.11.0/overview.md) to search for similar audio clips.
+
+## Local Deployment
+
+### Requirements
+
+- [Milvus 0.10.5](https://milvus.io/docs/v0.10.5/milvus_docker-cpu.md) (please note the Milvus version)
+- [MySQL](https://hub.docker.com/r/mysql/mysql-server)
+- [Python3](https://www.python.org/downloads/)
+
+### Run Server
+
+1. **Install python requirements**
+
+   ```bash
+   $ cd bootcamp/solutions/audio_search/webserver/
+   $ pip install -r audio_requirements.txt
+   ```
+
+2. **Modify configuration parameters**
+
+   Before running the script, please modify the parameters in **webserver/audio/common/config.py**:
+
+   | Parameter    | Description               | Default setting |
+   | ------------ | ------------------------- | --------------- |
+   | MILVUS_HOST  | milvus service ip address | 127.0.0.1       |
+   | MILVUS_PORT  | milvus service port       | 19530           |
+   | MYSQL_HOST   | mysql service ip     | 127.0.0.1       |
+   | MYSQL_PORT   | mysql service port   | 3306            |
+   | MYSQL_USER   | mysql user name      | root            |
+   | MYSQL_PWD    | mysql password       | 123456          |
+   | MYSQL_DB     | mysql datebase name  | mysql           |
+   | MILVUS_TABLE | default table name        | milvus_audio    |
+
+3. **Star server**
+
+   ```bash
+   $ cd webserver
+   $ python main.py
+   ```
+
+## System Usage
+
+Type `127.0.0.1:8002/docs` in your browser to see all the APIs.
+
+![](./pic/all_API.png)
+
+- Insert data.
+
+  Download the sample [game_sound.zip](https://github.com/shiyu22/bootcamp/blob/0.11.0/solutions/audio_search/data/game_sound.zip?raw=true) and upload it into the system.
+
+  > The sound data in the zip archive must be in wav format.
+
+  ![](./pic/insert.png)
+
+- Search for similar audio clips.
+
+  You can upload [test.wav](https://github.com/shiyu22/bootcamp/blob/0.11.0/solutions/audio_search/data/test.wav) to search for the most similar sound clips.
+  
+  ![](./pic/search.png)
+
+Please refer to https://zilliz.com/demos/ to take a try in the front-end interface.
@@ -0,0 +1,17 @@
+import os
+from milvus import *
+
+MILVUS_HOST = os.getenv("MILVUS_HOST", "127.0.0.1")
+MILVUS_PORT = os.getenv("MILVUS_PORT", 19530)
+VECTOR_DIMENSION = os.getenv("VECTOR_DIMENSION", 2048)
+METRIC_TYPE = os.getenv("METRIC_TYPE", MetricType.IP)
+TOP_K = os.getenv("TOP_K", 100)
+
+UPLOAD_PATH = os.getenv("UPLOAD_PATH", "./tmp")
+DEFAULT_TABLE = os.getenv("DEFAULT_TABLE", "milvus_audio")
+
+MYSQL_HOST = os.getenv("MYSQL_HOST", "127.0.0.1")
+MYSQL_PORT = os.getenv("MYSQL_PORT", 3306)
+MYSQL_USER = os.getenv("MYSQL_USER", "root")
+MYSQL_PWD = os.getenv("MYSQL_PWD", "123456")
+MYSQL_DB = os.getenv("MYSQL_DB", "mysql")