ivanjx
diff --git a/‎docker-compose.yml‎
Lines changed: 5 additions & 3 deletions b/‎docker-compose.yml‎
Lines changed: 5 additions & 3 deletions
diff --git a/‎docker/main/build_nginx.sh‎
Lines changed: 1 addition & 1 deletion b/‎docker/main/build_nginx.sh‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docker/main/rootfs/usr/local/nginx/conf/nginx.conf‎
Lines changed: 1 addition & 1 deletion b/‎docker/main/rootfs/usr/local/nginx/conf/nginx.conf‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docker/tensorrt/Dockerfile.base‎
Lines changed: 14 additions & 1 deletion b/‎docker/tensorrt/Dockerfile.base‎
Lines changed: 14 additions & 1 deletion
diff --git a/‎docker/tensorrt/detector/rootfs/etc/ld.so.conf.d/cuda_tensorrt.conf‎
Lines changed: 1 addition & 2 deletions b/‎docker/tensorrt/detector/rootfs/etc/ld.so.conf.d/cuda_tensorrt.conf‎
Lines changed: 1 addition & 2 deletions
diff --git a/‎docs/docs/configuration/bird_classification.md‎
Lines changed: 31 additions & 0 deletions b/‎docs/docs/configuration/bird_classification.md‎
Lines changed: 31 additions & 0 deletions
diff --git a/‎docs/docs/configuration/face_recognition.md‎
Lines changed: 35 additions & 19 deletions b/‎docs/docs/configuration/face_recognition.md‎
Lines changed: 35 additions & 19 deletions
@@ -1,7 +1,8 @@
 services:
   devcontainer:
     container_name: frigate-devcontainer
-    # add groups from host for render, plugdev, video
+    # Check host system's actual render/video/plugdev group IDs with 'getent group render', 'getent group video', and 'getent group plugdev'
+    # Must add these exact IDs in container's group_add section or OpenVINO GPU acceleration will fail
     group_add:
       - "109" # render
       - "110" # render
@@ -35,6 +36,7 @@ services:
      # - /dev/bus/usb:/dev/bus/usb # Uncomment for Google Coral USB
   mqtt:
     container_name: mqtt
-    image: eclipse-mosquitto:1.6
+    image: eclipse-mosquitto:2.0
+    command: mosquitto -c /mosquitto-no-auth.conf # enable no-auth mode
     ports:
-      - "1883:1883"
+      - "1883:1883"
@@ -2,7 +2,7 @@
 
 set -euxo pipefail
 
-NGINX_VERSION="1.25.3"
+NGINX_VERSION="1.27.4"
 VOD_MODULE_VERSION="1.31"
 SECURE_TOKEN_MODULE_VERSION="1.5"
 SET_MISC_MODULE_VERSION="v0.33"
 
@@ -30,7 +30,7 @@ http {
 
     gzip on;
     gzip_comp_level 6;
-    gzip_types text/plain text/css application/json application/x-javascript application/javascript text/javascript image/svg+xml image/x-icon image/bmp image/png image/gif image/jpeg image/jpg;
+    gzip_types text/plain text/css application/json application/x-javascript application/javascript text/javascript image/svg+xml image/x-icon image/bmp;
     gzip_proxied no-cache no-store private expired auth;
     gzip_vary on;
 
 
@@ -21,7 +21,20 @@ RUN --mount=type=bind,source=docker/tensorrt/detector/tensorrt_libyolo.sh,target
 RUN mkdir -p /usr/local/cuda-deps
 RUN if [ "$TARGETARCH" = "amd64" ]; then \
       cp /usr/local/cuda-12.3/targets/x86_64-linux/lib/libcurand.so.* /usr/local/cuda-deps/ && \
-      cp /usr/local/cuda-12.3/targets/x86_64-linux/lib/libnvrtc.so.* /usr/local/cuda-deps/ ; \
+      cp /usr/local/cuda-12.3/targets/x86_64-linux/lib/libnvrtc.so.* /usr/local/cuda-deps/ && \
+      cd /usr/local/cuda-deps/ && \
+      for lib in libnvrtc.so.*; do \
+        if [[ "$lib" =~ libnvrtc.so\.([0-9]+\.[0-9]+\.[0-9]+) ]]; then \
+          version="${BASH_REMATCH[1]}"; \
+          ln -sf "libnvrtc.so.$version" libnvrtc.so; \
+        fi; \
+      done && \
+      for lib in libcurand.so.*; do \
+        if [[ "$lib" =~ libcurand.so\.([0-9]+\.[0-9]+\.[0-9]+\.[0-9]+) ]]; then \
+          version="${BASH_REMATCH[1]}"; \
+          ln -sf "libcurand.so.$version" libcurand.so; \
+        fi; \
+      done; \
     fi
 
 # Frigate w/ TensorRT Support as separate image
 
@@ -1,8 +1,7 @@
 /usr/local/lib
 /usr/local/cuda
+/usr/local/lib/python3.11/dist-packages/tensorrt
 /usr/local/lib/python3.11/dist-packages/nvidia/cudnn/lib
 /usr/local/lib/python3.11/dist-packages/nvidia/cuda_runtime/lib
 /usr/local/lib/python3.11/dist-packages/nvidia/cublas/lib
-/usr/local/lib/python3.11/dist-packages/nvidia/cuda_nvrtc/lib
-/usr/local/lib/python3.11/dist-packages/tensorrt
 /usr/local/lib/python3.11/dist-packages/nvidia/cufft/lib
@@ -0,0 +1,31 @@
+---
+id: bird_classification
+title: Bird Classification
+---
+
+Bird classification identifies known birds using a quantized Tensorflow model. When a known bird is recognized, its common name will be added as a `sub_label`. This information is included in the UI, filters, as well as in notifications.
+
+## Minimum System Requirements
+
+Bird classification runs a lightweight tflite model on the CPU, there are no significantly different system requirements than running Frigate itself.
+
+## Model
+
+The classification model used is the MobileNet INat Bird Classification, [available identifiers can be found here.](https://raw.githubusercontent.com/google-coral/test_data/master/inat_bird_labels.txt)
+
+## Configuration
+
+Bird classification is disabled by default, it must be enabled in your config file before it can be used. Bird classification is a global configuration setting.
+
+```yaml
+classification:
+  bird:
+    enabled: true
+```
+
+## Advanced Configuration
+
+Fine-tune bird classification with these optional parameters:
+
+- `threshold`: Classification confidence score required to set the sub label on the object.
+  - Default: `0.9`.
@@ -7,21 +7,26 @@ Face recognition identifies known individuals by matching detected faces with pr
 
 ## Model Requirements
 
-Frigate has support for CV2 Local Binary Pattern Face Recognizer to recognize faces, which runs locally. A lightweight face landmark detection model is also used to align faces before running them through the face recognizer.
+### Face Detection
 
-Users running a Frigate+ model (or any custom model that natively detects faces) should ensure that `face` is added to the [list of objects to track](../plus/#available-label-types) either globally or for a specific camera. This will allow face detection to run at the same time as object detection and be more efficient.
+When running a Frigate+ model (or any custom model that natively detects faces) should ensure that `face` is added to the [list of objects to track](../plus/#available-label-types) either globally or for a specific camera. This will allow face detection to run at the same time as object detection and be more efficient.
 
-Users without a model that detects faces can still run face recognition. Frigate uses a lightweight DNN face detection model that runs on the CPU. In this case, you should _not_ define `face` in your list of objects to track.
+When running a default COCO model or another model that does not include `face` as a detectable label, face detection will run via CV2 using a lightweight DNN model that runs on the CPU. In this case, you should _not_ define `face` in your list of objects to track.
 
-:::note
+### Face Recognition
 
-Frigate needs to first detect a `face` before it can recognize a face.
+Frigate has support for two face recognition model types:
 
-:::
+- **small**: Frigate will run a FaceNet embedding model to recognize faces, which runs locally on the CPU. This model is optimized for efficiency and is not as accurate.
+- **large**: Frigate will run a large ArcFace embedding model that is optimized for accuracy. It is only recommended to be run when an integrated or dedicated GPU is available.
+
+In both cases, a lightweight face landmark detection model is also used to align faces before running recognition.
 
 ## Minimum System Requirements
 
-Face recognition is lightweight and runs on the CPU, there are no significantly different system requirements than running Frigate itself.
+The `small` model is optimized for efficiency and runs on the CPU, most CPUs should run the model efficiently.
+
+The `large` model is optimized for accuracy, an integrated or discrete GPU is highly recommended.
 
 ## Configuration
 
@@ -47,12 +52,15 @@ Fine-tune face recognition with these optional parameters:
 
 ### Recognition
 
+- `model_size`: Which model size to use, options are `small` or `large`
+- `unknown_score`: Min score to mark a person as a potential match, matches at or below this will be marked as unknown.
+  - Default: `0.8`.
 - `recognition_threshold`: Recognition confidence score required to add the face to the object as a sub label.
   - Default: `0.9`.
 - `blur_confidence_filter`: Enables a filter that calculates how blurry the face is and adjusts the confidence based on this.
   - Default: `True`.
 
-## Dataset
+## Creating a Robust Training Set
 
 The number of images needed for a sufficient training set for face recognition varies depending on several factors:
 
@@ -61,11 +69,9 @@ The number of images needed for a sufficient training set for face recognition v
 
 However, here are some general guidelines:
 
-- Minimum: For basic face recognition tasks, a minimum of 10-20 images per person is often recommended.
-- Recommended: For more robust and accurate systems, 30-50 images per person is a good starting point.
-- Ideal: For optimal performance, especially in challenging conditions, 100 or more images per person can be beneficial.
-
-## Creating a Robust Training Set
+- Minimum: For basic face recognition tasks, a minimum of 5-10 images per person is often recommended.
+- Recommended: For more robust and accurate systems, 20-30 images per person is a good starting point.
+- Ideal: For optimal performance, especially in challenging conditions, 50-100 images per person can be beneficial.
 
 The accuracy of face recognition is heavily dependent on the quality of data given to it for training. It is recommended to build the face training library in phases.
 
@@ -76,8 +82,9 @@ When choosing images to include in the face training set it is recommended to al
 - If it is difficult to make out details in a persons face it will not be helpful in training.
 - Avoid images with extreme under/over-exposure.
 - Avoid blurry / pixelated images.
-- Be careful when uploading images of people when they are wearing clothing that covers a lot of their face as this may confuse the model.
-- Do not upload too many similar images at the same time, it is recommended to train no more than 4-6 similar images for each person to avoid overfitting.
+- Avoid training on infrared (gray-scale). The models are trained on color images and will be able to extract features from gray-scale images.
+- Using images of people wearing hats / sunglasses may confuse the model.
+- Do not upload too many similar images at the same time, it is recommended to train no more than 4-6 similar images for each person to avoid over-fitting.
 
 :::
 
@@ -87,7 +94,7 @@ When first enabling face recognition it is important to build a foundation of st
 
 Then it is recommended to use the `Face Library` tab in Frigate to select and train images for each person as they are detected. When building a strong foundation it is strongly recommended to only train on images that are straight-on. Ignore images from cameras that recognize faces from an angle.
 
-Aim to strike a balance between the quality of images while also having a range of conditions (day / night, different weather conditions, different times of day, etc.) in order to have diversity in the images used for each person and not have overfitting.
+Aim to strike a balance between the quality of images while also having a range of conditions (day / night, different weather conditions, different times of day, etc.) in order to have diversity in the images used for each person and not have over-fitting.
 
 Once a person starts to be consistently recognized correctly on images that are straight-on, it is time to move on to the next step.
 
@@ -97,13 +104,22 @@ Once straight-on images are performing well, start choosing slightly off-angle i
 
 ## FAQ
 
-### Why is every face tagged as a known face and not unknown?
+### Why can't I bulk upload photos?
 
-Any recognized face with a score >= `min_score` will show in the `Train` tab along with the recognition score. A low scoring face is effectively the same as `unknown`, but includes more information. This does not mean the recognition is not working well, and is part of the importance of choosing the correct `recognition_threshold`.
+It is important to methodically add photos to the library, bulk importing photos (especially from a general photo library) will lead to over-fitting in that particular scenario and hurt recognition performance.
+
+### Why can't I bulk reprocess faces?
+
+Face embedding models work by breaking apart faces into different features. This means that when reprocessing an image, only images from a similar angle will have its score affected.
 
 ### Why do unknown people score similarly to known people?
 
-This can happen for a few different reasons, but this is usually an indicator that the training set needs to be improved. This is often related to overfitting:
+This can happen for a few different reasons, but this is usually an indicator that the training set needs to be improved. This is often related to over-fitting:
+
 - If you train with only a few images per person, especially if those images are very similar, the recognition model becomes overly specialized to those specific images.
 - When you provide images with different poses, lighting, and expressions, the algorithm extracts features that are consistent across those variations.
 - By training on a diverse set of images, the algorithm becomes less sensitive to minor variations and noise in the input image.
+
+### I see scores above the threshold in the train tab, but a sub label wasn't assigned?
+
+The Frigate considers the recognition scores across all recognition attempts for each person object. The scores are continually weighted based on the area of the face, and a sub label will only be assigned to person if a person is confidently recognized consistently. This avoids cases where a single high confidence recognition would throw off the results.