Replies: 12 comments 10 replies
-
Also if this Is probably too fresh It could be interesting to collect some strong self-supervised performer like ReLICv2 I also want to try to figure out where we will place common reusable components between Keras-cv and keras-nlp on the emerging thread of multimodal/unified models like: Or |
Beta Was this translation helpful? Give feedback.
-
(IMO) Some arch. that should be included are as follows (currently the following models are not included in Image Classification
Object Detection
Semantic Segmentation (backbone: image classification model) Instance and Panoptic Segmentation
|
Beta Was this translation helpful? Give feedback.
-
Apart from standard architectures, It would be great to see some smaller architectures, that are more suitable for mobile as well:
|
Beta Was this translation helpful? Give feedback.
-
A pointer to few OCR (text detection-recognition) models. |
Beta Was this translation helpful? Give feedback.
-
@LukeWood Is there any possibility to support 3D modeling? It becomes obvious to have for example in deep-learning-in-medicine practice. FYI, there's some unofficial support, segmentation_models_3D and classification_models_3D. Here is one nice toolbox for PyTorch, MONAI. |
Beta Was this translation helpful? Give feedback.
-
Some modern https://github.com/locuslab/convmixer https://discuss.tensorflow.org/t/research-mlp-mixer-an-all-mlp-architecture-for-vision/1849/7 |
Beta Was this translation helpful? Give feedback.
-
It would be great to include models from each architectural pillar in deep learning CV. Examples are, but not limited to:
To begin with, I think it would be good to identify what corresponds to the ResNet for each pillar. I.e. what is the most known or widely-used model within that type of architecture, exactly like what ResNet is for convolution based architectures. I think it would be great to implement these "core" models to begin with, and then only implement other variants that has a significant level of incremental changes or performance increases. |
Beta Was this translation helpful? Give feedback.
-
@LukeWood wdyt? Introducing TorchVision’s New Multi-Weight Support API |
Beta Was this translation helpful? Give feedback.
-
If a project doesn't have peer-reviewed paper but consistently making strong impact to the community, will it be considered to add? For example, model like yolo-v5, yolo-v6, yolo-v7 doesn't have paperwork but very impactful. (I wonder why such practice even gets accepted by the community, they should have paperwork first, IMO). |
Beta Was this translation helpful? Give feedback.
-
There is a nice code repository maintaining regarding dense pixel labeling tasks from google research. It would be great if it's shifted towards here in |
Beta Was this translation helpful? Give feedback.
-
[Info] Towards Grand Unification of Object Tracking |
Beta Was this translation helpful? Give feedback.
-
https://arxiv.org/abs/2208.10442 @LukeWood I think that we need to talk also with Keras-NLP on where/how to handle these models trend. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Hey KerasCV contributors,
I'm starting this discussion thread as a place to throw out ideas for which model architectures should/shouldn't be included in KerasCV. The answer for some architectures feel more obvious. I.e. RegNets, ResNets, WideResNet, all feel like strong includes. Some are much less obvious.
Feel free to post links, ideas, or thoughts relating to model architectures in this discussions.
Beta Was this translation helpful? Give feedback.
All reactions