#nanollava β Public Fediverse posts
Live and recent posts from across the Fediverse tagged #nanollava, aggregated by home.social.
-
Edge-Ready #Vision Language Model Advances Visual #AI Processing π
π§ #OmniVision (968M params) sets new benchmark as world's smallest #VisionLanguageModel
π Architecture combines #Qwen2 (0.5B) for text & #SigLIP (400M) for vision processing
π‘ Key Innovations:
β’ 9x token reduction (729 β 81) for faster processing
β’ Enhanced accuracy through #DPO training
β’ Only 988MB RAM & 948MB storage required
β’ Outperforms #nanoLLAVA across multiple benchmarksπ― Use Cases:
β’ Image analysis & description
β’ Visual memory assistance
β’ Recipe generation from food images
β’ Technical documentation supportTry it now: https://huggingface.co/spaces/NexaAIDev/omnivlm-dpo-demo
Source: https://nexa.ai/blogs/omni-vision -
Edge-Ready #Vision Language Model Advances Visual #AI Processing π
π§ #OmniVision (968M params) sets new benchmark as world's smallest #VisionLanguageModel
π Architecture combines #Qwen2 (0.5B) for text & #SigLIP (400M) for vision processing
π‘ Key Innovations:
β’ 9x token reduction (729 β 81) for faster processing
β’ Enhanced accuracy through #DPO training
β’ Only 988MB RAM & 948MB storage required
β’ Outperforms #nanoLLAVA across multiple benchmarksπ― Use Cases:
β’ Image analysis & description
β’ Visual memory assistance
β’ Recipe generation from food images
β’ Technical documentation supportTry it now: https://huggingface.co/spaces/NexaAIDev/omnivlm-dpo-demo
Source: https://nexa.ai/blogs/omni-vision -
Edge-Ready #Vision Language Model Advances Visual #AI Processing π
π§ #OmniVision (968M params) sets new benchmark as world's smallest #VisionLanguageModel
π Architecture combines #Qwen2 (0.5B) for text & #SigLIP (400M) for vision processing
π‘ Key Innovations:
β’ 9x token reduction (729 β 81) for faster processing
β’ Enhanced accuracy through #DPO training
β’ Only 988MB RAM & 948MB storage required
β’ Outperforms #nanoLLAVA across multiple benchmarksπ― Use Cases:
β’ Image analysis & description
β’ Visual memory assistance
β’ Recipe generation from food images
β’ Technical documentation supportTry it now: https://huggingface.co/spaces/NexaAIDev/omnivlm-dpo-demo
Source: https://nexa.ai/blogs/omni-vision -
Edge-Ready #Vision Language Model Advances Visual #AI Processing π
π§ #OmniVision (968M params) sets new benchmark as world's smallest #VisionLanguageModel
π Architecture combines #Qwen2 (0.5B) for text & #SigLIP (400M) for vision processing
π‘ Key Innovations:
β’ 9x token reduction (729 β 81) for faster processing
β’ Enhanced accuracy through #DPO training
β’ Only 988MB RAM & 948MB storage required
β’ Outperforms #nanoLLAVA across multiple benchmarksπ― Use Cases:
β’ Image analysis & description
β’ Visual memory assistance
β’ Recipe generation from food images
β’ Technical documentation supportTry it now: https://huggingface.co/spaces/NexaAIDev/omnivlm-dpo-demo
Source: https://nexa.ai/blogs/omni-vision -
Edge-Ready #Vision Language Model Advances Visual #AI Processing π
π§ #OmniVision (968M params) sets new benchmark as world's smallest #VisionLanguageModel
π Architecture combines #Qwen2 (0.5B) for text & #SigLIP (400M) for vision processing
π‘ Key Innovations:
β’ 9x token reduction (729 β 81) for faster processing
β’ Enhanced accuracy through #DPO training
β’ Only 988MB RAM & 948MB storage required
β’ Outperforms #nanoLLAVA across multiple benchmarksπ― Use Cases:
β’ Image analysis & description
β’ Visual memory assistance
β’ Recipe generation from food images
β’ Technical documentation supportTry it now: https://huggingface.co/spaces/NexaAIDev/omnivlm-dpo-demo
Source: https://nexa.ai/blogs/omni-vision