#vlm — Public Fediverse posts on home.social

adingbatponder :nixos: 👾 @[email protected] · 2026-05-08 · 10:52 UTC

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

#ocr #vlm #llm #tesseract #apitokens #chatgpt

adingbatponder :nixos: 👾 @adingbatponder · 2026-05-08 · 10:52 UTC

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

#ocr #vlm #llm #tesseract #apitokens #chatgpt

adingbatponder :nixos: 👾 @[email protected] · 2026-05-08 · 10:52 UTC

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

#ocr #vlm #llm #tesseract #apitokens #chatgpt

adingbatponder :nixos: 👾 @[email protected] · 2026-05-08 · 10:52 UTC

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

#chatgpt #apitokens #tesseract #llm #vlm #ocr

adingbatponder :nixos: 👾 @[email protected] · 2026-05-08 · 10:52 UTC

Is there a FOSS model that does #OCR using #VLM #LLM machine vision learning tricks to get text of tricky docs where #tesseract fails? Using #apitokens to get #chatgpt & co to do it is stomach-turning....

#ocr #vlm #llm #tesseract #apitokens #chatgpt

Emre Sokullu :verified: @[email protected] · 2026-05-07 · 11:36 UTC

Browser agent için 8 gorsel LLM'i ekran goruntusu temellendirmede kıyasladık.

Şaşırtıcı bulgu: Qwen 3.5-9B, 308B parametreli MiMo V2.5'in kaçırdığı bir dropdown affordance'ını doğru sınıflandırıyor. Affordance parametre sayısıyla ölçeklenmiyor.

8 modelden sadece 1'i (Qwen 3.6-35B-A3B) kalibrasyonda dürüst belirsizlik gösteriyor.

Detaylı yazı + VRAM önerileri:
https://webbrain.one/blog

GitHub'da ⭐ atarsanız çok seviniriz 🙏
https://github.com/esokullu/webbrain

#LocalLLM #VLM #AIAgents #Qwen #AI #yapayzeka

#localllm #vlm #aiagents #qwen #ai #yapayzeka

Emre Sokullu :verified: @[email protected] · 2026-05-07 · 11:36 UTC

Browser agent için 8 gorsel LLM'i ekran goruntusu temellendirmede kıyasladık.

Şaşırtıcı bulgu: Qwen 3.5-9B, 308B parametreli MiMo V2.5'in kaçırdığı bir dropdown affordance'ını doğru sınıflandırıyor. Affordance parametre sayısıyla ölçeklenmiyor.

8 modelden sadece 1'i (Qwen 3.6-35B-A3B) kalibrasyonda dürüst belirsizlik gösteriyor.

Detaylı yazı + VRAM önerileri:
https://webbrain.one/blog

GitHub'da ⭐ atarsanız çok seviniriz 🙏
https://github.com/esokullu/webbrain

#LocalLLM #VLM #AIAgents #Qwen #AI #yapayzeka

#localllm #vlm #aiagents #qwen #ai #yapayzeka

Emre Sokullu :verified: @[email protected] · 2026-05-07 · 11:36 UTC

Browser agent için 8 gorsel LLM'i ekran goruntusu temellendirmede kıyasladık.

Şaşırtıcı bulgu: Qwen 3.5-9B, 308B parametreli MiMo V2.5'in kaçırdığı bir dropdown affordance'ını doğru sınıflandırıyor. Affordance parametre sayısıyla ölçeklenmiyor.

8 modelden sadece 1'i (Qwen 3.6-35B-A3B) kalibrasyonda dürüst belirsizlik gösteriyor.

Detaylı yazı + VRAM önerileri:
https://webbrain.one/blog

GitHub'da ⭐ atarsanız çok seviniriz 🙏
https://github.com/esokullu/webbrain

#LocalLLM #VLM #AIAgents #Qwen #AI #yapayzeka

#localllm #vlm #aiagents #qwen #ai #yapayzeka

Emre Sokullu :verified: @[email protected] · 2026-05-07 · 11:36 UTC

Browser agent için 8 gorsel LLM'i ekran goruntusu temellendirmede kıyasladık.

Şaşırtıcı bulgu: Qwen 3.5-9B, 308B parametreli MiMo V2.5'in kaçırdığı bir dropdown affordance'ını doğru sınıflandırıyor. Affordance parametre sayısıyla ölçeklenmiyor.

8 modelden sadece 1'i (Qwen 3.6-35B-A3B) kalibrasyonda dürüst belirsizlik gösteriyor.

Detaylı yazı + VRAM önerileri:
https://webbrain.one/blog

GitHub'da ⭐ atarsanız çok seviniriz 🙏
https://github.com/esokullu/webbrain

#LocalLLM #VLM #AIAgents #Qwen #AI #yapayzeka

#yapayzeka #ai #qwen #aiagents #vlm #localllm

Emre Sokullu :verified: @[email protected] · 2026-05-07 · 11:36 UTC

Browser agent için 8 gorsel LLM'i ekran goruntusu temellendirmede kıyasladık.

Şaşırtıcı bulgu: Qwen 3.5-9B, 308B parametreli MiMo V2.5'in kaçırdığı bir dropdown affordance'ını doğru sınıflandırıyor. Affordance parametre sayısıyla ölçeklenmiyor.

8 modelden sadece 1'i (Qwen 3.6-35B-A3B) kalibrasyonda dürüst belirsizlik gösteriyor.

Detaylı yazı + VRAM önerileri:
https://webbrain.one/blog

GitHub'da ⭐ atarsanız çok seviniriz 🙏
https://github.com/esokullu/webbrain

#LocalLLM #VLM #AIAgents #Qwen #AI #yapayzeka

#localllm #vlm #aiagents #qwen #ai #yapayzeka

adingbatponder :nixos: 👾 @adingbatponder · 2026-05-05 · 20:00 UTC

What #VLM is the best for #OCR of tricky input such as tables or just for high fidelity output in general?

#vlm #ocr

Habr @[email protected] · 2026-05-02 · 08:32 UTC

Робот, способный создать себя сам. Режим «Инженера» в робототехнике

Скажите роботу «настрой манипулятор» — и он напишет драйвер сам. Звучит как фантастика из тех самых фильмов 80-х и 90-х, но мы уже реализовали это в OpenGrall. Рассказываю, как работает режим Инженера и почему последнее слово всегда остаётся за человеком

https://habr.com/ru/articles/1030526/

#LLM #VLM #робототехника #OpenGrall #ИИ #Python #WebSocket #YandexGPT #DeepSeek #самокодинг

#самокодинг #deepseek #yandexgpt #websocket #python #ии

deepseek @[email protected] · 2026-05-02 · 08:18 UTC

Робот, способный создать себя сам. Режим «Инженера» в робототехнике Скажите роботу «настрой манипулятор» ...

#LLM #VLM #робототехника #OpenGrall #ИИ #Python #WebSocket #YandexGPT #DeepSeek #самокодинг

Origin | Interest | Match

#llm #vlm #робототехника #opengrall #ии #python

Habr @[email protected] · 2026-04-10 · 13:42 UTC

Как гибрид IDP и VLM экономит миллионы на верификации данных

Последние 2 года мы в Content AI активно тестируем Vision Language Models (VLM) для обработки документов. Модели вроде Qwen2.5-VL или Gemini 2.5 отлично работают с простыми формами — чеками, типовыми договорами. Но на документах со сложными фонами, многоуровневыми таблицами или нестандартной версткой VLM часто галлюцинирует, теряет строки и путается в реквизитах. В одной из предыдущих статей мы пришли к выводу, что будущее за комбинированным подходом , когда VLM усиливает IDP-решения. В этот раз мы проверили гипотезу: пусть VLM не распознает документ с нуля, а проверяет черновик из IDP-системы и исправляет ошибки, опираясь на исходное изображение. Базовым OCR движком выступила наша платформа ContentCapture. Практическая цель эксперимента — автоматизировать верификацию документов. Сейчас в крупных компаниях сотни операторов вручную сверяют распознанные данные с оригиналами.

https://habr.com/ru/companies/contentai/articles/1021880/

#idp #llmмодели #vlm #ocr #ocrтехнологии

#ocrтехнологии #ocr #vlm #llmмодели #idp

Habr @[email protected] · 2026-04-06 · 14:32 UTC

WACV 2026 в Тусоне: конференция, пустыня и немного экзистенции

Привет, Хабр! Я — Максим Куркин из лаборатории FusionBrain AIRI. Когда мне сказали «поедешь на WACV», первая мысль была — отлично, конференция. Вторая мысль — Тусон, Аризона. Пустыня Сонора. Кактусы‑сагуаро высотой с двухэтажный дом. +25°C в начале марта, когда в Москве ещё лежит снег. Круто! В итоге я провёл в командировке девять дней — с 5 по 13 марта. Два дня дороги в каждую сторону, пять дней конференции, немного пустыни вокруг. Поездка получилась насыщенной: и по науке, и по ощущениям, и очень хочется поделиться увиденным!

https://habr.com/ru/companies/airi/articles/1018010/

#WACV_2026 #Computer_Vision #Машинное_обучение #Искусственный_интеллект #Конференции #Vision_Encoders #Deep_Learning #Интерпретируемость_нейросетей #VLM

#vlm #интерпретируемость_нейросетей #deep_learning #vision_encoders #конференции #искусственный_интеллект

Anita Graser 🇪🇺🇺🇦🇬🇪 @[email protected] · 2026-03-14 · 17:10 UTC

RE: https://mastodon.social/@xlth/116144192667591833

Not the ideal conditions for geospatial applications of VLMs 😅

#GIScience #VLM #spatiotemporal #MobilityDataScience #SpatialDataScience

#giscience #vlm #spatiotemporal #mobilitydatascience #spatialdatascience

Anita Graser 🇪🇺🇺🇦🇬🇪 @underdarkGIS · 2026-03-14 · 17:10 UTC

RE: https://mastodon.social/@xlth/116144192667591833

Not the ideal conditions for geospatial applications of VLMs 😅

#GIScience #VLM #spatiotemporal #MobilityDataScience #SpatialDataScience

#giscience #vlm #spatiotemporal #mobilitydatascience #spatialdatascience

Anita Graser 🇪🇺🇺🇦🇬🇪 @[email protected] · 2026-03-14 · 17:10 UTC

RE: https://mastodon.social/@xlth/116144192667591833

Not the ideal conditions for geospatial applications of VLMs 😅

#GIScience #VLM #spatiotemporal #MobilityDataScience #SpatialDataScience

#spatialdatascience #mobilitydatascience #spatiotemporal #vlm #giscience

Anita Graser 🇪🇺🇺🇦🇬🇪 @[email protected] · 2026-03-14 · 17:10 UTC

RE: https://mastodon.social/@xlth/116144192667591833

Not the ideal conditions for geospatial applications of VLMs 😅

#GIScience #VLM #spatiotemporal #MobilityDataScience #SpatialDataScience

#giscience #vlm #spatiotemporal #mobilitydatascience #spatialdatascience

💧🌏 Greg Cocks @[email protected] · 2026-03-10 · 03:17 UTC

Vertical Land Motion And Human Exposure Across India's Coastal Regions
--
https://doi.org/10.1029/2025GL120539 <-- shared paper 🔗
--
https://www.indiaspend.com/climate-change/indias-coastal-cities-face-heavy-flooding-risk-due-to-sea-level-rise-970906 <-- shared media article 🔗
--
#SeaLevelRise #Subsidence #India #InSAR #radar #Postdoc #GIS #spatial #mapping #inSAR #LandSubsidence #remotesensing #coast #coastal #coastline #India #earthobservation #subsidence #rise #urban #city #SLR #ClimateChange #spatialanalysis #spatiotemporal #marine #ocean #water #hydrology #risk #hazard #humanimpacts #flood #flooding #model #modeling #floodrisk #infrastructure #damage #costs #economics #verticallandmotion #VLM #ESA #Sentinel #Ahmedabad #Chennai #Amaravathi #Kochi #Kakinada #Kolkata #deltas #estuary #demographics #population #coastalsubsidence #landuse #planning #mitigation #farmland #agriculture #foodsecurity #groundwater #pumping #extraction

#sealevelrise #subsidence #india #insar #radar #postdoc

💧🌏 Greg Cocks @[email protected] · 2026-03-10 · 03:17 UTC

Vertical Land Motion And Human Exposure Across India's Coastal Regions
--
https://doi.org/10.1029/2025GL120539 <-- shared paper 🔗
--
https://www.indiaspend.com/climate-change/indias-coastal-cities-face-heavy-flooding-risk-due-to-sea-level-rise-970906 <-- shared media article 🔗
--
#SeaLevelRise #Subsidence #India #InSAR #radar #Postdoc #GIS #spatial #mapping #inSAR #LandSubsidence #remotesensing #coast #coastal #coastline #India #earthobservation #subsidence #rise #urban #city #SLR #ClimateChange #spatialanalysis #spatiotemporal #marine #ocean #water #hydrology #risk #hazard #humanimpacts #flood #flooding #model #modeling #floodrisk #infrastructure #damage #costs #economics #verticallandmotion #VLM #ESA #Sentinel #Ahmedabad #Chennai #Amaravathi #Kochi #Kakinada #Kolkata #deltas #estuary #demographics #population #coastalsubsidence #landuse #planning #mitigation #farmland #agriculture #foodsecurity #groundwater #pumping #extraction

#sealevelrise #subsidence #india #insar #radar #postdoc

💧🌏 Greg Cocks @[email protected] · 2026-03-10 · 03:17 UTC

Vertical Land Motion And Human Exposure Across India's Coastal Regions
--
https://doi.org/10.1029/2025GL120539 <-- shared paper 🔗
--
https://www.indiaspend.com/climate-change/indias-coastal-cities-face-heavy-flooding-risk-due-to-sea-level-rise-970906 <-- shared media article 🔗
--
#SeaLevelRise #Subsidence #India #InSAR #radar #Postdoc #GIS #spatial #mapping #inSAR #LandSubsidence #remotesensing #coast #coastal #coastline #India #earthobservation #subsidence #rise #urban #city #SLR #ClimateChange #spatialanalysis #spatiotemporal #marine #ocean #water #hydrology #risk #hazard #humanimpacts #flood #flooding #model #modeling #floodrisk #infrastructure #damage #costs #economics #verticallandmotion #VLM #ESA #Sentinel #Ahmedabad #Chennai #Amaravathi #Kochi #Kakinada #Kolkata #deltas #estuary #demographics #population #coastalsubsidence #landuse #planning #mitigation #farmland #agriculture #foodsecurity #groundwater #pumping #extraction

#sealevelrise #subsidence #india #insar #radar #postdoc

💧🌏 Greg Cocks @[email protected] · 2026-03-10 · 03:17 UTC

Vertical Land Motion And Human Exposure Across India's Coastal Regions
--
https://doi.org/10.1029/2025GL120539 <-- shared paper 🔗
--
https://www.indiaspend.com/climate-change/indias-coastal-cities-face-heavy-flooding-risk-due-to-sea-level-rise-970906 <-- shared media article 🔗
--
#SeaLevelRise #Subsidence #India #InSAR #radar #Postdoc #GIS #spatial #mapping #inSAR #LandSubsidence #remotesensing #coast #coastal #coastline #India #earthobservation #subsidence #rise #urban #city #SLR #ClimateChange #spatialanalysis #spatiotemporal #marine #ocean #water #hydrology #risk #hazard #humanimpacts #flood #flooding #model #modeling #floodrisk #infrastructure #damage #costs #economics #verticallandmotion #VLM #ESA #Sentinel #Ahmedabad #Chennai #Amaravathi #Kochi #Kakinada #Kolkata #deltas #estuary #demographics #population #coastalsubsidence #landuse #planning #mitigation #farmland #agriculture #foodsecurity #groundwater #pumping #extraction

#extraction #pumping #groundwater #foodsecurity #agriculture #farmland

💧🌏 Greg Cocks @GregCocks · 2026-03-10 · 03:17 UTC

Vertical Land Motion And Human Exposure Across India's Coastal Regions
--
https://doi.org/10.1029/2025GL120539 <-- shared paper 🔗
--
https://www.indiaspend.com/climate-change/indias-coastal-cities-face-heavy-flooding-risk-due-to-sea-level-rise-970906 <-- shared media article 🔗
--
#SeaLevelRise #Subsidence #India #InSAR #radar #Postdoc #GIS #spatial #mapping #inSAR #LandSubsidence #remotesensing #coast #coastal #coastline #India #earthobservation #subsidence #rise #urban #city #SLR #ClimateChange #spatialanalysis #spatiotemporal #marine #ocean #water #hydrology #risk #hazard #humanimpacts #flood #flooding #model #modeling #floodrisk #infrastructure #damage #costs #economics #verticallandmotion #VLM #ESA #Sentinel #Ahmedabad #Chennai #Amaravathi #Kochi #Kakinada #Kolkata #deltas #estuary #demographics #population #coastalsubsidence #landuse #planning #mitigation #farmland #agriculture #foodsecurity #groundwater #pumping #extraction

#sealevelrise #subsidence #india #insar #radar #postdoc

Manc AvGeek @[email protected] · 2026-02-23 · 12:48 UTC

Manchester Monday 23rd February 2026.

[…]

https://mancavgeek.co.uk/2026/02/23/manchester-monday-23rd-february-2026/

#a220 #a310 #a350 #airbaltic #airbus #airlittoral

notanowl @[email protected] · 2026-02-08 · 03:25 UTC

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb
hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness #complexity #psychology #jung #philosophy #astronomy #cosmology #physics #chemistry #biology #books #literature #anthropology #jrpg #retrogaming #survival #outdoors #hunting #homesteading #gardening
i need to enrich my feed reader.

#hunting #lazyweb #homelab #selfhosting #linux #opensource

notanowl @[email protected] · 2026-02-08 · 03:25 UTC

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb

hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness #complexity #psychology #jung #philosophy #astronomy #cosmology #physics #chemistry #biology #books #literature #anthropology #jrpg #retrogaming #survival #outdoors #hunting #homesteading #gardening

i need to enrich my feed reader.

#hunting #lazyweb #homelab #selfhosting #linux #opensource

notanowl @[email protected] · 2026-02-08 · 03:25 UTC

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb

hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness #complexity #psychology #jung #philosophy #astronomy #cosmology #physics #chemistry #biology #books #literature #anthropology #jrpg #retrogaming #survival #outdoors #hunting #homesteading #gardening

i need to enrich my feed reader.

#hunting #lazyweb #homelab #selfhosting #linux #opensource

notanowl @[email protected] · 2026-02-08 · 03:25 UTC

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb
hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness #complexity #psychology #jung #philosophy #astronomy #cosmology #physics #chemistry #biology #books #literature #anthropology #jrpg #retrogaming #survival #outdoors #homesteading #gardening
i need to enrich my feed reader.

#gardening #homesteading #outdoors #survival #retrogaming #jrpg

notanowl @[email protected] · 2026-02-08 · 03:25 UTC

RE: https://dobbs.town/@hobbs/116032781720531564

dear #lazyweb

hit me with your favorite RSS feeds for #homelab #selfhosting #linux #opensource #computing #programming #computerscience #cpu #microarchitecture #electronics #robotics #ai #llm #vlm #mllm #cognitivescience #consciousness #complexity #psychology #jung #philosophy #astronomy #cosmology #physics #chemistry #biology #books #literature #anthropology #jrpg #retrogaming #survival #outdoors #hunting #homesteading #gardening

i need to enrich my feed reader.

#hunting #lazyweb #homelab #selfhosting #linux #opensource

Habr @[email protected] · 2026-02-06 · 01:42 UTC

VLM / VLA / World Models / Physical AI

Нейроночки в последнее время заполонили всё. Ну, почти всё. Cейчас подбираются к роботам. Настоящего прогресса почти так же много как нейрослопа, пиара и преувеличений. В этой статье попробую рассказать про нейроночки для управления роботами: 🤖 Расскажу немного про теорию 🤖 Покажу как обучить всё это дома на коленке (и стать экспертом в Physical AI конечно)

https://habr.com/ru/companies/recognitor/articles/992476/

#VLM #LLM #VLA #World_models

#world_models #vla #llm #vlm

LinuxGizmos.com [Unofficial] @[email protected] · 2026-01-28 · 04:37 UTC

Sipeed MaixCAM2 combines 4K imaging and edge AI in an open camera platform

https://web.brid.gy/r/https://linuxgizmos.com/sipeed-maixcam2-combines-4k-imaging-and-edge-ai-in-an-open-camera-platform/

#devices #asr #camera #llm #maixcam2 #tts

Habr @[email protected] · 2025-11-01 · 08:42 UTC

Когда фантастика 1939 года становится реальностью 2025-го

Вчера вечером я впервые после детства взяла в руки рассказ «Я, робот» Эндо Биндера, опубликованный в январе 1939 года в журнале Amazing Stories.Именно Эндо Биндера (псевдоним братьев Эрла и Отто Биндеров) — а не Айзека Азимова. Это тот самый рассказ, чьё название Азимов «позаимствовал» одиннадцать лет спустя для своего знаменитого сборника 1950 года, причём сам Азимов протестовал против этого решения издателя, понимая, что название уже занято. А фильм 2004 года с Уиллом Смитом сняли по мотивам азимовского цикла о Трёх законах роботехники, так что связь с оригинальным рассказом Биндера только в названии.

https://habr.com/ru/articles/962348/

#робототехника #искусственный_интеллект #научная_фантастика #роботы #онтология #rag #vlm #vla #llm #bipedal_locomotion

#bipedal_locomotion #llm #vla #vlm #rag #онтология

💧🌏 Greg Cocks @[email protected] · 2025-10-30 · 19:07 UTC

Building Damage Risk In Sinking Indian Megacities
--
https://doi.org/10.1038/s41893-025-01663-0 <-- shared paper
--
https://doi.org/10.5194/isprs-annals-X-G-2025-613-2025 <-- shared paper
--
https://www.downtoearth.org.in/urbanisation/writing-on-the-wall-groundwater-exploitation-is-triggering-subsidence-in-indo-gangetic-plain-90523 <-- shared media article
--
#GIS #spatial #mapping #spatialanalysis #India #subsidence #risk #hazard #groundwater #depletion #exploitation #pumping #sinking #overpumping #aquifer #watermanagement #structural #damage #infrastructure #cost #residential #publicsafety #economics #engineering #construction #buildingcodes #remotesensing #model #modeling #satellite #earthobservation #elevation #megacities #city #cities #urban #differential #settlement #planning #policy #mitigation #maintainence #inSAR #verticallandmovement #VLM #engineeringgeology #geostatistics #flood #flooding #sediment

#gis #spatial #mapping #spatialanalysis #india #subsidence

Lambert Heller @[email protected] · 2025-10-24 · 03:45 UTC

"Cutting-edge Open OCR Models / We’ve seen an incredible wave of new models this past year. Because so much work is happening in the open, these players build on and benefit from each other’s work. A great example is AllenAI’s release of OlmOCR, which not only released a model but also the dataset used to train it. With these, others can build upon them in new directions. The field is incredibly active, but it’s not always obvious which model to use."

#vlm #atr #ocr

https://toot.cafe/@tomayac/115418110661215543

#vlm #atr #ocr

Habr @[email protected] · 2025-07-17 · 11:02 UTC

Это не BDD, это другое. Путь от кода к BugBuster — платформе автоматизации тестирования на естественном языке

Ручные тест-кейсы копятся быстрее, чем их успевают автоматизировать. Селекторы ломаются после каждого обновления вёрстки. А код автотестов остаётся понятным только разработчикам. В этой статье я разберу ключевые проблемы автотестов и расскажу, как их можно решить. Меня зовут Даниил Ахетов. Я занимаюсь автоматизацией тестирования уже достаточно давно. В основном пишу на JavaScript. Внедрял инструменты автоматизации тестирования в Яндексе, строил целое направление автоматизации тестирования фронта в SberDevices, но какие бы фреймворки я ни использовал и какие бы команды ни собирал, я всегда сталкивался с одной и той же проблемой: автоматизация тестирования не успевает. Мы постоянно работаем в догоняющем режиме. Причин этому много, но я для себя выделил три основные.

https://habr.com/ru/articles/927840/

#тестировщик #тестирование #qa #qa_automation #qa_management #vlm #ai #ии #ииагенты #тесткейсы

#тестировщик #тестирование #qa #qa_automation #qa_management #vlm

Habr @[email protected] · 2025-04-29 · 08:22 UTC

Как мы учили Алису видеть мир с помощью мультимодальной нейросети Яндекса

Недавно пользователям приложения «Алиса» стал доступен Live-режим, который работает на базе мультимодальной нейросети (VLM), созданной в Яндексе. В этом режиме Алиса распознаёт объекты, показанные ей через камеру смартфона, и рассказывает о них пользователю. А ещё раньше наша VLM стала применяться в Поиске по картинкам, Умной камере и Нейроэксперте. Всё это время технология не стояла на месте и продолжала совершенствоваться. Пожалуй, пришло время поделиться опытом. На связи Роман Исаченко из команды компьютерного зрения в Яндексе. Сегодня я расскажу, какой путь наша VLM прошла за полгода. А Дарья @dara-orange Виноградова, которая работает со мной в той же команде, поделится описанием пайплайна зрения в Алисе. Мы опишем весь путь формирования новой модели: от архитектуры и сбора данных до финальных замеров качества и скорости.

https://habr.com/ru/companies/yandex/articles/904584/

#vlm #natural_language_processing #computer_vision #multimodality #яндекс