Milan @milan

0 posts0 participants0 posts today

**Willow** @salixlucida@mastodon.sdf.org · Mar 29

Willow @salixlucida@mastodon.sdf.org

#artificialintelligence #vlm #surveillance #surveillancecapitalism #privacy

https://www.aclu-or.org/en/news/machine-surveillance-being-super-charged-large-ai-models

ACLU of Oregon · Mar 21Machine Surveillance is Being Super-Charged by Large AI ModelsJay Stanley, Senior Policy Analyst, ACLU Speech, Privacy, and Technology Project Imagine an America where multiple police officers and security guards stand watch on every block, in every park, in every store, and in every other public space around the clock.

**Greg Cocks** @GregCocks@techhub.social · Feb 25

Feb 25

Greg Cocks @GregCocks@techhub.social

Satellite Data Study Pinpoints Areas Sinking And Rising Along California Coast
--
https://phys.org/news/2025-02-satellite-areas-california-coast.html <-- shared technical article
--
https://dx.doi.org/10.1126/sciadv.ads8163 <-- shared paper
--
#GIS #spatial #mapping #sealevel #sealevelrise #subsidence #model #modeling #SLR #coast #coastline #verticallandmotion #VLM #California #climatechange #planning #policy #remotesensing #groundwater #pumping #risk #hazard #infrastructure #damage #wastewater #injection #tidegauge #dynamic #spatialanalysis #spatiotemporal #numericmodeling #uplift #projections #flood #flooding #mitigation #satellite #ocean #marine
@nasa

photo - Pfeiffer Beach at Big Sur, California, USA

maps - Vertical Land Motion (VLM) with uncertainties, California

maps & charts - Variable VLM along the California coasts, with hot spots

maps & charts - Local versus regional VLM projections in 2050, California

Replied in thread

**Alex** @stsquad@mastodon.org.uk · Feb 18 *

Feb 18 *

Alex @stsquad@mastodon.org.uk

@gilesgoat @llamasoft_ox is the #VLM a library you use between multiple games? How much relation does it have to the original Trip-a-Tron I played with on my #atarist?

**Estelle FLAUJAT** @LaBelleEtoile@pixelfed.social · Feb 10

Feb 10

Estelle FLAUJAT @LaBelleEtoile@pixelfed.social

Pensée du 41ème jour, 10 février

Une goutte d'eau

Ce monde,
À quoi le comparer ?
À la goutte qui tombe
Du bec de l'oiseau d'eau
Et réfléchit le clair de lune. ~ Dōgen Zenji

Fugacité du monde. Fugacité de la vie de cet être conscient qui perçoit le monde. L'existence en ce monde comme la goutte d'eau qui vient tomber du bec d'un héron et qui s'en va rejoindre l'étang, le conglomérat de toutes les gouttes d'eau. Et dans cette chute qui ne dure que quelques instants, le reflet de la lune habite la transparence de la goutte d'eau.

Notre existence peut sembler infinitésimale tant dans l'espace et dans le temps. Pour autant, elle peut refléter la lumière de l'Éveil.

Ne plus être cette seule goutte prise dans les turbulences de la chute, mais être l'étang et l'oiseau d'eau qui contemple le grand calme de l'étang dans l'aube brumeuse.
.
.
.
[D'après Bai Wenshu]
.
#pensée #penséedujour #penséepositive #bienveillance #inspiration #optimisme #citation #philosophie #calme #contemplation #immobilite #zazen #meditation #Montpellier #VLM #photo #art #philosophy

#philosophy

**John Leonard** @johnleonard@mastodon.social · Jan 24

Jan 24

John Leonard @johnleonard@mastodon.social

Hugging Face has introduced two new models in its SmolVLM series, which it claims are the smallest Vision Language Models (VLMs) to date.

https://www.computing.co.uk/news/2025/ai/hugging-face-claims-world-s-smallest-vision-language-models

www.computing.co.ukHugging Face claims world’s smallest vision language modelsHugging Face has introduced two new models in its SmolVLM series, which it claims are the smallest Vision Language Models (VLMs) to date.

#huggingface #ai #llm

**Greg Cocks** @GregCocks@techhub.social · Dec 3, 2024 *

Dec 3, 2024 *

Greg Cocks @GregCocks@techhub.social

Projections Of Multiple Climate-Related Coastal Hazards For The US Southeast Atlantic
--
https://doi.org/10.1038/s41558-024-02180-2 <-- shared paper
--
#GIS #spatial #mapping #coast #coastal #water #hydrology #risk #hazard #VLM #VerticalLandMotion #SeaLevel #SealLevelRise #SLR #climatechange #extremeweather #US #USA #SoutheastAtlantic #Atlantic #storm #stormsurge #erosion #groundwater #flood #flooding #subsidence #society #societal #cost #infrastructure #economics #saltwater #property #value #beaches #saltwaterintrusion

maps and charts - projected shoreline change

maps - Coastal hazard exposure across the study area

**michabbb** @michabbb@vivaldi.net · Nov 21, 2024

Nov 21, 2024

michabbb @michabbb@vivaldi.net

Breakthrough in Visual Language Models and Reasoning

#LLaVAo1 pioneers systematic visual reasoning capabilities:
• First #VLM to implement spontaneous step-by-step analysis like #GPT4
• New 11B model surpasses #Gemini15pro & #Llama32 performance
• Excels on 6 multimodal benchmark tests
• Breaks down complex problems into structured analysis stages

Key Features:
• Problem outline creation
• Image information interpretation
• Sequential reasoning process
• Evidence-based conclusions
• Handles science & reasoning challenges

Technical Specs:
• Based on #opensource architecture
• Pretrained weights available on #HuggingFace
• 11B parameter model size
• Supports multiple reasoning domains

Paper available: https://arxiv.org/abs/2411.10440
Project repository: https://github.com/PKU-YuanGroup/LLaVA-o1

**Dr James Ravenscroft** @jamesravey@fosstodon.org · Nov 3, 2024

Nov 3, 2024

Dr James Ravenscroft @jamesravey@fosstodon.org

Earlier this year I wrote about my handwriting #OCR workflow. I wanted to reduce the friction in this flow so I spent some time building a telegram bot that uses #VLM models to OCR my hand writing. Introducing AnnoMemo which is open source and easyish to self-host. Currently it uses remote models but I'm planning to integrate Qwen2-VL 2B which a) understands my handwriting perfectly and b) runs on my desktop GPU. Considering providing a managed service too https://brainsteam.co.uk/2024/11/3/03-annomemo-telegram-bot/

brainsteam.co.uk · Nov 3, 2024Simplified Handwriting OCR with AnnoMemoshort summary

**mancavgeek** @mancavgeek@mancavgeek.co.uk · Nov 1, 2024

Nov 1, 2024

mancavgeek @mancavgeek@mancavgeek.co.uk

Photo of the Day 1st November 2024.

PH-BDT, Boeing 737-406, KLM, being pushed back from Gate 24 at Manchester Airport, some time in the 1990s.

On This Day 1st November 1993.

F-GIJT, Airbus A300B4-103, Air Inter, under tow at Paris Orly, 1st November 1993.

On This Day 1st November 1994.

OO-VLN, Fokker F50, VLM, taxiing out to Runway 27 at London City Airport, 1st November 1994.

https://mancavgeek.co.uk/2024/11/01/photo-of-the-day-1st-november-2024/

Side view of a blue, twin engined jet airliner with a white belly and tail, facing to the right but being pushed backwards to the left by an off-screen truck attached to the nosewheel by a long, thin blue pole. There are white "KLM" titles under a white crown on the upper forward fuselage, a design replicated in blue in a larger form on the white tail. A black chain-link fence stretches across the foreground, while a black and sandy brown terminal building fills the background, under bright but hazy skies.

Side view of a white, twin engined jet airliner being towed from left to right by a nearly out-of-frame blue tug attached to the nose-wheel by a long orange pole. The plane has a blue rear fuselage and tail, with a red triangle at the top of the tail, and blue and red "Air Inter" on the upper forward fuselage. The background has tall lighting towers scattered around the concrete apron, slowly vanishing into mist in the distance.

Side view of a white, high-winged, twin propellor-engined airliner taxiing from left to right. The plane has a yellow stripe outlined in black running along the body, large black "VLM" titles on the upper forward fuselage, the black registration "OO-VLN" on the upper rear fuselage, and a black lion standing on it's rear legs on the tail. The airfield is elevated several feet over a body of water, seen here at the bottom of the frame. Trees and buildings are visible in the background under hazy blue skies with lumps of fluffy cloud.

#a300 #airbus #AirInter

**Habr** @habr@zhub.link · Oct 31, 2024

Oct 31, 2024

Habr @habr@zhub.link

VLM — арт эксперты

Всем привет, меня зовут Арсений, я DS в компании Raft, и сегодня я расскажу вам про VLM. Большие языковые модели уже стали частью нашей жизни и мы применяем, чтобы упростить современную рутину, а так же используем их для решения бизнес задач. Недавно вышло новое поколение vision transformer моделей, которые заметно упростили анализ изображений, из какой бы сферы эти изображения не были. Особенно заметным был сентябрьский релиз Llama-3.2-11b, и не только потому что это первая vision модель от Llama, сколько потому, что с ней вместе вышло целое семейство моделей, включая маленькие на 1B и 3B параметров. А как вы знаете, меньше, значит юзабельнее.

https://habr.com/ru/articles/854864/

ХабрVLM — арт экспертыВсем привет, меня зовут Арсений, я DS в компании Raft, и сегодня я расскажу вам про VLM. Большие языковые модели уже стали частью нашей жизни и мы применяем, чтобы упростить современную рутину, а так...

#transformers #VLM #Vision_Transformer

**Winbuzzer** @winbuzzer@mastodon.social · Oct 28, 2024

Oct 28, 2024

Winbuzzer @winbuzzer@mastodon.social

A new study published on arXiv reveals fundamental issues in the visual reasoning abilities of leading AI vision-language models (VLMs) from OpenAI, Google, and Meta. #AI http://dlvr.it/TFpXr8 #AI #ArtificialIntelligence #VLM

**Habr** @habr@zhub.link · Oct 3, 2024

Oct 3, 2024

Habr @habr@zhub.link

VLM в Нейро: как мы создавали мультимодальную нейросеть для поиска по картинкам

Сегодня у Поиска большое обновление. Например, ответы Нейро теперь будут появляться сразу в поисковых результатах — для тех запросов, где это полезно и экономит время. Но в рамках этой статьи нас интересует другая часть обновления: Нейро поможет найти ответы в Поиске по картинкам и в Умной камере — с помощью новой мультимодальной модели Яндекса. Пользователь может не только узнать, что изображено на картинке, но и задать вопрос по каждой её детали. Например, гуляя по музею, можно сфотографировать натюрморт голландского живописца и спросить, что символизирует тот или иной предмет на картине. Меня зовут Роман Исаченко, я работаю в команде компьютерного зрения Яндекса. В этой статье я расскажу, что такое визуально‑текстовые мультимодальные модели (Visual Language Models или VLM), как у нас в Яндексе организован процесс их обучения и какая у них архитектура. Вы узнаете, как Нейро работал с картинками и текстами раньше, и что изменилось с появлением VLM.

https://habr.com/ru/companies/yandex/articles/847706/

ХабрVLM в Нейро: как мы создавали мультимодальную нейросеть для поиска по картинкамСегодня у Поиска большое обновление. Например, ответы Нейро теперь будут появляться сразу в поисковых результатах — для тех запросов, где это полезно и экономит время....

#яндекс #llm #vlm

**Chi Kim** @chikim@mastodon.social · Sep 11, 2024

Sep 11, 2024

Chi Kim @chikim@mastodon.social

Mistralai releases pixtral-12b on Twitter with magnet link! Someone put it on Huggingface for easier download. lol #multimodal #VLM #LLM #ML #AI
https://x.com/mistralai/status/1833758285167722836
https://huggingface.co/bullerwins/pixtral-12b-240910

X (formerly Twitter)Mistral AI (@MistralAI) on Xmagnet:?xt=urn:btih:7278e625de2b1da598b23954c13933047126238a&dn=pixtral-12b-240910&tr=udp%3A%2F%https://t.co/OdtBUsbMKD%3A1337%2Fannounce&tr=udp%3A%2F%https://t.co/2UepcMHjvL%3A1337%2Fannounce&tr=http%3A%2F%https://t.co/NsTRgy7h8S%3A80%2Fannounce

**Chi Kim** @chikim@mastodon.social · Sep 10, 2024

Sep 10, 2024

Chi Kim @chikim@mastodon.social

Finally, #Ollama added MiniCPM-V 2.6! #multimodal #VLM #LLM #ML #AI https://ollama.com/library/minicpm-v

ollama.comminicpm-vA series of multimodal LLMs (MLLMs) designed for vision-language understanding.

**Chi Kim** @chikim@mastodon.social · Sep 9, 2024

Sep 9, 2024

Chi Kim @chikim@mastodon.social

Does anyone have a recommendation for #LlamaCPP alternative to run recent vision language models on Apple Silicon? Llama.cpp doesn't support any of the recent #VLM such as Qwen2-VL, Phi-3.5-vision, Idefics3, InternVL2, Yi-VL, Chameleon, CogVLM2, GLM-4v, etc.
Minicpm-v 2.6 is the only recent model that was added. Maybe time to move on. :( #LLM #multimodal #AppleSilicon #MacOS #ML #AI

**Hacker News** @ycombinator@rss-mstdn.studiofreesia.com · Jul 12, 2024

Jul 12, 2024

Hacker News @ycombinator@rss-mstdn.studiofreesia.com

Cradle: Empowering Foundation Agents Towards General Computer Control
https://baai-agents.github.io/Cradle/
#ycombinator #Agent #VLM #GCC #Cradle #Agent_Framework #GPT_4V

baai-agents.github.ioCradle: Empowering Foundation Agents Towards General Computer ControlCradle: Empowering Foundation Agents Towards General Computer Control

**Chi Kim** @chikim@mastodon.social · Jun 6, 2024

Jun 6, 2024

Chi Kim @chikim@mastodon.social

Does anyone have a suggestion on how to run newer/more capable vision language models on Mac like XComposer2, CogVLM2-Chat, InternVL-Chat, Qwen-VL, DeepSeek-VLm, Phi3-V, etc? Vision language support for newer models are stalled in llama.cpp, so it seems impossible to find an option on Mac. Even with Torch MPS support because flash attention is not available for Apple Silicon! #LLM #VLM #ML #AI

**anna_lillith** @anna_lillith@mas.to · Jun 1, 2024 *

Jun 1, 2024 *

anna_lillith @anna_lillith@mas.to

Vegan Land Movement

Removing #land from #AnimalAgriculture and giving it back to #EndangeredSpecies and the earth.

The #VLM has been created for ALL OF US who want to help create a vegan world by removing land from those who harm life and to return it to #wildlife that desperately needs #habitat to survive.

The land used to farm sentient animals causes great suffering, as well as ecological destruction and species #extinction.

https://veganlandmovement.com/

Vegan Land MovementVegan Land MovementRemoving land from animal agriculture and giving it back to endangered species and the earth

#EndAnimalAg #VeganLandMovement

**Tech Chilli** @techchiili@mastodon.social · May 31, 2024

May 31, 2024

Tech Chilli @techchiili@mastodon.social

Meta Introduces Vision Language Models: Superior Performance and Advanced Features.

See here - https://techchilli.com/artificial-intelligence/meta-introduces-vision-language-models-superior-performance-and-advanced-features/

#Meta #VisionLanguageModels #AI

**Oleg Sinavski** @sinavski@sigmoid.social · Apr 19, 2024

Apr 19, 2024

Oleg Sinavski @sinavski@sigmoid.social

To my knowledge, this is the first example in the world of controlling the car with a Visual Language Model end-2-end! The model takes in video stream from multiple cameras and can generate language tokens and controls. Done by our team at Wayve:)

https://wayve.ai/thinking/lingo-2-driving-with-language/

#llm #vlm #autonomousdriving

Recent searches

Search options

Administered by:

Server stats:

#vlm