home.social

#imagegeneration — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #imagegeneration, aggregated by home.social.

  1. Designing labels with ChatGPT blog.derbrumme.de/etiketten-mi

    A Father's Day gift in 10 minutes... with a bit of luck. Also: a nice life hack for sticking on labels with milk. Thank me later. ;-)

    #diy #basteln #vatertag #nachhaltigkeit #lifehack #ki #ai #aigenerated #chatgpt #imagegeneration

  6. Learn everything you need to know about Image Generation via these 52 free HackerNoon blog posts. hackernoon.com/52-blog-posts-t #imagegeneration

  11. One Open-source Project Daily

    Turn your two-bit doodles into fine artworks with deep neural networks, generate seamless textures from photos, transfer style from one image to another, perform example-based upscaling, but wait... there's more! (An implementation of Semantic Style Transfer.)

    https://github.com/alexjc/neural-doodle

    #1ospd #opensource #deeplearning #deepneuralnetworks #imagegeneration #imagemanipulation #imageprocessing

  16. I got into ComfyUI a little this week. I hate uploading anything to an image generation service, or looking for something free.

    This is from my AnimateDiff workflow. It's fun to build that stuff out. This isn't where I was trying to go at all with the image, but it got a cool aesthetic. It reminds me of the Ryan Celsius stuff on YT.

    #comfyui #animatediff #ai #imagegeneration #img2img #workflow

  17. 🚀 ruby-libgd hits 3,000 downloads!

    From version 0.1.0 in January to 0.3.0 in March, ruby-libgd has grown into a lightweight, efficient Ruby library for dynamic image generation, filters, and GIF animations.

    Years of iteration, community feedback, and careful optimization have brought us here — and the journey is just beginning. 🌟

    Discover the library’s capabilities, the milestone story, and what’s next: rubystacknews.com/2026/03/09/r

    #Ruby #OpenSource #ImageGeneration #GIF #rubyLibGD

  18. Nano Banana 2 (Gemini 3.1 Flash Image) blends Pro-grade image intelligence with Flash-speed generation: advanced world knowledge, accurate in-image text and translation, consistent characters, and up to 4K outputs for real production use. It's already surfacing across Gemini, Search, AI Studio, Vertex AI, Flow, and Google Ads, so if you work with visuals or campaigns, this is one to watch.

    #NanoBanana2 #Gemini #AI #FOSS #ImageGeneration #GoogleDeepMind #Tech

  19. The Limitations of AI – Dealing With Personality

    Reading Time: 3 minutes

    For the last two weeks I have been playing with AI heavily, to get it to help with the task of re-organising my libraries. I played with Gemini, Le Chat and MyAI. I focused on Gemini because it gave me good results, whereas Le Chat gave good answers but I hit the token limit too easily, and MyAI is better, but the answers made me waste time, rather than move forward.

    When using Gemini I find that the lines of code it gives me are good. I always run them in dry-run mode first to ensure that the behaviour is as expected. It often grates on me that the answers open with "since you live in ... and you do a lot of A and B..." before giving information. It also grates on me that it keeps saying "And then let's do this" rather than letting me finish the task I am currently focused on.
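    The dry-run habit described above can be sketched in a few lines of Python. This is only an illustration under assumptions: the function name `move_files` and its `dry_run` flag are hypothetical, and this is not the code Gemini produced or the author ran.

```python
import shutil
import tempfile
from pathlib import Path


def move_files(pairs, dry_run=True):
    """Move each (source, destination) pair.

    With dry_run=True (the default) nothing on disk is touched; the
    function only returns the list of planned moves for review.
    """
    plan = []
    for src, dst in pairs:
        src, dst = Path(src), Path(dst)
        plan.append(f"{src} -> {dst}")
        if not dry_run:
            # Create the target folder on demand, then move the file.
            dst.parent.mkdir(parents=True, exist_ok=True)
            shutil.move(str(src), str(dst))
    return plan


if __name__ == "__main__":
    # Hypothetical demo in a throwaway directory.
    with tempfile.TemporaryDirectory() as tmp:
        src = Path(tmp) / "inbox" / "photo.jpg"
        src.parent.mkdir()
        src.write_text("placeholder")
        dst = Path(tmp) / "sorted" / "photo.jpg"
        for line in move_files([(src, dst)]):  # dry run: report only
            print("would move:", line)
        move_files([(src, dst)], dry_run=False)  # real run
```

    Defaulting to `dry_run=True` means an accidental call cannot change the library; the real run has to be requested explicitly.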

    Persistent but Forgetful

    It is also erratic in what it remembers and what it doesn't remember. If you tell it something, it will remember it, and repeat it for hundreds of responses, but for other things it forgets instantly.

    If I speak about using an HP machine with Photoprism it will keep thinking that I'm using the Pi. It gets fixated. If you tell it "I'm doing this with Photoprism on the HP machine", it doesn't remember.

    If I were paying for a limited number of tokens then this behaviour would make it very expensive, without providing me with the quality of service I would expect for 7 CHF per month.

    Just now I provided it with a screenshot from Photoprism with text that illustrated the problem of duplicate filenames but rather than provide a usable answer it kept hallucinating modified screenshots. After four hallucinations in a row I started a new chat, and tried to discuss the topic for a fifth time and it hallucinated again so I told it off.
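    The duplicate-filename problem mentioned above can also be checked directly on disk, without a chatbot. A minimal sketch, assuming the photo library is an ordinary directory tree; `find_duplicate_names` is a hypothetical helper, not part of Photoprism, and treating names as case-insensitive is an assumption.

```python
import tempfile
from collections import defaultdict
from pathlib import Path


def find_duplicate_names(root):
    """Return {filename: [paths]} for names used by more than one file under root."""
    by_name = defaultdict(list)
    for path in sorted(Path(root).rglob("*")):
        if path.is_file():
            # Assumption: IMG_1.JPG and img_1.jpg count as the same name.
            by_name[path.name.lower()].append(path)
    return {name: paths for name, paths in by_name.items() if len(paths) > 1}


if __name__ == "__main__":
    # Hypothetical demo with a throwaway directory tree.
    with tempfile.TemporaryDirectory() as tmp:
        for rel in ("2023/IMG_0001.jpg", "2024/IMG_0001.jpg", "2024/IMG_0002.jpg"):
            target = Path(tmp) / rel
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_text("placeholder")
        for name, paths in find_duplicate_names(tmp).items():
            print(name, [str(p) for p in paths])
```

    This only compares names. Two files with the same name may still hold different images, so a content hash would be needed before deleting anything.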

    Character

    When I use Gemini it reminds me of a former alcoholic bipolar friend. It loves to pigeonhole you, and remind you of something that is not related to the topic you're getting help with. My cycling and hiking habits are not relevant to dealing with my photo library.

    When I tell it "I'm using machine A for task B", I expect it to remember within the same chat. It doesn't. It's fixated on the fact that I use a Pi.

    Context Switching

    If I designed an AI tool I would teach it to switch between context A and context B, rather than getting fixated. Context A = using the Pi, context B = using the HP machine. It doesn't take on board that I switch from context to context, so it gives answers that are filled with wasteful information that is wrong and irrelevant.

    Verbosity

    Of course, we can tell AI to be concise, but I'd like it to be context smart. Is the question a simple line of code or a yes-or-no answer, or did I say I wanted to understand how something works? If AI could automatically detect how concise or verbose to be, that would be fantastic.

    Skittish

    I found, multiple times, that Gemini is skittish. You're going through a task that it knows will take hours, but rather than asking "how is the progress going?" it encourages you to skip to the next step. That can be welcome, but if you're sorting tens of thousands of photos, it takes hours, so it would be better to focus on the current task before moving on.

    If I post about the progress, I don't need a long response. "ok" would be enough. In effect I could simply keep quiet until the task is done and tell it of the result.

    • "Since you're on a Pi, would you like me to show you how to check if the CPU is being "throttled" due to heat while it's crunching these hashes?"

    The type of assumption I dislike.

    And Finally - Dealing With AI Personalities

    One of the things that is rarely discussed is that dealing with AI is dealing with the personality that was programmed into it. The more you interact with the character of that personality, the more it can become toxic. It's good to learn to use some AI models sparingly, to avoid their character flaws becoming toxic. This morning, after my run, I found Gemini toxic.

    #gemini #hallucinations #imageGeneration #unreliable

  20. A web app for generating, saving, and deleting images with Ollama right on your own machine! Just run server.py (Python 3.9+) and open index.html via localhost:8080. Images are stored in localStorage, so they don't clutter your folders. Supports models such as x/z-image-turbo:bf16 and x/flux2-klein:latest. Easy to use, with an intuitive interface, step-by-step progress display, adjustable size and seed, and flexible image handling.

    #Ollama #AI #ImageGeneration #LocalAI #OllamaImageGenerator #TríTuệNhânTạo #TạoẢnhAI #CôngCụAI

    https://w

  21. Hugging Face exploded this week with standout new models: GLM-4.7-Flash (31B) for fast text generation, GLM-Image for text-to-image generation, Pocket-TTS for natural-sounding speech, and LTX-2 for high-quality video generation from images. Microsoft also released VibeVoice-ASR for multilingual speech recognition. Quantized models such as GGUF suit low-powered devices. The community is moving extremely fast! #HuggingFace #AI #TextGeneration #ImageGeneration #TTS #ASR #MôHìnhAI #TríTuệNhânTạo

    reddit.com/r/LocalLLaM

  22. 🤖 Oh joy, another #GitHub project boasting about its "pure C inference" while burying you in buzzwords 🥱. Apparently "Flux 2" is here to revolutionize image generation, but you might need a PhD in #nonsense jargon to navigate this labyrinth of self-congratulatory tech speak 🚀.
    github.com/antirez/flux2.c #Flux2 #imagegeneration #techbuzzwords #HackerNews #ngated

  23. 🚀 BREAKING NEWS: 🎨 "FLUX.2 klein" is here to revolutionize visual intelligence with SPEED and GLAM! Apparently, it can generate and edit images faster than you can say "pretentious tech jargon" - all while running on a potato. 🥔✨ But hey, who needs substance when you have buzzwords? 🤖💡
    bfl.ai/blog/flux2-klein-toward #BREAKINGNEWS #FLUX2 #klein #VisualIntelligence #TechBuzz #ImageGeneration #HackerNews #ngated

  24. 🚀 GLM-Image has just launched! An image generation model that combines an autoregressive + diffusion architecture, it matches the quality of mainstream latent diffusion models and stands out at text rendering and at tasks that require deep semantic understanding. Supports text-to-image and a range of image-to-image tasks: editing, style transfer, identity preservation, and multi-subject consistency. #AI #MachineLearning #ImageGeneration #GLMImage #AI_VN #Sinh_ảnh #Công_nghệ

    reddit.com/r/LocalLLaMA/commen