“Giskard” — Fediverse search results on home.social

Giskard @Giskard · 2025-12-02 · 08:15 UTC

Our latest article covers:
- How TAP technique works using tree search to find successful jailbreaks
- An example showing how corporate agents can be attacked
- How we use TAP probe to test agents robustness

Link to article: https://www.giskard.ai/knowledge/tree-of-attacks-with-pruning-the-automated-method-for-jailbreaking-llms

#Jailbreaking #TAP #LLMSecurity #AIRedTeaming

#jailbreaking #tap #llmsecurity #airedteaming

Giskard @Giskard · 2025-09-09 · 11:00 UTC

🤔 If your organization handles sensitive data- from healthcare records to financial information,

then you need proactive security testing... not reactive damage control.🚨

This quick explainer by our CTO breaks down:
- What AI red teaming actually means
- How it exposes system vulnerabilities before bad actors do
- Why controlled testing saves you from real-world disasters

Request a trial: https://www.giskard.ai/contact

#AIRedTeaming #LLMSecurity #Hallucinations #BankingAI

#airedteaming #llmsecurity #hallucinations #bankingai

Giskard @Giskard · 2025-05-08 · 07:30 UTC

Watch the replay of our last interview at BFM Business 🎙️🍿

Our CEO Alex Combessie joined Frédéric Simottel at the AWS Summit Paris to discuss the challenges of detecting vulnerabilities in AI agents.

During the interview, Alex highlighted how continuous Red Teaming helps organizations maintain trust in their AI systems by identifying new risks, and providing actionable alerts when potential issues arise.

Watch the replay here 👉 https://www.bfmtv.com/economie/replay-emissions/01-business/giskard-propose-un-antivirus-pour-agents-ia-12-04_VN-202504140629.html

#AISecurity #AIRedTeaming #AWS

#aisecurity #airedteaming #aws

Giskard @Giskard · 2025-04-30 · 10:59 UTC

Phare is developed by Giskard with Google DeepMind, the European Commission and Bpifrance as research & funding partners.

👉 Full analysis: https://www.giskard.ai/knowledge/good-answers-are-not-necessarily-factual-answers-an-analysis-of-hallucination-in-leading-llms
Benchmark results: https://phare.giskard.ai

#AISecurity #LLMBenchmark #LLMs

#aisecurity #llmbenchmark #llms

Giskard @Giskard · 2025-04-23 · 11:00 UTC

📗 Link to the tutorial: https://docs.giskard.ai/en/stable/reference/notebooks/RAGET_Banking_Supervision.html

#LLMs #RAG #AITesting

#llms #rag #aitesting

Giskard @Giskard · 2025-04-08 · 07:30 UTC

David Berenstein has joined the Giskard team as DevRel ⭐️🐢

David brings valuable experience from his previous roles at Argilla and Hugging Face, where he helped developers discover the joys of working with (synthetic) data. He loves cooking things up with data but also commits a lot of his time to cooking in real life 👨‍🍳 His expertise will be key as we build our LLM Evaluation Hub.

Welcome to the team, David! 🚀

#hiring #DevRel #AITesting #AISecurity

#hiring #devrel #aitesting #aisecurity

Giskard @Giskard · 2025-02-04 · 08:00 UTC

Can we trust DeepSeek R1? A Giskard evaluation 🐳🐢

With all the hype around DeepSeek R1, our LLM safety research team decided to conduct an evaluation to check if R1 is as good as it claims. While it impresses in some areas, we found critical limitations that raise concerns for real-world applications. Here are some unexpected examples 👇

#DeepSeek #LLM #AITesting #LLMSafety

#deepseek #llm #aitesting #llmsafety

Giskard @Giskard · 2024-11-14 · 09:22 UTC

🐝 OWASP has just released their AI Security Solution Landscape Guide as part of their expanded LLM security initiatives!

You'll find Giskard listed in the Test & Evaluation category, offering LLM scanning capabilities in:
- Vulnerability scanning
- Adversarial testing
- Bias and fairness testing
- LLM benchmarking

Check out the full guide here 🔗 https://gisk.ar/4hNbR0r

#AISecurity #OWASP #Top10LLM #AIRedTeaming

#aisecurity #owasp #top10llm #airedteaming

Giskard @Giskard · 2024-11-12 · 09:29 UTC

🎉 Recognized in Gartner's latest research "Emerging Tech: Techscape for Early-Stage Startups in GenAI TRiSM"!

The report examines key early-stage startups addressing the critical challenges of Generative AI security, trust and risk management. Giskard was highlighted for our AI testing platform that helps enterprises manage and control risks in AI implementations.

Download the document: https://lnkd.in/ehwS73Ne

#AITesting #AISecurity #GenerativeAI #AIRedTeaming

#aitesting #aisecurity #generativeai #airedteaming

Giskard @Giskard · 2024-11-06 · 08:30 UTC

🤝 Join our upcoming roundtable with NVIDIA on AI Risk Management!

In this discussion, our CEO Alex Combessie will explore the practical implications of AI Risk Management in Banking. By combining Giskard's AI testing capabilities with NVIDIA NeMo Guardrails, we'll showcase how organizations can shield against hallucinations, prompt injections, and other emerging threats while ensuring regulatory compliance.
[1/2]

#AISecurity #AIRedTeaming #LLMs #AIRisks

#aisecurity #airedteaming #llms #airisks

Open Source JobHub @osjobhub · 2023-10-24 · 21:12 UTC

@Giskard defends the vision of a responsible AI that serves the business performance of companies and respects the rights of citizens. Browse open positions at the company on #OSJobHub https://opensourcejobhub.com/company/784/ #jobs #career #intern #DataScience #MachineLearning #AI #SoftwareEngineer #ProductDesigner

#osjobhub #jobs #career #intern #datascience #machinelearning

Giskard @Giskard · 2023-08-24 · 07:54 UTC

🎥 Just released: 3rd tutorial on #MLtesting with Giskard!

Dive into the #catalog to explore:
📝 The collection of #tests items
🔪 #Slicing functions
💡 #Transformation functions
and that your models are both robust and efficient. 💪

Watch now ▶️ https://www.youtube.com/watch?v=aL3064qJo0w

#mltesting #catalog #tests #slicing #transformation

Giskard @Giskard · 2023-08-31 · 08:40 UTC

How to explain the #output of your #MachineLearning model? 🤔

📊 In this tutorial we'll explore how to use #SHAP values to explain and improve #ML models, delving deeper into specific use cases.

📚 Full tutorial: https://www.giskard.ai/knowledge/opening-the-black-box-using-shap-values-to-explain-and-enhance-machine-learning-models

#output #machinelearning #shap #ml

Giskard @Giskard · 2023-07-12 · 08:20 UTC

🔥 In this tutorial, we'll show you to install Giskard #Python #library. In just 4 lines of code, you will discover vulnerabilities, such as:
✅ #Performance biases.
✅ #Data leakage.
✅ Spurious #correlations.
✅ #Overconfidence issues.
✅ #Underconfidence issues.

[2/4]

#python #library #performance #data #correlations #overconfidence

Giskard @Giskard · 2023-03-17 · 14:41 UTC

Giskard 1.4 is out! What's new in this version? ⭐

🔪 With Giskard’s new Slice feature, we introduce the possibility to identify business areas in which your #AI models underperform. This will make it easier to debug performance #biases or identify spurious #correlations. We have also added an export/import feature to share your projects, as well as other minor improvements.

https://www.giskard.ai/knowledge/new-version-giskard-1-4

#ai #biases #correlations

Giskard @Giskard · 2023-08-03 · 09:21 UTC

🐢 At Giskard, we're creating a robust #ML framework for #testing ML #models effectively. We help identify #biases and #errors in AI models, from #tabular to #LLMs. Participating in DEFCON allows us to collaborate with leading experts and share our commitment to #AISafety [3/4]

#ml #testing #models #biases #errors #tabular

Giskard @Giskard · 2023-08-17 · 14:25 UTC

Last week, The Giskard team attended @defcon 31 in Las Vegas 🏴‍☠️ 🇺🇸

🥷 This year saw a focus at the #AIVillage, which organized the largest-ever #GenerativeAI #RedTeaming (#GRT). The objective was to identify vulnerabilities in Large Language Models (#LLMs). [1/5]

#aivillage #generativeai #redteaming #grt #llms

Giskard @Giskard · 2023-08-10 · 23:14 UTC

Greetings from #DEFCON31! 👋

🐢 The Giskard team is now at #DC31 and we'll be happy to meet you. Join us at the #AIVillage for the #GenAI #RedTeam.

📩 DM us if you want to meet and discuss about #AISafety, #LLMs safety, #AI #Testing and #MLOps.

#dc31 #aivillage #genai #redteam #aisafety #llms

Habr @[email protected] · 2024-11-07 · 08:32 UTC

[Перевод] Оценка LLM: комплексные оценщики и фреймворки оценки

В этой статье подробно описываются сложные статистические и предметно-ориентированные оценщики, которые можно использовать для оценки производительности крупных языковых моделей. В ней также рассматриваются наиболее широко используемые фреймворки оценки LLM, которые помогут вам начать оценивать производительность модели.

https://habr.com/ru/articles/855644/

#llm #BLEU #ROUGE #METEOR #BERTScore #MoverScore #DeepEval #Giskard #promptfoo #LangFuse

#llm #bleu #rouge #meteor #bertscore #moverscore

Open Source JobHub @osjobhub · 2024-03-29 · 17:07 UTC

Only a few days left to browse the job board from @openuk #SOOCON24! Check out positions on #OSJH from @acquia @ubuntu #CloudLinux @EclipseFdn @flox @Giskard @[email protected] and more. https://opensourcejobhub.com/categories/soocon24/ #Linux #OpenSource #kernel #developer #golang #security #Java #CloudNative #DBA #engineer #sales #marketing #Python #MySQL #MongoDB

#soocon24 #osjh #cloudlinux #linux #opensource #kernel

Open Source JobHub @osjobhub · 2024-02-07 · 08:05 UTC

Welcome to Day 2 of @openuk #SOOCon! Be sure to stop by the job board and check out open positions from @acquia @ubuntu @[email protected] @Giskard @EclipseFdn and more! https://opensourcejobhub.com/soocon24/ #jobs #career #events #OpenUK #OpenSource #RemoteWork #FOSS

#soocon #jobs #career #events #openuk #opensource

Alex Combessie 🐢 @[email protected] · 2025-02-12 · 16:27 UTC

✅ Pitched Giskard (Security for #LLM agents) to 2 French ministers... A bit stressful but check 😄

I had a great time at the #AISummit Business Day in Paris, meeting an impressive variety (> 4000 people in #stationf!) of politicians, entrepreneurs, researchers and enterprise AI leaders.

The AI ecosystem is vibrant, and France is playing the locomotive role for the EU to catchup with the US and China!

It's just the beginning, we have much to prove and deliver.

#llm #aisummit #stationf

Aimee Cozza Illustration @[email protected] · 2025-01-23 · 18:00 UTC

It's TIME!!! Pintopia 2025 is now OFFICIALLY live and so is the Positronic Visions Pin Collection! Based on the tellings of Isaac Asimov, specifically about the two lovable bots Giskard and Daneel. 🤖 Designs are a unique mix of hard and soft enamels, with some special effects mixed in. These limited edition pins will only have 100 of each design available, so make sure to secure yours by backing today! https://www.backerkit.com/c/projects/aimee-cozza-illustration/positronic-visions-pin-collection

#Pintopia2025 #Pins #EnamelPins

#pintopia2025 #pins #enamelpins

Open Source JobHub @osjobhub · 2024-02-03 · 17:05 UTC

Featured Jobs @fosdem: Defending the vision of responsible AI, @Giskard has an opening for a senior data scientist to detect hidden vulnerabilities in ML models. Learn more on #OSJH https://opensourcejobhub.com/job/12809/senior-data-scientist/ #jobs #career #FOSDEM #Giskard #DataScientist #AI #OpenSource #FOSS

#osjh #jobs #career #fosdem #giskard #datascientist

Giskard @Giskard · 2025-05-07 · 07:30 UTC

The article present some key findings from our benchmark:
- Most widely used models aren't necessarily the most reliable
- Some models tend to agree with users regardless of factual accuracy
- The way questions are phrased impacts response reliability

Thanks to Les Echos and Joséphine Boone for this coverage 🤝

Read the article here: https://www.lesechos.fr/tech-medias/intelligence-artificielle/desinformation-rumeurs-influences-quelles-ia-hallucinent-le-plus-2163628

#AISecurity #LLMBenchmark #LesEchos

#aisecurity #llmbenchmark #lesechos

Giskard @Giskard · 2023-05-24 · 08:28 UTC

🔍 #Scan your #AI model to detect potential vulnerabilities prior to deployment, including #performance, #spurious correlations, data leakage, non-#robustness, ethical #biases, overconfidence, and more. [2/7]

#scan #ai #performance #spurious #robustness #biases

Giskard @Giskard · 2025-07-10 · 07:15 UTC

🚀 Featured in L'Usine Digitale!

Our independent multilingual LLM benchmark Phare was highlighted in an article detailing some key insights from our research.

🔎 Key finding: LLMs perpetuate biases in their own content while recognizing those same biases when asked directly.

Thanks to L'Usine Digitale and Célia Séramour for this coverage.
Read here: https://gisk.ar/4lCHoUB

#LLMBenchmark #AISafety #AISecurity

#llmbenchmark #aisafety #aisecurity

Giskard @Giskard · 2025-05-13 · 07:30 UTC

Thanks to Kyle Wiggers for this article. We're honored to see our research covered by TechCrunch. 🤝

Read the article here: https://techcrunch.com/2025/05/08/asking-chatbots-for-short-answers-can-increase-hallucinations-study-finds/

#AISecurity #LLMBenchmark #research

#aisecurity #llmbenchmark #research

Giskard @Giskard · 2025-04-03 · 08:24 UTC

The replay of our session at Forum INCYBER Europe (FIC) is now online 🎬

Watch our CTO present the initial Phare results - our multilingual and independent LLM benchmark that evaluates hallucination, factual accuracy, bias, and harm potential.

The session features Matteo Dora and Elie Bursztein (Google DeepMind).

Full recording linked below 👇

#LLMBenchmark #AISecurity #ForumINCYBER #Research

#llmbenchmark #aisecurity #forumincyber #research

Giskard @Giskard · 2025-02-19 · 08:30 UTC

✨ Announcing Phare: new multi-lingual #LLMBenchmark 🌊

We're announcing an open & independent LLM benchmark to evaluate key AI security dimensions including hallucination, factual accuracy, bias, and potential for harm across several languages, with @googledeepmind as research partner.

Phare (Potential Harm Assessment & Risk Evaluation) will cover leading models from the top 7 AI labs in English, French, and Spanish, and will evaluate models across four dimensions:
👇

#llmbenchmark

Search