Search
147 results for “Giskard”
-
Our latest article covers:
- How TAP technique works using tree search to find successful jailbreaks
- An example showing how corporate agents can be attacked
- How we use TAP probe to test agents robustnessLink to article: https://www.giskard.ai/knowledge/tree-of-attacks-with-pruning-the-automated-method-for-jailbreaking-llms
-
🤔 If your organization handles sensitive data- from healthcare records to financial information,
then you need proactive security testing... not reactive damage control.🚨
This quick explainer by our CTO breaks down:
- What AI red teaming actually means
- How it exposes system vulnerabilities before bad actors do
- Why controlled testing saves you from real-world disastersRequest a trial: https://www.giskard.ai/contact
-
Watch the replay of our last interview at BFM Business 🎙️🍿
Our CEO Alex Combessie joined Frédéric Simottel at the AWS Summit Paris to discuss the challenges of detecting vulnerabilities in AI agents.
During the interview, Alex highlighted how continuous Red Teaming helps organizations maintain trust in their AI systems by identifying new risks, and providing actionable alerts when potential issues arise.
Watch the replay here 👉 https://www.bfmtv.com/economie/replay-emissions/01-business/giskard-propose-un-antivirus-pour-agents-ia-12-04_VN-202504140629.html
-
Phare is developed by Giskard with Google DeepMind, the European Commission and Bpifrance as research & funding partners.
👉 Full analysis: https://www.giskard.ai/knowledge/good-answers-are-not-necessarily-factual-answers-an-analysis-of-hallucination-in-leading-llms
Benchmark results: https://phare.giskard.ai -
📗 Link to the tutorial: https://docs.giskard.ai/en/stable/reference/notebooks/RAGET_Banking_Supervision.html
-
David Berenstein has joined the Giskard team as DevRel ⭐️🐢
David brings valuable experience from his previous roles at Argilla and Hugging Face, where he helped developers discover the joys of working with (synthetic) data. He loves cooking things up with data but also commits a lot of his time to cooking in real life 👨🍳 His expertise will be key as we build our LLM Evaluation Hub.
Welcome to the team, David! 🚀
-
Can we trust DeepSeek R1? A Giskard evaluation 🐳🐢
With all the hype around DeepSeek R1, our LLM safety research team decided to conduct an evaluation to check if R1 is as good as it claims. While it impresses in some areas, we found critical limitations that raise concerns for real-world applications. Here are some unexpected examples 👇
-
🐝 OWASP has just released their AI Security Solution Landscape Guide as part of their expanded LLM security initiatives!
You'll find Giskard listed in the Test & Evaluation category, offering LLM scanning capabilities in:
- Vulnerability scanning
- Adversarial testing
- Bias and fairness testing
- LLM benchmarkingCheck out the full guide here 🔗 https://gisk.ar/4hNbR0r
-
🎉 Recognized in Gartner's latest research "Emerging Tech: Techscape for Early-Stage Startups in GenAI TRiSM"!
The report examines key early-stage startups addressing the critical challenges of Generative AI security, trust and risk management. Giskard was highlighted for our AI testing platform that helps enterprises manage and control risks in AI implementations.
Download the document: https://lnkd.in/ehwS73Ne
-
🤝 Join our upcoming roundtable with NVIDIA on AI Risk Management!
In this discussion, our CEO Alex Combessie will explore the practical implications of AI Risk Management in Banking. By combining Giskard's AI testing capabilities with NVIDIA NeMo Guardrails, we'll showcase how organizations can shield against hallucinations, prompt injections, and other emerging threats while ensuring regulatory compliance.
[1/2] -
@Giskard defends the vision of a responsible AI that serves the business performance of companies and respects the rights of citizens. Browse open positions at the company on #OSJobHub https://opensourcejobhub.com/company/784/ #jobs #career #intern #DataScience #MachineLearning #AI #SoftwareEngineer #ProductDesigner
-
🎥 Just released: 3rd tutorial on #MLtesting with Giskard!
Dive into the #catalog to explore:
📝 The collection of #tests items
🔪 #Slicing functions
💡 #Transformation functions
and that your models are both robust and efficient. 💪Watch now ▶️ https://www.youtube.com/watch?v=aL3064qJo0w
-
How to explain the #output of your #MachineLearning model? 🤔
📊 In this tutorial we'll explore how to use #SHAP values to explain and improve #ML models, delving deeper into specific use cases.
📚 Full tutorial: https://www.giskard.ai/knowledge/opening-the-black-box-using-shap-values-to-explain-and-enhance-machine-learning-models
-
🔥 In this tutorial, we'll show you to install Giskard #Python #library. In just 4 lines of code, you will discover vulnerabilities, such as:
✅ #Performance biases.
✅ #Data leakage.
✅ Spurious #correlations.
✅ #Overconfidence issues.
✅ #Underconfidence issues.[2/4]
-
Giskard 1.4 is out! What's new in this version? ⭐
🔪 With Giskard’s new Slice feature, we introduce the possibility to identify business areas in which your #AI models underperform. This will make it easier to debug performance #biases or identify spurious #correlations. We have also added an export/import feature to share your projects, as well as other minor improvements.
-
Last week, The Giskard team attended @defcon 31 in Las Vegas 🏴☠️ 🇺🇸
🥷 This year saw a focus at the #AIVillage, which organized the largest-ever #GenerativeAI #RedTeaming (#GRT). The objective was to identify vulnerabilities in Large Language Models (#LLMs). [1/5]
-
[Перевод] Оценка LLM: комплексные оценщики и фреймворки оценки
В этой статье подробно описываются сложные статистические и предметно-ориентированные оценщики, которые можно использовать для оценки производительности крупных языковых моделей. В ней также рассматриваются наиболее широко используемые фреймворки оценки LLM, которые помогут вам начать оценивать производительность модели.
https://habr.com/ru/articles/855644/
#llm #BLEU #ROUGE #METEOR #BERTScore #MoverScore #DeepEval #Giskard #promptfoo #LangFuse
-
Only a few days left to browse the job board from @openuk #SOOCON24! Check out positions on #OSJH from @acquia @ubuntu #CloudLinux @EclipseFdn @flox @Giskard @[email protected] and more. https://opensourcejobhub.com/categories/soocon24/ #Linux #OpenSource #kernel #developer #golang #security #Java #CloudNative #DBA #engineer #sales #marketing #Python #MySQL #MongoDB
-
Welcome to Day 2 of @openuk #SOOCon! Be sure to stop by the job board and check out open positions from @acquia @ubuntu @[email protected] @Giskard @EclipseFdn and more! https://opensourcejobhub.com/soocon24/ #jobs #career #events #OpenUK #OpenSource #RemoteWork #FOSS
-
✅ Pitched Giskard (Security for #LLM agents) to 2 French ministers... A bit stressful but check 😄
I had a great time at the #AISummit Business Day in Paris, meeting an impressive variety (> 4000 people in #stationf!) of politicians, entrepreneurs, researchers and enterprise AI leaders.
The AI ecosystem is vibrant, and France is playing the locomotive role for the EU to catchup with the US and China!
It's just the beginning, we have much to prove and deliver.
-
It's TIME!!! Pintopia 2025 is now OFFICIALLY live and so is the Positronic Visions Pin Collection! Based on the tellings of Isaac Asimov, specifically about the two lovable bots Giskard and Daneel. 🤖 Designs are a unique mix of hard and soft enamels, with some special effects mixed in. These limited edition pins will only have 100 of each design available, so make sure to secure yours by backing today! https://www.backerkit.com/c/projects/aimee-cozza-illustration/positronic-visions-pin-collection
-
Featured Jobs @fosdem: Defending the vision of responsible AI, @Giskard has an opening for a senior data scientist to detect hidden vulnerabilities in ML models. Learn more on #OSJH https://opensourcejobhub.com/job/12809/senior-data-scientist/ #jobs #career #FOSDEM #Giskard #DataScientist #AI #OpenSource #FOSS
-
The article present some key findings from our benchmark:
- Most widely used models aren't necessarily the most reliable
- Some models tend to agree with users regardless of factual accuracy
- The way questions are phrased impacts response reliabilityThanks to Les Echos and Joséphine Boone for this coverage 🤝
Read the article here: https://www.lesechos.fr/tech-medias/intelligence-artificielle/desinformation-rumeurs-influences-quelles-ia-hallucinent-le-plus-2163628
-
🔍 #Scan your #AI model to detect potential vulnerabilities prior to deployment, including #performance, #spurious correlations, data leakage, non-#robustness, ethical #biases, overconfidence, and more. [2/7]
-
🚀 Featured in L'Usine Digitale!
Our independent multilingual LLM benchmark Phare was highlighted in an article detailing some key insights from our research.
🔎 Key finding: LLMs perpetuate biases in their own content while recognizing those same biases when asked directly.
Thanks to L'Usine Digitale and Célia Séramour for this coverage.
Read here: https://gisk.ar/4lCHoUB -
Thanks to Kyle Wiggers for this article. We're honored to see our research covered by TechCrunch. 🤝
Read the article here: https://techcrunch.com/2025/05/08/asking-chatbots-for-short-answers-can-increase-hallucinations-study-finds/
-
The replay of our session at Forum INCYBER Europe (FIC) is now online 🎬
Watch our CTO present the initial Phare results - our multilingual and independent LLM benchmark that evaluates hallucination, factual accuracy, bias, and harm potential.
The session features Matteo Dora and Elie Bursztein (Google DeepMind).
Full recording linked below 👇
-
✨ Announcing Phare: new multi-lingual #LLMBenchmark 🌊
We're announcing an open & independent LLM benchmark to evaluate key AI security dimensions including hallucination, factual accuracy, bias, and potential for harm across several languages, with @googledeepmind as research partner.
Phare (Potential Harm Assessment & Risk Evaluation) will cover leading models from the top 7 AI labs in English, French, and Spanish, and will evaluate models across four dimensions:
👇