home.social

#risk-assessment — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #risk-assessment, aggregated by home.social.

fetched live
  1. #AI #RiskAssessment #WeaponsOfMathDestruction

    Jeg blev lige mindet om en rapport, jeg skrev på ITU, på kurset "Digitalisation and Public Sector Transformation": "Preparing for a potential future of using algorithms in case decision work at Ankestyrelsen". Jeg arbejdede der og fulgte en konkurrence, de holdt, hvor medarbejdere kunne foreslå ideer til sagsbehandling vha. machine learning.

    Hele rapporten kan læses her: hartmannconsulting.dk/wp-conte

  2. DATE: May 14, 2026 at 02:00PM
    SOURCE: PSYPOST.ORG

    ** Research quality varies widely from fantastic to small exploratory studies. Please check research methods when conclusions are very important to you. **
    -------------------------------------------------

    TITLE: Women score higher than men on fluid intelligence tests when allowed to express uncertainty

    URL: psypost.org/updating-the-multi

    Traditional tests of intelligence and literacy may be fundamentally flawed because they force test-takers to choose a single answer rather than allowing them to express their level of confidence in different options. When people are given financial incentives and allowed to distribute their answers based on how sure they are, women actually score higher than men. The research was published in the Journal of Political Economy.

    For decades, psychologists and economists have measured cognitive ability using multiple-choice tests. These assessments score responses as strictly right or wrong. Glenn W. Harrison of Georgia State University, Don Ross of University College Cork, and J. Todd Swarthout of Georgia State University suspected this format misses a vital component of human cognition. Knowing how strongly to believe in an answer is a skill in itself.

    The researchers note that the standard format forces people to mask their thought processes. If someone is somewhat confident in an answer but still perceives some risk of being wrong, the rigid format does not capture that nuance. The test format demands absolute certainty even when a person possesses healthy skepticism.

    To address this, the team examined the Raven Advanced Progressive Matrices test. This assessment presents a grid of shapes with one missing piece and asks the test-taker to identify the pattern. It is widely used to measure fluid intelligence, which is the ability to solve new logic problems without relying on prior knowledge.

    The researchers wrote that, “The measurement of intelligence should identify and measure an individual’s subjective confidence that a response to a test question is correct.” They noted that existing tests completely fail to achieve this goal.

    The standard version of this puzzle allows test-takers unlimited time and offers no financial motivation. The researchers created a computerized version that offered monetary rewards for correct answers. They divided participants into different groups to test how the structure of the task changed their performance.

    In the baseline group, participants took a traditional version for a flat fee of five dollars. In another group, participants were paid based on their accuracy but were still forced to pick just one answer. A third group experienced a radically different test structure.

    These participants were given eighty digital tokens to allocate across eight possible answers. If they were completely sure, they could place all eighty tokens on a single choice for a maximum reward of two dollars per puzzle. If they were unsure, they could spread their tokens out over multiple likely answers to guarantee a smaller payout.

    This token system measures what the researchers refer to as confidence. In this context, confidence does not mean optimism. It refers to the precision of a person’s belief. A person who places ten tokens on every single answer is safely guarding against risk because they have no idea which shape is correct.

    When financial incentives were combined with the ability to express varying degrees of confidence, the results shifted dramatically. In the traditional format, female participants scored lower than male participants. When participants could assign tokens based on their confidence, women outperformed men.

    The data showed that female participants were better at calculating the risk of their answers and distributing their tokens efficiently. Knowing when you are unsure is a core part of cognition. The researchers consider this risk assessment to be a fundamental element of fluid intelligence.

    The researchers also altered the order of the puzzles. The standard test starts with easy puzzles and gradually progresses to difficult ones. The researchers call this sequence a structured progression, meaning it is an environmental clue that helps a person think.

    When the researchers scrambled the order of the puzzles so that difficulty varied randomly, overall performance dropped. The gap in performance between the group forced to pick one answer and the group allowed to use tokens widened even further. This confirmed that the ability to express uncertainty is a distinct cognitive advantage when facing unpredictable problems.

    This discovery regarding gender prompted the researchers to revisit other areas where men possess a supposed advantage. They looked at studies regarding competitiveness. Past behavioral studies suggest that women back away from competitive environments, such as workplace tournaments, in favor of flat payment schedules.

    The researchers recreated these experiments using the token system and discovered that women were making the mathematically correct risk management choices. Participants had to solve logic problems under a time limit, choosing either a guaranteed payment per correct answer or a tournament style where only the top performer received a large payout.

    Men tended to choose the competitive tournament even when it resulted in a monetary loss for them. Men proved to be overly optimistic about their chances of winning. Women evaluated the risk accurately and chose the safer compensation structure, which resulted in better financial outcomes.

    The team also looked at financial literacy tests. Standard surveys report that women choose the “do not know” option much more often than men when asked financial questions. This has led to the assumption that women possess lower financial literacy.

    The researchers presented participants with a standard question about calculating purchasing power based on interest and inflation rates. When the researchers allowed subjects to use tokens to answer the question, they found that women were just more open about their lack of complete certainty. The bias in their actual knowledge was tiny and not statistically significant.

    Many women distributed their tokens broadly, meaning they were aware that they lacked the exact knowledge and guarded their bets accordingly. This behavior signals an intellectual awareness of uncertainty. Someone who knows they are guessing is more likely to seek out a financial advisor or a textbook to learn the correct answer.

    Individuals who place all their tokens on a highly incorrect answer represent a much larger danger. The researchers noted that these individuals are completely confident in their incorrect knowledge. These are the people most likely to make catastrophic financial decisions without consulting outside help.

    The authors specify that their findings on motivation might involve variables that are difficult to isolate. Participants might bring personal motivations into the laboratory that interact with the monetary incentives offered by the experimenters.

    Future studies could attempt to separate these personal drives from the financial rewards to see how they impact token distribution. The research team also plans to further investigate data suggesting that Black participants similarly perform drastically better when allowed to express their confidence through the token system.

    The study, “Gender, Confidence, and the Mismeasure of Intelligence, Competitiveness, and Literacy,” was authored by Glenn W. Harrison, Don Ross, and J. Todd Swarthout.

    URL: psypost.org/updating-the-multi

    -------------------------------------------------

    DAILY EMAIL DIGEST: Email [email protected] -- no subject or message needed.

    Private, vetted email list for mental health professionals: clinicians-exchange.org

    Unofficial Psychology Today Xitter to toot feed at Psych Today Unofficial Bot @PTUnofficialBot

    NYU Information for Practice puts out 400-500 good quality health-related research posts per week but its too much for many people, so that bot is limited to just subscribers. You can read it or subscribe at @PsychResearchBot

    Since 1991 The National Psychologist has focused on keeping practicing psychologists current with news, information and items of interest. Check them out for more free articles, resources, and subscription information: nationalpsychologist.com

    EMAIL DAILY DIGEST OF RSS FEEDS -- SUBSCRIBE: subscribe-article-digests.clin

    READ ONLINE: read-the-rss-mega-archive.clin

    It's primitive... but it works... mostly...

    -------------------------------------------------

    #psychology #counseling #socialwork #psychotherapy @psychotherapist @psychotherapists @psychology @socialpsych @socialwork @psychiatry #mentalhealth #psychiatry #healthcare #depression #psychotherapist #FluidIntelligence #ConfidenceInAssessment #TokenBasedTesting #GenderDifferences #RiskAssessment #UncertaintyExpression #CognitiveMeasurement #EconomicIncentives #FinancialLiteracy #RavenMatrices

  3. Government of Canada: Government of Canada creates platform for Canada-wide flood risk awareness: Canada’s Flood Risk Finder. “With a simple address search, users will be able to quickly find information about the flood risk in their area, rated on a four-point scale from low to extreme. This new tool complements other sources of data, like local, provincial, and territorial maps, and fills […]

    https://rbfirehose.com/2026/04/29/government-of-canada-creates-platform-for-canada-wide-flood-risk-awareness-canadas-flood-risk-finder-government-of-canada/
  4. A critique of the linear non-threshold model used by the U.S. EPA to assess cancer risk:

    acsh.org/news/2026/04/28/asses

    As noted in the article, this is a politically contentious topic. The author has had a long career in both the public and private sector, and does not appear to have a political agenda.

    #Toxicology #RiskAssessment #Cancer #Environment #EnvironmentalToxicology #USEPA

  5. PsyPost: Women perceive AI as riskier than men do, study finds. “An online survey found that women consistently perceive AI to be riskier than men. The key drivers behind this view are women’s higher general risk aversion and their greater exposure to AI-related risks.”

    https://rbfirehose.com/2026/04/28/psypost-women-perceive-ai-as-riskier-than-men-do-study-finds/
  6. YouTube’s Risk Assessments Are Not Publicly Testable

    By Cliff Potts, CSO, and Editor-in-Chief of WPS News

    Baybay City, Leyte, Philippines — April 26, 2026

    Reporting

    Under the Digital Services Act (DSA), very large online platforms are required to conduct regular risk assessments addressing systemic harms, including the amplification of illegal content, threats to civic discourse, and impacts on fundamental rights. YouTube has stated that it complies with these obligations through internal evaluations and mitigation plans submitted to EU authorities.

    What remains unavailable is the evidence needed to independently test those claims.

    Public disclosures summarize conclusions but not methods. They describe risks in general terms without detailing assumptions, metrics, or counterfactuals. External researchers, journalists, and civil-society groups are asked to trust that assessments are rigorous while being denied access to the data that would allow verification.

    In effect, YouTube reports that it has assessed risk—without showing how.

    Analysis

    A risk assessment that cannot be tested is a corporate assertion, not an accountability mechanism.

    Meaningful oversight requires more than assurances. It requires visibility into the indicators used, the thresholds applied, and the trade-offs accepted. Without this information, regulators cannot determine whether mitigation measures address root causes or merely manage appearances.

    This opacity reflects incentives shaped at the parent level. Google has long resisted external auditing of its core systems, citing security and proprietary concerns. While some confidentiality is legitimate, blanket opacity prevents independent scrutiny of claims that directly affect public life.

    The result is a one-sided process: platforms define risk, evaluate themselves, and report outcomes in summary form. EU oversight is left to review conclusions rather than interrogate evidence.

    What Remains Unclear

    YouTube does not disclose the specific metrics used to assess systemic risk within EU member states, nor how those metrics vary by language, topic, or election cycle. It also does not publish the results of stress tests showing how changes to recommendations or monetization would alter risk profiles.

    Without access to these details, neither regulators nor the public can judge whether risk mitigation is proportionate or effective.

    Why This Matters

    The DSA was designed to move beyond trust-based governance. Its purpose is to replace assurances with evidence. When platforms provide only summaries, that purpose is undermined.

    If risk assessments remain shielded from independent evaluation, enforcement becomes reactive rather than preventive. Harm is identified after it spreads, not before it is amplified.

    For EU regulators, the question is straightforward: can a system built on self-assessment deliver public accountability? Until YouTube’s risk evaluations are open to meaningful testing, that question remains unanswered.

    References (APA)

    European Commission. (2024). Digital Services Act: Systemic risk assessment and mitigation obligations.
    European Digital Rights (EDRi). (2023). Platform risk assessments and the limits of self-reporting.
    Pasquale, F. (2020). New laws of robotics: Defending human expertise in the age of AI. Harvard University Press.

    #algorithms #DigitalServicesAct #Google #platformAccountability #riskAssessment #YouTube
  7. 📊 Our comprehensive risk assessment of Broadcom ($AVGO) uncovers 8 critical exposures beyond standard disclosures - from customer concentration (~50% revenue from top 5) to NVIDIA's ($NVDA) platform encroachment in AI networking. post.kapualabs.com/yckjunx9 #Semiconductors #AI #Investing #RiskAssessment

  8. From a game theory perspective: The Strait of Hormuz presents a classic Schelling game where maritime transit risk converts tactical incidents into systemic consequences. Analysis outlines four strategic equilibria with probabilities and critical tripwires. post.kapualabs.com/bc63pp23 #Geopolitics #RiskAssessment #GameTheory #StraitOfHormuz

  9. CW: good thread on LLM risk assessment / measurement

    Excellent thoughtful thread here from @glyph and others, about the nature of LLM coding etc, and how we can evaluate its effects without being misled by subjective impressions:

    mastodon.social/@glyph/1162202

    #LLMs #coding #RiskAssessment #measurement

  10. Risk modeling identifies your actual assets, realistic threats, and vulnerabilities. Then threat modeling maps HOW those threats exploit vulnerabilities. Skip risk modeling and you only model attack paths you randomly thought of, not ones that threaten your business.

    No defense standard lets you skip risk assessment. GrapheneOS does threat modeling for their chosen threats but without your risk model, critical HOWs never got modeled.

    Threat modeling without risk modeling is blind darts. You only secure the attack paths you randomly thought of.

    2/2

    #InfoSec #CyberSecurity #ThreatModeling #RiskManagement #Security #RiskAssessment #AppSec

  11. "Idk, I think it's less about empathy or compassion, and more about making sure the (hopefully small) group of uncaring people who enter medicine or allied health professions are incentivised to act pro-socially.
    Like, in your region, what studying conditions and working conditions are clinicians expected to tolerate? Do they receive a living wage throughout study and work? Is their livelihood for a single person or big enough they can build a household?

    Do they feel comfortable being a whistleblower when something goes wrong? (Numbers-wise, mistakes will happen over time.)
    How robust is the inspection or regulation? How flexible are the logistics of each sector?
    How many crises at once can be weathered without change to patient care? How long will it take to build back up that 'extra' capacity / contingency margin?

    Who (in what position, in which teams) is responsible (or in what order) for noticing angels of death? What about anyone with less final but still harmful motives?

    How severe are the consequences for failure? How many people feel compelled to hide errors rather than remedy them? What options for remediation are accessible to all staff, including contractors?"

    #MedMastodon #DoLessHarm #SystemsThinking #HarmReduction #Medical #Medicine #MD #Systems #NonExpert #LayPerson #RubberNecking #Risk #RiskAssessment #RiskManagement #Risk_Management #Audit #Efficiency #Efficacy #Flexibility #TrueCrime #Crime

  12. The suspected rail sabotage in northern Italy highlights a recurring challenge: protecting physical infrastructure during high-profile global events.

    With fires, damaged signaling components, and hours-long delays reported, the incident underscores how transport systems remain exposed to disruption even without advanced technical methods.

    Source: therecord.media/italy-suspecte

    💬 How should critical infrastructure protection evolve for large-scale international events?

    🔔 Follow TechNadu for ongoing analysis of infrastructure and security risks

    #CriticalInfrastructure #InfrastructureSecurity #PhysicalSecurity #RiskAssessment #PublicTransport #TechNadu

  13. I have some concerns about financial risk factors we’re facing in 2026 - so I wrote a thing expressing my take on global threats and risks facing investors, financial advisors and policy makers. Just my personal research and opinions mind you. Those interested in #investing and #Finance may find some value; or at least grist for a conversation. I do include quotes from recognized authorities and some relevant charts and graphs. I’d love to discuss, have a legit conversation about these issues - so feel free to have a think and respond, argue, clarify…whatever. I surely didn’t cover all the global risks, focusing on those I identified in my 2023 report (linked in the update) on the subject. #RiskAssessment #GlobalEconomy #StockMarket docs.google.com/document/d/1fx