home.social

#classifier — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #classifier, aggregated by home.social.

  1. @phpmacher #RSS nutzen: Sicher. Immer noch populär. Ob als Eingangsformat für andere Medien oder direkte Nutzung durch Menschen.
    Gibt diverse Systeme die lokal aggregieren und filtern, auch self-hostable. Teilweise über die eingebetteten #Metadaten, teilweise mit #Regex gegen andere Inhaltsfelder.
    Der Schritt von da zu einer Art #Bayesfilter oder den NN wie sie #rspamd zur Klassifizierung nutzt oder sogar einem SLM/kleineren #LLM auf #Ollama, ggf. mit #TPU / #APU Support sollte überschaubar groß sein.
    Eins der Probleme wird aber die zunehmend schlechte Feedqualität was Tags/Metadaten und der notorische (wenn gleich sehr verständliche) Hang zur Nicht-Auslieferung des Volltext im Feed sein. Da müsste man ggf auf Verdacht das Original fetchen, ggf. als zweite Stufe.

    Und natürlich ist ein #Recommender-System was deutlich anderes als ein mehr oder minder ausgefeilter #Classifier. Aber lassen wir das...

  2. 'Consistent Multiclass Algorithms for Complex Metrics and Constraints', by Harikrishna Narasimhan et al.

    jmlr.org/papers/v25/22-1137.ht

    #multiclass #classifier #classification

  3. 'Regimes of No Gain in Multi-class Active Learning', by Gan Yuan, Yunfan Zhao, Samory Kpotufe.

    jmlr.org/papers/v25/23-0234.ht

    #classifier #classification #classes

  4. How would you go about creating a filter that blocks posts about things that people hate?

    I've thought I could build a text classifier, but it could be hard to train since I'd need to guess whether or not the author hates the thing they are posting about.

    I wouldn't want it to become a filter for all current events news, but I suspect that's what it would become.

    #fediverse #mastodon #machineLearning #tfidf #classification #socialMedia #classifier #textAnalysis #programming #tech #technology

  5. 'Multi-class Probabilistic Bounds for Majority Vote Classifiers with Partially Labeled Data', by Vasilii Feofanov, Emilie Devijver, Massih-Reza Amini.

    jmlr.org/papers/v25/23-0121.ht

    #classifiers #classifier #labeling

  6. 'On the Learnability of Out-of-distribution Detection', by Zhen Fang, Yixuan Li, Feng Liu, Bo Han, Jie Lu.

    jmlr.org/papers/v25/23-1257.ht

    #learnability #classifier #detection

  7. 'A Multilabel Classification Framework for Approximate Nearest Neighbor Search', by Ville Hyvönen, Elias Jääsaari, Teemu Roos.

    jmlr.org/papers/v25/23-0286.ht

    #classification #classifiers #classifier

  8. Except for naming the shapes I figure something like following might work?

    - Render each Unicode script in several fonts.

    - Training per-script classifiers (OCRs) on those.

    - Run the per-script classifiers on each others training data to identify homographs.

    There's a tricky bits, like creating non-confusable out-of-script scribbles to include in training data, etc.

    Still... this doesn't seem like all that impossible?

    What am I missing?

    #unicode #homography #ml #classifier

  9. Делаем intent classifier для службы поддержки без доменного датасета на русском

    В этой статье я продемонстрирую, как без собственного датасета сделать классификатор намерений пользователя для службы поддержки в сфере e-commerce. И более того, я расскажу, как у меня получилось сделать классификатор для русского языка без датасета на русском языке. Меня зовут Елизавета Колмакова, я Data Scientist в компании, которая разрабатывает айти-решения для крупного ритейла.

    habr.com/ru/articles/792542/

    #intent_recognition #intent #encoder #служба_поддержки #no_dataset #classifier #telegrambot #nlp #bot

  10. Делаем intent classifier для службы поддержки без доменного датасета на русском

    В этой статье я продемонстрирую, как без собственного датасета сделать классификатор намерений пользователя для службы поддержки в сфере e-commerce. И более того, я расскажу, как у меня получилось сделать классификатор для русского языка без датасета на русском языке. Меня зовут Елизавета Колмакова, я Data Scientist в компании, которая разрабатывает айти-решения для крупного ритейла.

    habr.com/ru/articles/792542/

    #intent_recognition #intent #encoder #служба_поддержки #no_dataset #classifier #telegrambot #nlp #bot

  11. 'Set-valued Classification with Out-of-distribution Detection for Many Classes', by Zhou Wang, Xingye Qiao.

    jmlr.org/papers/v24/23-0712.ht

    #classifier #classification #classes

  12. I already have a Mastodon client (schizo.social) so I'm already on my way!

    I want to pull the user's favorites and run them through a #machineLearning #classifier of some sort.

    Then when I pull their Timeline I can compare each post to their favorites and determine a similarity score.

    I envision a sort of slider that lets them set a threshold of "favoriteness" to filter by; posts that are unlikely to be favorites will be hidden.

  13. 'Erratum: Risk Bounds for the Majority Vote: From a PAC-Bayesian Analysis to a Learning Algorithm', by Louis-Philippe Vignault, Audrey Durand, Pascal Germain.

    jmlr.org/papers/v24/22-0747.ht

    #bound #classifier #majority

  14. I think what I'll do is tally up the words in every toot and then use tf-idf to calculate a match score for each toot. Then I can specify a threshold that will show/hide toots in my timeline based on these.

    This will require manual labeling to train the classifier, but I think it should work.

    en.wikipedia.org/wiki/Tf%E2%80

    #ml #classification #classifier #machinelearning

  15. 'Leaky Hockey Stick Loss: The First Negatively Divergent Margin-based Loss Function for Classification', by Oh-Ran Kwon, Hui Zou.

    jmlr.org/papers/v24/22-1104.ht

    #classifier #classification #svm

  16. 'Random Forests for Change Point Detection', by Malte Londschien, Peter Bühlmann, Solt Kovács.

    jmlr.org/papers/v24/22-0512.ht

    #changeforest #classifier #classifiers

  17. Mitigating Confirmation Bias in Semi-supervised Learning via Efficient Bayesian Model Averaging

    Charlotte Loh, Rumen Dangovski, Shivchander Sudalairaj et al.

    Action editor: Frederic Sala.

    openreview.net/forum?id=PRrKOa

    #supervised #labeling #classifier

  18. I have plenty more achievable goals for schizo.social (like multi-account, or #Calckey support) but something I'd love to try is #classifying posts with #machineLearning #tfidf

    I'd like to be able to define "labels" and then train it to identify those on the fly. Then either mute or highlight posts that #classify highly.

    Not so much an #algorithm, as a #filter.

    #ai #ml #webDev #classifier

  19. I have plenty more achievable goals for schizo.social (like multi-account, or #Calckey support) but something I'd love to try is #classifying posts with #machineLearning #tfidf

    I'd like to be able to define "labels" and then train it to identify those on the fly. Then either mute or highlight posts that #classify highly.

    Not so much an #algorithm, as a #filter.

    #ai #ml #webDev #classifier

  20. I have plenty more achievable goals for schizo.social (like multi-account, or #Calckey support) but something I'd love to try is #classifying posts with #machineLearning #tfidf

    I'd like to be able to define "labels" and then train it to identify those on the fly. Then either mute or highlight posts that #classify highly.

    Not so much an #algorithm, as a #filter.

    #ai #ml #webDev #classifier

  21. I have plenty more achievable goals for schizo.social (like multi-account, or support) but something I'd love to try is posts with

    I'd like to be able to define "labels" and then train it to identify those on the fly. Then either mute or highlight posts that highly.

    Not so much an , as a .

  22. I have plenty more achievable goals for schizo.social (like multi-account, or #Calckey support) but something I'd love to try is #classifying posts with #machineLearning #tfidf

    I'd like to be able to define "labels" and then train it to identify those on the fly. Then either mute or highlight posts that #classify highly.

    Not so much an #algorithm, as a #filter.

    #ai #ml #webDev #classifier

  23. Invariant Feature Coding using Tensor Product Representation

    YUSUKE Mukuta, Tatsuya Harada

    Action editor: Seungjin Choi.

    openreview.net/forum?id=uv32JO

    #tensor #representations #classifier

  24. Predicting Out-of-Domain Generalization with Neighborhood Invariance

    Nathan Hoyen Ng, Neha Hulkund, Kyunghyun Cho, Marzyeh Ghassemi

    Action editor: Vincent Dumoulin.

    openreview.net/forum?id=jYkWdJ

    #generalization #classifier #classification

  25. Finding Competence Regions in Domain Generalization

    Jens Müller, Stefan T. Radev, Robert Schmier, Felix Draxler, Carsten Rother, Ullrich Koethe

    Action editor: Hanwang Zhang.

    openreview.net/forum?id=TSy0vu

    #classifiers #accuracy #classifier

  26. Assuming Locally Equal Calibration Errors for Non-Parametric Multiclass Calibration

    Kaspar Valk, Meelis Kull

    Action editor: Aditya Menon.

    openreview.net/forum?id=na5sHG

    #classifiers #classifier #calibration

  27. 'Risk Bounds for Positive-Unlabeled Learning Under the Selected At Random Assumption', by Olivier Coudray, Christine Keribin, Pascal Massart, Patrick Pamphile.

    jmlr.org/papers/v24/22-067.htm

    #labeled #classification #classifier

  28. Bridging performance gap between minimal and maximal SVM models

    Ondrej Such, René Fabricius

    openreview.net/forum?id=SM1Bkj

    #svm #svms #classifier

  29. Bridging performance gap between minimal and maximal SVM models

    Ondrej Such, René Fabricius

    openreview.net/forum?id=SM1Bkj

    #svm #svms #classifier

  30. Bridging performance gap between minimal and maximal SVM models

    Ondrej Such, René Fabricius

    openreview.net/forum?id=SM1Bkj

    #svm #svms #classifier

  31. Bridging performance gap between minimal and maximal SVM models

    Ondrej Such, René Fabricius

    openreview.net/forum?id=SM1Bkj

    #svm #svms #classifier

  32. Bridging performance gap between minimal and maximal SVM models

    Ondrej Such, René Fabricius

    openreview.net/forum?id=SM1Bkj

    #svm #svms #classifier

  33. Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logist...

    Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    openreview.net/forum?id=jM8nzU

    #classification #classify #classifier

  34. Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logist...

    Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    openreview.net/forum?id=jM8nzU

    #classification #classify #classifier

  35. Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logist...

    Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    openreview.net/forum?id=jM8nzU

    #classification #classify #classifier

  36. Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logist...

    Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    openreview.net/forum?id=jM8nzU

    #classification #classify #classifier

  37. Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logist...

    Akash Srivastava, Seungwook Han, Kai Xu, Benjamin Rhodes, Michael U. Gutmann

    openreview.net/forum?id=jM8nzU

    #classification #classify #classifier

  38. KRADA: Known-region-aware Domain Alignment for Open-set Domain Adaptation in Semantic Segmentation

    Chenhong Zhou, Feng Liu, Chen Gong, Rongfei Zeng, Tongliang Liu, William Cheung, Bo Han

    openreview.net/forum?id=5II12y

    #classify #labeled #classifier

  39. KRADA: Known-region-aware Domain Alignment for Open-set Domain Adaptation in Semantic Segmentation

    Chenhong Zhou, Feng Liu, Chen Gong, Rongfei Zeng, Tongliang Liu, William Cheung, Bo Han

    openreview.net/forum?id=5II12y

    #classify #labeled #classifier

  40. KRADA: Known-region-aware Domain Alignment for Open-set Domain Adaptation in Semantic Segmentation

    Chenhong Zhou, Feng Liu, Chen Gong, Rongfei Zeng, Tongliang Liu, William Cheung, Bo Han

    openreview.net/forum?id=5II12y

    #classify #labeled #classifier

  41. KRADA: Known-region-aware Domain Alignment for Open-set Domain Adaptation in Semantic Segmentation

    Chenhong Zhou, Feng Liu, Chen Gong, Rongfei Zeng, Tongliang Liu, William Cheung, Bo Han

    openreview.net/forum?id=5II12y

    #classify #labeled #classifier