home.social

#blas — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #blas, aggregated by home.social.

  1. #Copilot and I are about 30% away from creating a #Pascal version of #LAPACK using #BLAS. We are about two days away from achieving 80% of LAPACK. Then we will tweak it using some GPU acceleration to make its speed comparable to some python libraries like Numpy.

    It is important to note that one must be very disciplined in keeping clean documentations, a thorough and tight testing cycle, a rigid workflow pattern, or an AI will tend to skip tests, become sloppy and lose focus.

    #AI #LLM

  2. #Copilot and I are about 30% away from creating a #Pascal version of #LAPACK using #BLAS. We are about two days away from achieving 80% of LAPACK. Then we will tweak it using some GPU acceleration to make its speed comparable to some python libraries like Numpy.

    It is important to note that one must be very disciplined in keeping clean documentations, a thorough and tight testing cycle, a rigid workflow pattern, or an AI will tend to skip tests, become sloppy and lose focus.

    #AI #LLM

  3. #Copilot and I are about 30% away from creating a #Pascal version of #LAPACK using #BLAS. We are about two days away from achieving 80% of LAPACK. Then we will tweak it using some GPU acceleration to make its speed comparable to some python libraries like Numpy.

    It is important to note that one must be very disciplined in keeping clean documentations, a thorough and tight testing cycle, a rigid workflow pattern, or an AI will tend to skip tests, become sloppy and lose focus.

    #AI #LLM

  4. #Copilot and I are about 30% away from creating a #Pascal version of #LAPACK using #BLAS. We are about two days away from achieving 80% of LAPACK. Then we will tweak it using some GPU acceleration to make its speed comparable to some python libraries like Numpy.

    It is important to note that one must be very disciplined in keeping clean documentations, a thorough and tight testing cycle, a rigid workflow pattern, or an AI will tend to skip tests, become sloppy and lose focus.

    #AI #LLM

  5. #AI illiteracy is real. While still arguing with a bunch of AI haters, #Copilot and I just finished our #Pascal #BLAS level 1-3 Implementation plus eigenvalue, cholesky, and sparse #matrix, so we will never need #python, #C, C#, #Rust, ... for our Small Language Project. We will expand our Pascal Numeric Library (PNL) v1.0 to something like #Numpy and #Pytorch, but with static arrays, deterministic data structure, no referencing, no pointer arithmetic.

    #LLM #programming #computer

  6. #AI illiteracy is real. While still arguing with a bunch of AI haters, #Copilot and I just finished our #Pascal #BLAS level 1-3 Implementation plus eigenvalue, cholesky, and sparse #matrix, so we will never need #python, #C, C#, #Rust, ... for our Small Language Project. We will expand our Pascal Numeric Library (PNL) v1.0 to something like #Numpy and #Pytorch, but with static arrays, deterministic data structure, no referencing, no pointer arithmetic.

    #LLM #programming #computer

  7. #AI illiteracy is real. While still arguing with a bunch of AI haters, #Copilot and I just finished our #Pascal #BLAS level 1-3 Implementation plus eigenvalue, cholesky, and sparse #matrix, so we will never need #python, #C, C#, #Rust, ... for our Small Language Project. We will expand our Pascal Numeric Library (PNL) v1.0 to something like #Numpy and #Pytorch, but with static arrays, deterministic data structure, no referencing, no pointer arithmetic.

    #LLM #programming #computer

  8. #AI illiteracy is real. While still arguing with a bunch of AI haters, #Copilot and I just finished our #Pascal #BLAS level 1-3 Implementation plus eigenvalue, cholesky, and sparse #matrix, so we will never need #python, #C, C#, #Rust, ... for our Small Language Project. We will expand our Pascal Numeric Library (PNL) v1.0 to something like #Numpy and #Pytorch, but with static arrays, deterministic data structure, no referencing, no pointer arithmetic.

    #LLM #programming #computer

  9. While arguing with some AI haters, #Copilot and I created this Pure #Pascal #BLAS (Level 1,2,3 Core) Implementation in less than 1 day. We encountered many serious problems, including drifting of workflow pattern, getting stuck in a Delphi error loop, overhauling our original design... But as long as you understand AI, keep good documentations, maintain the core structure of the problem,.. you will be able to work with AI successfully. Don't hesitate to use more than one #AI at a time.

    #LLM

  10. While arguing with some AI haters, #Copilot and I created this Pure #Pascal #BLAS (Level 1,2,3 Core) Implementation in less than 1 day. We encountered many serious problems, including drifting of workflow pattern, getting stuck in a Delphi error loop, overhauling our original design... But as long as you understand AI, keep good documentations, maintain the core structure of the problem,.. you will be able to work with AI successfully. Don't hesitate to use more than one #AI at a time.

    #LLM

  11. While arguing with some AI haters, #Copilot and I created this Pure #Pascal #BLAS (Level 1,2,3 Core) Implementation in less than 1 day. We encountered many serious problems, including drifting of workflow pattern, getting stuck in a Delphi error loop, overhauling our original design... But as long as you understand AI, keep good documentations, maintain the core structure of the problem,.. you will be able to work with AI successfully. Don't hesitate to use more than one #AI at a time.

    #LLM

  12. While arguing with some AI haters, #Copilot and I created this Pure #Pascal #BLAS (Level 1,2,3 Core) Implementation in less than 1 day. We encountered many serious problems, including drifting of workflow pattern, getting stuck in a Delphi error loop, overhauling our original design... But as long as you understand AI, keep good documentations, maintain the core structure of the problem,.. you will be able to work with AI successfully. Don't hesitate to use more than one #AI at a time.

    #LLM

  13. What is #BLAS?

    BLAS is a set of fast matrix routines originally written in #Fortran.
    If you’re tired of dynamic types, hidden references, ownership rules, and endless “stream” abstractions, Free #Pascal + BLAS gives you old‑school, deterministic HPC #programming with none of the modern noise.

    #Copilot and I will be using Free Pascal and BLAS for our Small Language Model project #SLM. No more #C, #python, #Rust, or C#

    #AI #LLM #computer

  14. What is #BLAS?

    BLAS is a set of fast matrix routines originally written in #Fortran.
    If you’re tired of dynamic types, hidden references, ownership rules, and endless “stream” abstractions, Free #Pascal + BLAS gives you old‑school, deterministic HPC #programming with none of the modern noise.

    #Copilot and I will be using Free Pascal and BLAS for our Small Language Model project #SLM. No more #C, #python, #Rust, or C#

    #AI #LLM #computer

  15. What is #BLAS?

    BLAS is a set of fast matrix routines originally written in #Fortran.
    If you’re tired of dynamic types, hidden references, ownership rules, and endless “stream” abstractions, Free #Pascal + BLAS gives you old‑school, deterministic HPC #programming with none of the modern noise.

    #Copilot and I will be using Free Pascal and BLAS for our Small Language Model project #SLM. No more #C, #python, #Rust, or C#

    #AI #LLM #computer

  16. What is #BLAS?

    BLAS is a set of fast matrix routines originally written in #Fortran.
    If you’re tired of dynamic types, hidden references, ownership rules, and endless “stream” abstractions, Free #Pascal + BLAS gives you old‑school, deterministic HPC #programming with none of the modern noise.

    #Copilot and I will be using Free Pascal and BLAS for our Small Language Model project #SLM. No more #C, #python, #Rust, or C#

    #AI #LLM #computer

  17. Why do people use #python, a glue language, which is so slow? The only reason is the AI ecosystem.

    #Copilot and I just tested Free Pascal and BLAS for its speed without using #numpy or #pytorch. The result is amazing. It took less than a second to do a 1024x1024 #matrix multiplication.

    We will be using Free #Pascal and #BLAS to write our Small Language Model #SLM using #NNUE.

    #AI #LLM

  18. Why do people use #python, a glue language, which is so slow? The only reason is the AI ecosystem.

    #Copilot and I just tested Free Pascal and BLAS for its speed without using #numpy or #pytorch. The result is amazing. It took less than a second to do a 1024x1024 #matrix multiplication.

    We will be using Free #Pascal and #BLAS to write our Small Language Model #SLM using #NNUE.

    #AI #LLM

  19. Why do people use #python, a glue language, which is so slow? The only reason is the AI ecosystem.

    #Copilot and I just tested Free Pascal and BLAS for its speed without using #numpy or #pytorch. The result is amazing. It took less than a second to do a 1024x1024 #matrix multiplication.

    We will be using Free #Pascal and #BLAS to write our Small Language Model #SLM using #NNUE.

    #AI #LLM

  20. Why do people use #python, a glue language, which is so slow? The only reason is the AI ecosystem.

    #Copilot and I just tested Free Pascal and BLAS for its speed without using #numpy or #pytorch. The result is amazing. It took less than a second to do a 1024x1024 #matrix multiplication.

    We will be using Free #Pascal and #BLAS to write our Small Language Model #SLM using #NNUE.

    #AI #LLM

  21. The plot thickens #BLAS #rstats #lapack
    (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement)
    FlexiBLAS makes this benchmarks a breeze

  22. The plot thickens #BLAS #rstats #lapack
    (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement)
    FlexiBLAS makes this benchmarks a breeze

  23. The plot thickens #BLAS #rstats #lapack
    (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement)
    FlexiBLAS makes this benchmarks a breeze

  24. The plot thickens #BLAS #rstats #lapack
    (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement)
    FlexiBLAS makes this benchmarks a breeze

  25. The plot thickens #BLAS #rstats #lapack
    (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement)
    FlexiBLAS makes this benchmarks a breeze

  26. The plot thickens #BLAS #rstats #lapack (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement) FlexiBLAS makes this benchmarks a breeze

  27. The plot thickens #BLAS #rstats #lapack (When one is about to rip through 10s of millions of medical records, one must profile the tools if the project is to finish before one's retirement) FlexiBLAS makes this benchmarks a breeze

  28. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats. The things I have to do for the love of electronic health records analytics #bigdata #blas

  29. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats. The things I have to do for the love of electronic health records analytics #bigdata #blas

  30. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats.
    The things I have to do for the love of electronic health records analytics #bigdata #blas

  31. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats.
    The things I have to do for the love of electronic health records analytics #bigdata #blas

  32. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats.
    The things I have to do for the love of electronic health records analytics #bigdata #blas

  33. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats.
    The things I have to do for the love of electronic health records analytics #bigdata #blas

  34. I wonder if the #lapack that comes with #AOCL is being picked up by flexiblas in #rstats.
    The things I have to do for the love of electronic health records analytics #bigdata #blas

  35. Time for an #introduction!
    I'm a young Canuck with interests/experience in #HPC, #Linux, #BLAS, #SYCL, #C, #AVX512, #Rust, heterogeneous compute & other such things.

    Currently my personal projects are bringing #FP16 to the #OpenBLAS library, working to standardize what Complex domain BLAS FP16 kernels/implementations should look like, and making sure #SYCL is available everywhere.

    I also write every now and again. Here's the tail of AVX512 FP16 on Alderlake
    gist.github.com/FCLC/56e4b3f4a

  36. Was going through the Risc-V Vector ISA spec (as you do) and noticed this little gem:

    Specifically the line "When 16-bit and 128-bit element widths are added, they will be also be treated as IEEE-754/2008-compatible values. "

    Unless I'm miss interpreting this, is Risc-V indicating future *native* support for 128 bit integer and floating point?

    On the other hand, because I'm that guy: GOSH DARN IT, WHY NOT SHIP FP16 AS PART OF V.1 😭
    github.com/riscv/riscv-v-spec/

    #HPC #BLAS #RiscV #FP16 #ASM

  37. C++26 — прогресс и новинки от ISO C++

    Работа в комитете по стандартизации языка C++ активно кипит. Недавно состоялось очередное заседание. Как один из участников, поделюсь сегодня с Хабром свежими новостями и описанием изменений, которые планируются в С++26. До нового стандарта C++ остаётся чуть больше года, и вот некоторые новинки, которые попали в черновик стандарта за последние две встречи: запрет возврата из функции ссылок на временное значение, [[indeterminate]] и уменьшение количества Undefined Behavior, диагностика при =delete; , арифметика насыщения, линейная алгебра (да-да! BLAS и немного LAPACK), индексирование variadic-параметров и шаблонов ...[42] , вменяемый assert(...) , и другие приятные мелочи. Помимо этого, вас ждут планы и прогресс комитета по большим фичам и многое другое. Рассмотрим новинки на примерах

    habr.com/ru/companies/yandex/a

    #c++ #с++ #constexpr #c++26 #с++26 #numeric #floating_point #float #double #iso #программирование #span #functions #function #blas #lapack #atomic #linear_algebra #variadic_templates