home.social

#httrack — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #httrack, aggregated by home.social.

  1. Looking for a self-hosted tool for comprehensive website mirroring with a web interface, one that can crawl an entire website and export browsable HTML files. #phản chiếu trang web #tự lưu trữ #wget #HTTrack #selfhosted #web_mirroring #docker #unraid

    reddit.com/r/selfhosted/commen

  2. the biggest thing i need ai to do for me is to have a high initial elo ranking but also be trainable to scan all local docs, and then also bring in lots of real time data and open datasets 24/7 and display results on a series of dashboards #rag #pydantic #yacy #httrack #cached version #best stacks #free for commercial use #competitive intel #tailored data

  6. 🌐🤦‍♂️ "Look Ma, I copied the entire internet! #HTTrack, the digital hoarder's dream, lets you download the web so you can finally browse those cat memes offline. Because nothing screams cutting-edge technology like reading 2005 forum threads in 2023." 📂😂
    httrack.com/ #OfflineBrowsing #DigitalHoarding #InternetArchive #CatMemes #Nostalgia #HackerNews #ngated

  11. #HTTrack - The #Website Downloader

    In this tutorial I show you how to save entire websites for offline access with HTTrack. Whether it is for your own backup or just for browsing without an internet connection, I show you step by step how it works.

    gnulinux.ch/httrack-der-websit

  12. HTTrack - The Website Downloader

    In this tutorial I show you how to save entire websites for offline access with HTTrack. Whether it is for your own backup or just for browsing without an internet connection, I show you step by step how it works.

    #httrack #Curl #wget #Website #Linux

    gnulinux.ch/httrack-der-websit

  17. I am looking for archive.org as a self hosted service.

    I want to have an automated static copy of a website, which preserves old copied versions.

    It should provide a #crawler and a web interface to access the archived versions of the website.

    The use case is a lousy CMS which often destroys content. I want to be able to restore content from the archive and to have a static website copy in the worst case.

    #SelfHosting #WebsiteArchive #Archive #OffsiteBackp #Backup #HTTrack #WebsiteCopy

  21. Actually, lemme think out loud about what I need #HTTrack to do, before I forget. It needs to pull jpg, png and svg images, javascript and any external CSS*) from any level within the Comicfury.com domain, but external links need to be skipped for mirroring.

    *)AFAIK all CSS within ComicFury is inline! A baffling decision but one that will make my life easier with this. But I may be mistaken.
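
The requirements in the post above (images, JavaScript and external CSS from anywhere under one domain, nothing from outside it) could be expressed as HTTrack scan filters. The following is only a minimal sketch, not the poster's actual setup: the helper name `build_httrack_command` and the output directory are hypothetical, and it assumes HTTrack's default behaviour of staying on the start address plus the documented `+pattern` filter syntax and the `-O` output option. The extra filters only widen the scope to matching asset files on subdomains.

```python
from urllib.parse import urlparse

# Asset types named in the post: images, JavaScript, external CSS.
WANTED_EXTENSIONS = ("jpg", "png", "svg", "js", "css")


def build_httrack_command(start_url: str, output_dir: str) -> list[str]:
    """Return an httrack argument list for mirroring one domain (hypothetical helper)."""
    host = urlparse(start_url).hostname
    if host is None:
        raise ValueError(f"no host in URL: {start_url!r}")
    domain = host.removeprefix("www.")

    # Start URL and output directory; by default HTTrack is assumed to stay
    # on the start address, so external links are not followed.
    command = ["httrack", start_url, "-O", output_dir]

    # "+" filters additionally allow the wanted asset types on any subdomain.
    for extension in WANTED_EXTENSIONS:
        command.append(f"+*.{domain}/*.{extension}")
    return command


if __name__ == "__main__":
    # Print the command instead of running it, so it can be reviewed first.
    print(" ".join(build_httrack_command("https://comicfury.com/", "./mirror")))
```

Passing the returned list to `subprocess.run(command, check=True)` would start the mirror. Whether the filters catch everything depends on where the site actually hosts its assets, which the post does not state.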

  26. search engine on a stick would be a fun project: 1 TB NVMe, encrypted, persistent, bootable, and you can spider your own sites in addition to the top 10k sites already crawled and indexed. yacy could stand to be much more automated; it is a bit of work to get it set up (not the config, just getting all the sites loaded) #httrack

  29. #HTTrack seems to be unmaintained (last release in 2017).

    Is there any maintained, recent open-source mirroring solution that can offload auth to a browser (for example, like #destreamer can)?

    httrack.com/

  34. Sometimes you want to back up an entire web presence. #Httrack is a good tool for that too, but the default settings need to be adjusted. #OSINT bashinho.de/2024/01/18/webseit

  39. Once again I'm convinced that #wget is a great thing!

    A small booru announced it was shutting down, so I decided to save a copy for myself.

    First I tried #HTTrack; it chugged along for half a day and saved only the HTML files.

    At first wget refused to mirror the site, but I added -U and everything worked. In about two hours it downloaded the whole site and all the images.

    Now I own ~1800 medium-quality images and don't know what to do with them.
    :blobcatshrug:
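
The post above only says that adding `-U` (wget's user-agent option) made the mirror work. It gives neither the user-agent value nor the other options used. As a rough sketch under those assumptions, a mirroring command with a custom user agent could be assembled like this. The helper name, the example URL and the user-agent string are all hypothetical. `--mirror`, `--page-requisites` and `--convert-links` are standard wget options chosen here for illustration.

```python
def build_wget_mirror_command(url: str, user_agent: str) -> list[str]:
    """Return a wget argument list for mirroring a site (hypothetical helper)."""
    return [
        "wget",
        "--mirror",            # recursive download with timestamping
        "--page-requisites",   # also fetch images, CSS and scripts used by pages
        "--convert-links",     # rewrite links so the copy is browsable offline
        "-U", user_agent,      # send a custom User-Agent header, as in the post
        url,
    ]


if __name__ == "__main__":
    # Placeholder values; the post does not say which ones were used.
    command = build_wget_mirror_command(
        "https://example.org/",
        "Mozilla/5.0 (X11; Linux x86_64)",
    )
    print(" ".join(command))
```

Some sites reject wget's default user agent, which would be consistent with what the post describes, although the post does not confirm the cause.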
