home.social

#sitemaps — Public Fediverse posts

Live and recent posts from across the Fediverse tagged #sitemaps, aggregated by home.social.

  1. Today's term of the day is:

    Sitemaps are outlines, or maps, of where the pages on a website are located.

    They are typically used to improve search engine indexing. 🔍

  2. Evergreen crawling with XML sitemaps (updated to run in bulk)

    The script now takes a list of tuples, each holding an XML sitemap URL and a website name, and runs through them all, creating dynamic file names, for example:

    {name}_sitemap.csv
    {name}_crawl.jl
    {name}_errors.txt

    Now you just need to add a URL and a name to add a new website to the process, instead of creating new specific files from scratch.

    bit.ly/4fnhiSO

    #advertools #crawling #sitemaps #scraping #SEO #DataScience #Python
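
The bulk pattern described in the post above can be sketched roughly as follows. This is not the linked script, which appears to build on advertools. It is a minimal standard-library illustration of looping over (sitemap URL, name) tuples and deriving per-site file names. The helper names `output_paths`, `sitemap_urls`, and `run_all`, the `fetch` callable, and the `out_dir` parameter are all hypothetical, and the crawl step is only given a file name here, not implemented.

```python
import csv
import xml.etree.ElementTree as ET
from pathlib import Path

# Namespace used by standard XML sitemaps (sitemaps.org protocol).
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def output_paths(name, out_dir="."):
    """Build the per-site file names from the website name (hypothetical helper)."""
    base = Path(out_dir)
    return {
        "sitemap": base / f"{name}_sitemap.csv",
        "crawl": base / f"{name}_crawl.jl",    # reserved for a crawler's output
        "errors": base / f"{name}_errors.txt",
    }


def sitemap_urls(xml_text):
    """Return the <loc> values of every <url> entry in a sitemap document."""
    root = ET.fromstring(xml_text)
    locs = root.findall("sm:url/sm:loc", SITEMAP_NS)
    return [loc.text.strip() for loc in locs if loc.text]


def run_all(sites, fetch, out_dir="."):
    """Process every (sitemap_url, name) tuple.

    `fetch` is an assumed callable that takes a URL and returns the sitemap XML.
    Returns a dict mapping each name to its URL count, or None on failure.
    """
    results = {}
    for sitemap_url, name in sites:
        paths = output_paths(name, out_dir)
        try:
            urls = sitemap_urls(fetch(sitemap_url))
            with open(paths["sitemap"], "w", newline="", encoding="utf-8") as f:
                writer = csv.writer(f)
                writer.writerow(["loc"])
                writer.writerows([u] for u in urls)
            results[name] = len(urls)
        except Exception as exc:
            # One failing site is logged to its own file and does not stop the run.
            with open(paths["errors"], "a", encoding="utf-8") as f:
                f.write(f"{sitemap_url}\t{exc!r}\n")
            results[name] = None
    return results
```

With this shape, adding a website means appending one tuple to the `sites` list, for instance `("https://example.com/sitemap.xml", "example")`, where the URL is a placeholder. Passing `fetch` in as an argument keeps the loop testable without network access.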