Skip to content

Hugo tutorial #1435

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 29 commits into
base: main-flask
Choose a base branch
from
Open

Conversation

hmiesen
Copy link
Contributor

@hmiesen hmiesen commented Aug 19, 2025

This pull request includes a new tutorial on building and hosting a static site with Hugo. Within the Digital Research Support (DRS) team, there was a need for a more accessible tutorial that we can refer our researchers to, tailored to their needs. If you decide to adopt this tutorial, I’ll leave it up to you whether you want it to exist alongside the existing Hugo tutorial. I would be glad to receive your feedback. Please don’t hesitate to ask questions or share comments.

hmiesen added 29 commits May 2, 2025 12:13
* Summary commit of all changes

* Changed to Tilburg Sciencehub credentials

* Removed code lines in README.md

* snake_case implementation

* extract_all_http_links function made async

* uncommented workflow_dispatch for testing purposes

* Made the script fully async

* Bugfix: wrong variabel name

* Debugging broken_link_checker.py

* Made identify_broken_links function async

* Changed username for testing purposes

* Made push_issue_git_batched async

* Bugfix: forgot await in call

* More strict external link check

* More strict external link check

* More refined filtering of broken links

* Changed name of function and variable

* Sleep between batches of requests, lower concurrency

* Semaphore in check_all_urls

* Fixed status code 999

* Fixed status code 999

* Small fixes

* Removed pandas from code

* Further code refinement

* Concurrency and retry logic implemented

* Added remarks to clarify code

* Removed duplicate code lines

* Set concurrency in first pass to 4

* Use of list of rules

* Small fixes

* Test run with 10 pages

* Test run with 10 pages

* Distinction between internal links and external links implemented

* Changed user to own account for testing purposes

* Modulair approach with classes

* Distinction between internal and external links in issue

* Filter out status code None

* Delay added per domain (for external links)

* Bug fix + smart throttling

* Convert https to http and check

* Moved conversion of https to http to extract_links and check

* Error fetching status=0 fix

* Error fetching status=0 fix

* Split by domain

* Further sharpening

* Further sharpening: skip external 403s

* Bug fix: skip external 403s

* Bug fix: skip external 403s

* Check all links

* Bugfix class of reporter

* Bugfix class of reporter

* Bugfix class of reporter

* Bugfix class of reporter

* Bugfix class of reporter

* Bugfix class of reporter

* Scrape 20 pages

* Scrape 20 pages

* Sitemap class

* Sitemap class bug fixes

* Sitemap class bug fixes

* Conversion to constants

* Class Reporter revision

* GITHUB repo constant

* LinkErrorChecker class

* LinkErrorChecker class bugfixes

* LinkErrorChecker class bugfixes

* LinkErrorChecker class bugfixes

* LinkErrorChecker improvements

* LinkErrorChecker improvements

* Error loging in main_async_scraper

* Debugging: match_broken_links

* Linkextracter class added

* Linkextracter bugfixes

* Linkextracter bugfixes

* Linkextracter bugfixes

* LinkErrorChecker bugfixes

* LinkErrorChecker bugfixes

* LinkExtractor bugfixes

* General bugfixes

* General bugfixes

* General bugfixes

* Scrape all pages

* Reverted code to non-modular version, removed doublures

* TSH credentials

* Added inline comments and docstrings to code

* Converted inline comments from what to why

* Test round

* Tilburg Sciencehub domain check

* Tilburg Sciencehub github issues credentials

* Removed print statement and obsolete file

* Modified Generate JWT step

* Indentation change

* Indentation change

* Changed Generate JWT step

* Changed Generate JWT step

* Testrun with add-mask

* Testrun with add-mask: debugging

* tilburg sciencehub credentials
@hmiesen hmiesen requested a review from lachlandeer as a code owner August 19, 2025 06:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant