~dbohdan

Hellooooo. Welcome to my user site on tilde.institute.

Find me elsewhere

dbohdan.com Main site, a personal wiki
dbohdan.sdf.org Homepage on SDF with Small Internet and NetBSD resources

More tildeverse

tilde.institute catalog

You can browse user sites on tilde.institute using the stats page. The problem is that most sites are empty. The following is a catalog of non-empty sites.

The sites were selected from over 900 listed on the stats page on 2024-07-16. This is the second catalog of this sort that I have made. The first was the “random SDF homepages” feature on my own SDF homepage. Here, I decided to use a local LLM instead of a set of heuristics to determine what sites had content. I wrote a Python script that used several common Python libraries to retrieve and parse the sites’ index pages and used ollama-python with the Llama 3 8B model to analyze them. Built with Meta Llama 3, the script is.

The prompt was:

Is a webpage with the following text likely empty, a placeholder, a test page, or otherwise not worth listing in a catalog because of sparse content? Truncated text is okay. Answer "YES" or "NO".

```
(Page text goes here.)
```

The script kept sites whose front pages either had the network say “YES” or contained an HTML element <audio>, <iframe>, <img>, <script>, or <video>.

Processing over 900 pages took an hour on a Ryzen 5 3500 CPU with Ollama configured to only use three threads, effectively three cores. The prompt ended up excluding sites that were directory listings and including some sites with only a single line of text. Because of Llama 3’s limited context length of 8192 tokens, my script did not feed the network raw HTML. Text content was extracted from the pages, shortened to 6000 characters, and inserted in the prompt. This no doubt distorted the results.

Initially, I made a typo in the prompt: “Trucated text is okay.” The number of sites increased from 134 to 141 when I fixed the typo and reran the job.

The generated list is presented below without edits. It is also available as JSON.

141 site total. Expand.

TheToon “TheToon's ~”
adi “Adrian Emil Grigore”
ajgraves
akkartik “~akkartik”
ali “person named ali”
alilleybrinker “Andrew Lilley Brinker”
amitla “Dead end”
andinus “Andinus”
aniruddh “Aniruddh Nishad Web Page - tilde.institute”
anon “Anon page!”
anselm “Anselm's Private Journal”
aob
ashley “the pacific ocean has always struck me as, kinda, big”
asp
averagelinuxusr “justanotherinternetguy”
balou “hi”
bea “Bea's page”
beepboop
birabittoh
bolachanje “Arquitecturas Inteligentes”
bollosugar “arquitecturas retro”
bossley9 “tilde.institute :: Public-access OpenBSD system”
bruce42 “bruce42 - music details”
bubbyroosh “Hello :D”
chimay “‎”
chis
claudiom “~claudiom”
cmccabe “cmccabe's OpenBSD testground</head>”
comradecrow “ComradeCrow | tilde.institute”
crystal “Crystal's Website 💜”
danazu “Input/Output”
danisanti “danisanti”
dansimon “DSM personal web site”
deejoe
den “den — den”
dikiaap “~dikiaap”
dos “Turboblack”
drevil
duitser “tilde.institute :: Public-access OpenBSD system”
eirian
elioat
engel “Sygnalista branży IT...”
ensa “ensa's ~institute”
entoreor “~entoreor”
enzuru “library of the lost and found”
evn
exologist “Exologist”
fastjack “fastjack”
fayna
fitkoh
fnt
fs7274 “Casa del hardware”
games “tilde.institute :: Public-access OpenBSD system”
gamliel “Gamliel Fishkin”
gitbucket “tilde.institute :: Public-access OpenBSD system”
guenther
hedy “hedy @ ~institute”
hepeti “Hall of Fame”
hisacro “Hisacro's Tilde Adventures”
id2
idokutela “idokutela”
isaac “Welcome!”
janus
jleeb5912 “Test Page for Apache Installation”
jonah
jovan
kneezle
kogut “kogut's website”
kurb42
kus “home | kus”
latex “latex”
lauren “lauren”
m256 “Home page”
marcor “~marcor”
maria “Desktop Architectures”
marmoset
marty
masodo “masodo”
maxl “Maxl's home page on Tilde.institute”
mcornick “Redirecting”
mcsuper5 “mcsuper5 in the tildeverse”
menix
meuk “Don't litter 🚮”
mic “mic.tilde.institute”
michael “blog”
mieum
mounderfod “mounderfod”
mshroyer “Hello from tilde.institute!”
mworkman72 “Michael G. Workman”
mxb
mydeardiary “My Dear Diary Webspace”
nameless “nameless”
neb “neb's 1. public website”
nightfox “NightFox's Den”
noogie
nsbdfl “the str8 husband diary”
pala “pala's site”
passthejoe “passthejoe at tilde.institute”
pepepalomita “Máquinas Simples en Informática”
phf “phf@tilde.institute”
proton “Ada95 CGI Testing”
punk
pyrogaming “pyrogaming's third website”
quobit “MDwiki”
r1k “r1k”
ray “Heartless Hacker”
rdh “[rdh] on ˜.institute”
rebello
rgdrake
robin “~robin”
robyn “About Robyn”
rokkor “My tilde.club page”
rqm “Shell Output”
runxiyu “Runxi Yu”
seipy “ Seipy's Tilde.institue Blog ”
shell “Hello world!”
shokara
sillylaird “SillyLaird - Laird Lonergan - OpenBSD Tilde”
sl2c “Welcome to ~sl2c!”
smlckz “smlckz's website”
somenxavier
stevieo
stoickidoomer
sy
syx
sza “Arquitecturas paralelas”
tepes
thesimonharms “thesimonharms profile”
tildebeast “Institut du Château Tildebeast”
tivrusky
tmcb “~tmcb”
uberd “~uberd”
unix-enjoyer “Welcome to the shack of the average Unix enjoyer.”
user1103 “USER 1103”
vex “vex's webpage”
vga “VGA — José Vega-Cebrián”
vgk “tilde.institute :: Public-access OpenBSD system”
wink “wink's homepage”
yeti
youngchief “youngchief btw ツ”
zaramid “My tilde website”