ArchiveBox is a self-hosted internet archiving solution that saves websites as HTML, PDF, screenshots, WARC, and more. This adds it to the TrueNAS community app catalog with optional Sonic full-text search backend support.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
Expose CSRF_TRUSTED_ORIGINS explicitly (defaults to localhost:{port})
so login/API works when accessed via TrueNAS hostname. Also add
TIMEOUT, CHECK_SSL_VALIDITY, and SAVE_ARCHIVE_DOT_ORG as first-class
config options in the UI.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
Expose these as explicit config options in the TrueNAS UI so users can customize browser identity, pass authentication cookies, and use persistent Chrome profiles for archiving authenticated content.
Co-Authored-By: Claude Opus 4.6 <[email protected]>
Co-Authored-By: Claude Opus 4.6 <[email protected]>
|
Why is sonic added but not scheduler? What's the reasoning? |
|
Good call, scheduler should definitely be added, thanks for catching that. In the next release/dev it's already included in the main container, so I forgot that v0.7.x still needs it as a separate container. As for the optional envs, they are the most common options that most ArchiveBox users will need to tweak. I just pushed a commit to cut it down to a smaller set though, don't want to overwhelm users, they can always use env vars to set more. I've also updated Should I commit and push the autogenerated lib files in |
|
Scheduler seems to fail to start currently.
❯ docker logs 95fd1ac99cf3c34b863024faa5eb8113-scheduler-1 -f
[i] [2026-03-12 17:39:16] ArchiveBox v0.7.3: archivebox schedule --foreground --update --every=day
    > /data
Traceback (most recent call last):
  File "/usr/local/bin/archivebox", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/app/archivebox/cli/__init__.py", line 140, in main
    run_subcommand(
  File "/app/archivebox/cli/__init__.py", line 80, in run_subcommand
    module.main(args=subcommand_args, stdin=stdin, pwd=pwd) # type: ignore
    ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/app/archivebox/cli/archivebox_schedule.py", line 98, in main
    schedule(
  File "/app/archivebox/util.py", line 116, in typechecked_function
    return func(*args, **kwargs)
           ^^^^^^^^^^^^^^^^^^^^^
  File "/app/archivebox/main.py", line 1183, in schedule
    cron = CronTab(user=True)
           ^^^^^^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/crontab.py", line 246, in __init__
    self.read(tabfile)
  File "/usr/local/lib/python3.11/site-packages/crontab.py", line 314, in read
    raise IOError(f"Read crontab {self.user}: {process.stderr}")
OSError: Read crontab archivebox: crontabs/archivebox/: fopen: Permission denied
@pirate Any ideas? |
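A permission error like the one in this traceback can sometimes be narrowed down by comparing the mode bits of the crontab spool directory with the UID/GID the scheduler process runs as. The sketch below is only an illustration and not part of ArchiveBox: `has_access` is a hypothetical helper, it ignores ACLs, capabilities, and supplementary groups, and the ownership and mode values in the example are made up.

```python
import stat


def has_access(mode: int, owner_uid: int, owner_gid: int,
               uid: int, gid: int) -> bool:
    """Classic Unix check: may uid/gid read and traverse a directory?

    Hypothetical helper for illustration only. It looks at the plain
    permission bits and ignores ACLs and supplementary groups.
    """
    if uid == 0:
        # root bypasses the ordinary permission bits
        return True
    if uid == owner_uid:
        needed = stat.S_IRUSR | stat.S_IXUSR
    elif gid == owner_gid:
        needed = stat.S_IRGRP | stat.S_IXGRP
    else:
        needed = stat.S_IROTH | stat.S_IXOTH
    return mode & needed == needed


# Made-up example: a directory owned by 911:911 with mode 0o700
print(has_access(0o700, 911, 911, uid=911, gid=911))
print(has_access(0o700, 911, 911, uid=568, gid=568))
```

In a real container the inputs would come from `os.stat()` on the spool directory and from `id` inside the scheduler container.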
…eally great. Let user handle it completely instead
- variable: TZ
  group: ArchiveBox Configuration
  label: Timezone
  schema:
    type: string
    default: Etc/UTC
    required: true
    $ref:
      - definitions/timezone
Should probs remove this as archivebox does not support any timezone other than UTC, it uses the browser's timezone to translate UTC to local times in the frontend, but backend is always UTC.
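The "store UTC, convert only for display" split described in this comment can be sketched roughly as follows. This is a generic illustration, not ArchiveBox code: `to_display_time` and its `offset_minutes` parameter are hypothetical stand-ins for whatever the browser does on the frontend.

```python
from datetime import datetime, timedelta, timezone


def to_display_time(stored_utc: datetime, offset_minutes: int) -> datetime:
    """Convert a stored UTC timestamp to a viewer's local time.

    Hypothetical helper: the stored value never changes, only the
    representation shown to the viewer does.
    """
    if stored_utc.tzinfo is None:
        # Treat naive timestamps as UTC, matching a UTC-only backend
        stored_utc = stored_utc.replace(tzinfo=timezone.utc)
    viewer_tz = timezone(timedelta(minutes=offset_minutes))
    return stored_utc.astimezone(viewer_tz)


# Timestamp taken from the scheduler log earlier in this thread
stored = datetime(2026, 3, 12, 17, 39, 16, tzinfo=timezone.utc)
print(to_display_time(stored, -300).isoformat())
```

Because the conversion happens at display time, a `TZ` setting on the container would not change what is stored.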
|
thanks for helping with this!
  - variable: run_as
    label: ""
    group: User and Group Configuration
    schema:
      type: dict
      attrs:
        - variable: user
          label: User ID
          description: The user id that ArchiveBox files will be owned by.
          schema:
            type: int
            min: 568
-           default: 568
+           default: 911
            required: true
        - variable: group
          label: Group ID
          description: The group id that ArchiveBox files will be owned by.
          schema:
            type: int
            min: 568
-           default: 568
+           default: 911
            required: true
Let me know if you want me to push this change ^ and also update the test values / ix values to 911. Also we definitely need to add |
|
Volumes are shared in multiple apps and I have not needed to add I've tried with 911 and it's still the same on scheduler. The main container doesn't have this issue. But I thought that the PUID/PGID can be set to any uid. (That's usually the concept of running the container as root + utilizing PUID/PGID.) @pirate Any ideas? |
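For readers unfamiliar with the "start as root, then switch to PUID/PGID" pattern mentioned here, a minimal sketch might look like this. It is a generic illustration and not ArchiveBox's actual entrypoint: `resolve_ids` and `drop_privileges` are hypothetical names, and the fallback of 911 is simply the default proposed in the diff above.

```python
import os
from typing import Mapping, Tuple

DEFAULT_ID = 911  # assumption: the default proposed in the diff in this thread


def resolve_ids(env: Mapping[str, str]) -> Tuple[int, int]:
    """Read PUID/PGID from an environment mapping, with a fallback."""
    uid = int(env.get("PUID", DEFAULT_ID))
    gid = int(env.get("PGID", DEFAULT_ID))
    if uid < 0 or gid < 0:
        raise ValueError("PUID and PGID must be non-negative")
    return uid, gid


def drop_privileges(uid: int, gid: int) -> None:
    """Switch from root to the target user (Unix only, hypothetical)."""
    if os.geteuid() != 0:
        return  # already unprivileged, nothing to drop
    os.setgroups([])  # clear supplementary groups first
    os.setgid(gid)    # group must change before the user does
    os.setuid(uid)


print(resolve_ids({"PUID": "568", "PGID": "568"}))
print(resolve_ids({}))
```

With this pattern any UID can in principle be used, provided every path the process touches is owned by or accessible to that UID, which is the part that appears to differ for the scheduler container.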
|
@pirate ping. |
|
Hey sorry I'm focused on getting the next archivebox v0.9.0 release out, which simplifies a lot of this stuff. Is it ok to leave this as a draft for now and I'll update it with the new config once 0.9.0 is ready? |
|
Sure yea |
Closes #231
Description
ArchiveBox is a powerful, self-hosted internet archiving solution to collect, save, and view websites offline. It saves pages as HTML, PDF, screenshots, WARC, and more from URLs, bookmarks, browser history, RSS feeds, or any other source.
App Information
- archivebox - Main web application (archivebox/archivebox:0.7.3)
- sonic - Optional Sonic full-text search backend (archivebox/sonic:latest)
Features
Testing
- basic-values.yaml test file created with all configuration options
Icons and Screenshots
Icon: Please use the ArchiveBox logo from https://archivebox.io - a square orange icon with a box/archive symbol. I can provide specific assets if needed.
Special Notes
- ADMIN_USERNAME and ADMIN_PASSWORD environment variables only take effect on first run to create the initial admin user
Checklist
- app.yaml metadata is complete and accurate
- questions.yaml has clear labels and descriptions
- README.md is written
- /ix-dev/ are modified