Question 1

How to set up OCR for scanned PDFs in sist2?

Accepted Answer

Use the --ocr-ebooks flag with --ocr-lang to specify languages like 'eng' or 'chi_sim', and ensure Tesseract language files are installed via package manager or GitHub. The Docker image includes common languages pre-installed for convenience.

Question 2

sist2 vs Recoll: which is better for personal file search?

Accepted Answer

sist2 excels with its modern web UI, incremental scanning, and support for more file types like archives and media, but Recoll is more mature and stable. Choose sist2 for speed and flexibility, but Recoll for proven reliability in production-like settings.

Question 3

Can sist2 index files over a network or SMB share?

Accepted Answer

Yes, sist2 can index any mounted directory, including network drives, but performance may suffer due to latency. Ensure the path is accessible and consider using local caching or scheduling scans during off-peak hours.

Question 4

How to backup a sist2 index when using SQLite backend?

Accepted Answer

Backup the .sist2 index files and the SQLite search index file (.sist2 for search). Simply copy these files to a safe location; restoration involves placing them back and ensuring paths are consistent in the web interface or command line.

Question 5

Is sist2 secure to expose on the internet?

Accepted Answer

No, the README explicitly warns against exposing ports publicly due to lack of built-in authentication. If needed, implement firewall rules, use HTTPS via reverse proxy, and add authentication layers to prevent unauthorized access.

Question 6

How to customize tagging with user scripts in sist2?

Accepted Answer

Write scripts using the documentation in docs/scripting.md to automate tagging based on file attributes like metadata or content. These scripts run during scanning and can enhance organization without manual intervention.

sist2

What is sist2?

Overview

Use Cases

Best For

Related Projects

Found a gem we're missing?

Not Ideal For

Pros & Cons

Pros

Cons

Frequently Asked Questions