Paperwork, best tool to keep my documents organized (simple, folder-based document archiver with OCR to make it all searchable). No more big folders of paper collecting dust.
Hier wurden ja bereits sehr viele gute Systeme vorgestellt. So mache ich es seit einigen Jahren:
Viel Erfolg bei der Digitalisierung deiner Dokumente.
Agree
To that purpose I use the app Paperwork, it performs OCR and automatically adds tags to files and scans in order to fetch the documents quickly
Edit : it is open source !
Yes, Paperless seems to fit the bill for you. The original developer moving on and there being 20 open pull requests makes me worry about it's future though. Openpaper.work might be another option but i haven't really looked at it much.
I host https://openpaper.work on my personal fiber connection (300Mbps symmetrical) in France. I've been doing it for about 2 years now. Here is the result regarding load time: https://imgur.com/a/SQ3xj9A
The problems are reliability and downtime: I have no redundant power supply in my house and no redundant Internet access. But neither have failed in the last 2 years. Regarding downtime, my server takes about 30 minutes to reboot (I've a big bunch of other things running on it), so that's actually the main problem for me.
Up to now, nobody has complained (and since I'm the one paying for it, they can't anyway :p).
My advise would just be to not host anything that requires any sort of high availability like that.
I've come across https://openpaper.work/ which seems like a great tool. I am not sure how I should combine its usage with my digital documents. I have types of documents of a certain class digitally and on paper. This causes a form of artificial segregation for these documents. Previously, I just threw digital and scanned documents into the same folder if both belonged into the same folder.
How would you handle such a case? Paperwork just seems to powerful to pass on.
Well there are a ton of programs that incorporate python code to scan a pdf, but they are arguably "finished" products, rather than "just" a library.
ocrmypdf https://github.com/ocrmypdf/OCRmyPDF
OpenPaper.work https://openpaper.work/
See https://github.com/jonaswinkler/paperless-ng, or https://openpaper.work/en/
Found out about them recently in a thread, very useful for organizing digital documents (which is what it seems you're trying to do)
I've researched some more and besides the program OCRmyPDF (thanks /u/elijahcobb) there is also Paperwork (link 1: official site, link 2: short intro video by its dev ), which AFAIK does bridge the gap and has a pretty GUI. At first glance, it actually seems to be pretty cool. Not that I particularly like that it saves every document in ~/papers/
again, but I suppose that is ok for now.
I am flairing the thread as SOLVED.
I also looked for a personal server not long ago. I also had a look at the HP Microserver, but I eventually went for a Dell Poweredge T320. I've upgraded the RAM to 42GB and I'm upgrading the disks (I'm still waiting on the 3.5" -> 2.5" adapters for my SSDs).
I've had it for ~2 weeks. Up to now I am really happy with it. However I still have to check the power consumption.
I host: - Proxmox VE 6 - ~20 LXC containers (Nextcloud, my personal emails, some pieces of the web site openpaper.work, etc). - 1 Windows VM (test&build machine) - 1 Linux VM (test machine)