What are /r/datacurator's favorite Products & Services?
From 3.5 billion Reddit comments

Search for a Subreddit:

Products

Services

Android Apps

The most popular Products mentioned in /r/datacurator:

The Complete New Yorker: Eighty Years of the Nation's Greatest Magazine (Book & 8 DVD-ROMs)

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

StarTech.com Standalone Hard Drive Duplicator, Dual Bay HDD/SSD Cloner/Copier, USB 3.1 (10Gbps) to SATA III HDD/SSD Docking Station, Hard Disk Duplicator Dock, Hard Drive Cloner (SDOCK2U313R)

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

Synology 2 bay NAS DiskStation DS220j (Diskless), 2-bay; 512MB DDR4

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

Cell Phone Clip on Stand Holder - with Grip Flexible Long Arm Gooseneck Bracket Mount Clamp for iPhone X/8/7/6/6s/5 Samsung S8/S7, Used for Bed, Desktop (Black)

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

Scanner Bin - The Clever Document Scanning Solution

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

Keeping Found Things Found: The Study and Practice of Personal Information Management (Interactive Technologies)

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

Western Digital 4TB WD Blue PC Internal Hard Drive HDD - 5400 RPM, SATA 6 Gb/s, 256 MB Cache, 3.5" - WD40EZAZ

➔ 1 /r/datacurator comment

Check price

➔ View 1 /r/datacurator comment

Check price

The most popular Services mentioned in /r/datacurator:

PhotoStructure

13 /r/datacurator comments

Everything

8 /r/datacurator comments

digiKam

6 /r/datacurator comments

TagSpaces

5 /r/datacurator comments

FreeFileSync

4 /r/datacurator comments

Czkawka

4 /r/datacurator comments

Buku

3 /r/datacurator comments

DropIt

3 /r/datacurator comments

jdupes

2 /r/datacurator comments

#10

Advanced Renamer

2 /r/datacurator comments

#11

Snipe-IT

2 /r/datacurator comments

#12

Nextcloud

2 /r/datacurator comments

#13

fman

2 /r/datacurator comments

#14

VisiPics

2 /r/datacurator comments

#15

Dendron

2 /r/datacurator comments

The most popular Android Apps mentioned in /r/datacurator:

Open Camera

1 /r/datacurator comment

Free

PocketBook - PDF, EPUB reader

1 /r/datacurator comment

Free

Termux

1 /r/datacurator comment

Free

PhotoScan by Google Photos

1 /r/datacurator comment

Free

Article Reader Offline

1 /r/datacurator comment

Free

The most popular reviews in /r/datacurator:

fdupe -r [path1] [path2] [path3]

will show identical duplicates, to keep only one of them use -rd

on windows you probably need cygwin or try smth from https://alternativeto.net/software/fdupes/?platform=windows

As u/creeva has pointed out, if you put all pictures containing Billy Bob into a Billy Bob folder and all pictures containing Jimmy Jon in another folder, any pictures containing BOTH Billy Bob and Jimmy Jon will either end up duplicated (one copy in each folder) or present in one folder and absent in another, meaning you'd never know if you were looking at ALL the Billy Bob photos when you go to the Billy Bob folder.

My suggestion? Tags. I use Digikam to manage my (500k) photograph collection because it is totally free and open-source and has a great set of features, but I'm sure there are a lot of programs that could do a similar job. I use the face detection to find and recognise faces. You tell Digikam who each face belongs to, and it writes it to the photograph as a 'tag'. You can also 'tag' things like the location, the event, the photographer, anything! And then when you want to see all the photos of Billy Bob, you just search his name. If you want to see the photos of Billy Bob you took last Christmas, you can search for Billy Bob Christmas. And if you want to see the photos of Billy Bob from the summer of 2012, you can do that too. Anyway, you get the idea. Hope that helps.

You could use something like this after you pull the hdds. I personally have an older less expensive version of this. You could also use something like google drive/one drive to if it’s not too much data to upload/redownload on main machine if you dont want to spend money. I know there are most likely other ways to do this such as folder mapping but I’m not sure if that works on Windows home editions as I’ve never had to try and set that up.

Edit: punctuation

Have a look at this https://github.com/jarun/Buku

I personally imported my 5K bookmarks, searched through and deleted a bunch, now it's down to 2K.

When ever I have some free time I'll go though and replace dead links to archive.org and tag my stuff It's slow and I only do it once in a blue moon

This does not sound sane to me. Why would you only overwrite files that are almost certain identical? Usually, these are the files I don't want to overwrite because ... overwriting a file with an identical copy doesn't make much sense to me.

If this is true: don't blame the tools for not offering strange requirements nobody else needs. I usually try to avoid those tools because they're bloated.

For solving your issue, I'd follow the following ideas:

Advanced file managers like FreeCommander, Total Commander, Dired, and so forth are offering comparison functionalities for directories.

Or you write a script. I'd use Python for any non-trivial stuff that is not part of my file manager.

And: please do learn how to write good subject lines. "Why does this not exist?" is as vacuous as "I have a question" or "help me". 🙄

You could put all of these files in a .hidden or .archive directory (like, .../pictures/.archive/YYYY/YYYY-MM-DD/...). I suspect many DAMs running on UN*X will skip over directories that start with a period. Alternatively, you can mark the directory as hidden on Windows or macOS.

There's also a <code>.nomedia</code> convention on Android that seems to match up with what you're asking here: they're for directories containing files that you don't want in your library, but don't want to delete. (I adopted it for PhotoStructure, but I don't know how many other DAMs respect it, compared to skipping over hidden directories).

Why tf would you use a spreadsheet for metadata?

I‘d recommend trying the free and open source program DigiCam which offers very fast tagging and all around metadata editing with a GUI. If CLI is more your thing ExifTool is a good choice, also free and open source.

I must admit I did click on the Tabbles in your first post but once I saw that it was closed source and Windows only, I really didn't look further.

For something that is so crucial to long term data storage, open source + cross platform would be a full requirement for my needs.

I use some closed source software, but something like this would need to be open source. What happens when Tabbles stops being supported? With open source, if the software is useful, its usually forked and then maintained by a new group. That isn't possoble with closed source.

Maybe this? https://www.tagspaces.org/ Haven't looked into it deeply, but its on my list now

I've been using The Archive for the last 6 weeks or so and I've written more in these 6 weeks than I have in the last year.

It's based around the idea of a Zettelkasten, which is getting quite a bit of attention recently.

As far as file structure goes, I just have ~/notes which is filled with markdown files.

Just an update, turns icaros (probably) affects all thumnails, so I ~can't set my own offset for only a specific directory.

BUT, in r/datahoarder someone recommended me Jellyfin to sort my shows/movies and it's sooo sleek and nice! And I can even stream the media. I ~just had to multi rename the files using this open source program; Simplest file renamer it allows to edit the files with notepad which I coupled it with excel and online databases. Then jellyfin ~took over and scanned my library.😅

I've had several beta users tell me that they lost a bunch on photos using other photo management software and then "clicked the wrong button." It's tough to hear when backups are so cheap these days.

It's also why I decided to never move original files: I only copy them.

https://github.com/jarun/Buku/wiki/Operational-notes

> URLs are unique in DB. The same URL cannot be added twice.

As for the second one is assume it would work, but I can't be 100% sure.

Also bukuserver is something that's new with the software that I haven't seen before, I'll have to give that a go.

For duplicate files I use,

jdupes - identifies files that are exactly the same - you can create a report, and/or take action over the duplicate files - https://github.com/jbruchon/jdupes
dupeGuru - has a photo mode that identifies similar images - https://dupeguru.voltaicideas.net

I need to play around with it more, I ran Hazel when I was on a Mac and really enjoyed it I should even still have a license, but when I switched to back to a pc I was left with a longing. I need to mess with it more: https://github.com/benjaminoakes/maid

Also funny that a windows equivalent is called Alfred, considering the Mac app by the same name is an accelerator like Quicksilver or Gnome-Do.

You have to check out https://fman.io/ - it's clean and simple, offers one of the killer features "GoTo on Steroids" and you can download tons of add-ons or write them yourself (Python).

Disclaimer: since I'm working in the zsh or with dired, I'm not using it myself. However, if I would need a GUI file browser, I would use fman for sure. And I know many happy users of fman that switched after my recommendation.

I think this is a two-step process.

First, a de-duplicator. Czkawka will have you covered and is fast. You can preview and compare via file hashes.

The second is to do a scan for files older than X and you can manually review them for removal. any scripting or programming language can do this.

check out TagSpaces. I'm thinking it may not be automated enough for your purposes, but I rather like it so I'm throwing it out there. I don't focus on file naming, rather on the tagging of data collections. Thing is, with TS you don't really need to put or file into a logical structure although I prefer it that way. TS spaces allows for you to search for tagged items across a data mount so the targeted tag's data can be scattered around and still logically organized and findable.

https://www.tagspaces.org/

Resilio Sync would be my go-to here. It’s based on P2P protocols and “just works” across any kind of network. I use it to keep many TBs in sync across multiple computers (and NAS devices).

The free version has some limitations that I can’t remember but might be suitable. If not the paid version is a one-off cost and is worth it IMHO.

I believe https://freefilesync.org/ is one of the more popular backup programs, I like it at least. I think you can change the options to move instead of copy if that's all you want.

Defragging is becoming less of a concern in the modern era and it mostly applies to cleaning up after deleting files. So if you moved files from DRIVE:A to DRIVE:B you might want to defrag DRIVE:A. That said, there is no need to do it very often; it won't even make a difference unless you created a significant amount of free space (100+ GB) the bigger the better.

DO NOT DEFRAG SOLID STATE DRIVES. It will NOT make it a difference, and it will waste read/write access on the drive. R/W access is becoming more generous, more than we ever need before we replace computers every 4 years or so but still; don't waste it.

I guess if I answer the first question now I'll have all three in reverse order so, why not. THEORETICALLY If you had a playlist with each episode automatically playing after the previous; then there might be millisecond-faster loading if the episodes were defragmented right next to eachother, in consecutive order. Its a bazillion times more effort than you'll ever notice results from.

Good guide to get you started. https://www.reddit.com/r/Android/comments/66kehg/twoclick_downloads_of_youtube_videos_straight/

Basically download Termux app, apt get-install python and pip install youtube-dl.

I've got a nice multiway selection so when I share I link to Termux I can chose to rip video or just audio or archive the whole channel etc.

I love using the Termux app (it's also available on f-droid, allows me to use stuff like Python, compile with clang, use ffmepg and SSH into machine's (I do it a lot) amoung other stuff. Packmamger is apt.

I've come across https://openpaper.work/ which seems like a great tool. I am not sure how I should combine its usage with my digital documents. I have types of documents of a certain class digitally and on paper. This causes a form of artificial segregation for these documents. Previously, I just threw digital and scanned documents into the same folder if both belonged into the same folder.

How would you handle such a case? Paperwork just seems to powerful to pass on.

Library science is great, but this is absolutely the best book I've ever read on NOT losing interesting stuff once I've gotten it:

https://smile.amazon.com/dp/0123708664/ Keeping Found Things Found: The Study and Practice of Personal Information Management (Interactive Technologies) Paperback : 448 pages ISBN-10 : 0123708664 ISBN-13 : 978-0123708663

It really is handy. It takes a bit of fiddling to set up but I've got it running on a little mid-range mini-PC on my network and I'm using this app to upload documents scanned via phone.

Howdy! I'm the author of PhotoStructure, which handles image and video deduplication server-side: I've spent a lot of time with this issue, and you might want to read this article to know what sorts of dupes you want to handle.

Whatever software you end up using, it will need to keep a database (or something like it) that stores a "hash" of every image that you're try to deduplicate, or performance will be terrible.

Hash-alone strategies have high false-negative (missing actual dupes) and false-positive (where different photos are erroneously gathered together) rates: it's the nature of using a hash to have collisions. I had to teach PhotoStructure to use image metadata heuristics as well to get good accuracy. (BTW, if you use PhotoStructure, and see any issues, tell me on the forum or on discord, and I'll fix it in the next release!)

Total Commander with the EXIF plugin can do this.

https://www.ghisler.com/dplugins.htm - search for "EXIF" and you will find the EXIF 2.6 plugin. After that you use this plugin to mass rename your files.

Look up the "N-Gram" algorithm (https://en.wikipedia.org/wiki/N-gram) It suffers from false positives because languages aren't discrete entities and more like a mess of overlapping, shared vocabularies. Ultimately you want something that can extract text from all of the "usual crap" and then have it identify the language. Elastic search has some stuff on this: https://www.elastic.co/blog/multilingual-search-using-language-identification-in-elasticsearch

Off hand, I don't know of an open source, multi-lingual, document management system that does this, but that is probably what you are looking for.

There's a linux release of it to run on .net/mono here. See AutoHotKey bug here for it.

This is how I scan

It's super crude but it will make all photos in your directory look like they've been scanned

Take photos with phone: Use gooseneck thing like this below, to hold phone up in air and just snap photo, change paper, repeat. (Phone holder wouldn't be necessary if you didn't want it, but would make it more consistent and less annoying i think)

https://www.amazon.com/Cell-Phone-Clip-Stand-Holder/dp/B079QY6RFQ/ref=mp_s_a_1_5?keywords=arm+phone+holder&qid=1642397700&sr=8-5

Get all photos into a directory

In the directory run something like this: for f in ./*.jpg ; do magick "$f" -alpha off -auto-threshold otsu "${f%.jpg}-scan.jpg ; done

If your photos aren't jpg, rename the jpgs above to your format before running

This will take every image in directory and add a copy named 'image-scan.jpg' that will now look scanned.

It makes the blacks super black so make sure it's flat and well lit, shadows will turn black. You could probably adjust this so its not as extreme but idk.

If you want, test it out on just one file instead of the whole directory with: magick "yourfile" -alpha off -auto-threshold otsu "yourfile"

I don't know about ocr. I tried it with tesseract on an image, and the resulting text was gibberish

Yeah, there's something relatively similar on Linux. It doesn't play quite perfectly nice with btrfs/zfs-style read-verification, but it can work in a pinch if you don't want to modify the storage layer.

A possible easier solution: tag them using the TagSpaces standard which works for any file type, using some sort of mapping between URLs and tags? The token tinyurl generates might suffice, or maybe roll your own mapping.

I personally seperate my Anime movies and shows from the normal Movies and shows.

I let beets handle music and it puts OST into a sub folder due to my configuration is specifically setup to do that

This configuration may be out of date, I'm not at my laptop to check.

https://hastebin.com/eqitukupay

I'll see about writing my setup on a Monday afternoon.

I'm not sure how to handle ROMs correctly.

I love Digikam. I don't do much editing myself, but I know Digikam has editing features. I've used the batch edit to a whole album of my original photos, reduce them to 1920x1080 pixels, add a watermark, and rename them to prepare them for upload, but that's as adventurous as I get, and I know it can do a lot more.

If you're looking for serious editing power, try Gimp. It's like an open-source version of Photoshop and it supposedly can do anything Photoshop can do, but I've never used either myself. If you have any questions I can't answer, the folks at r/opensourcephotography would probably be thrilled to hear them.

I do not allow spaces in any file or folder name outside of the OS system, and installed apps.

I do not allow any data to be saved to my C: Drive, only the OS and programs go on C: drive

For Folders/Filenames, I use PERIODS for all naming. Or capitalized like this

MyData

DataPublishers

Also I use special codes I created over 250,000 at the present time to Identify data and abbreviate file names. Yes I use a DB to track my naming system.

If you are on windows there is one tool I use everyday called Everything

Basically it is a search engine for any files on your computer or network. It is freeware and a fantastic tool, installed and portable versions.

https://www.voidtools.com/

This tool allows you to search by name, keywords, file size etc. The best part is that if you type say this:Library contacts

It will find the file called Library.Contacts.2019.12.31.xlsx or whatever.

I integrate the series title in the file name, ie:

Lois McMaster Bujold - [Vorkosigan 02] - Shards of Honor (1986)

It's usually sufficient to keep one directory per author, as it's very clear to understand which files are a part of the same story arch.

For Problem 1, - Books by various authors, I insert the "universe" name as the author name:

Star Wars - [X-Wing 01] - Rogue Squadron (Michael A. Stackpole, 1996)
Star Wars - [X-Wing 02] - Wedge's Gamble (Michael A. Stackpole, 1996)
Star Wars - [X-Wing 09] - Starfighters of Adumar (Aarson Allston, 1999)
Star Wars - [X-Wing 10] - Mercy Kill (Aarson Allston, 2012)

Shared series in a convoluted Universe:

Star Wars\Legends\Star Wars - [New Jedi Order 17] - Force Heretic III - Reunion (Dix & Williams, 2003)

Problem 1.1 -

I have named my novels by "author - title", and my non-fiction books by "title - author" - so that's it's easy to identify which is which. When searching for academic books, the subject is usually of more interest than the authors.

Problem 2:

Install a local search engine - https://www.voidtools.com/

PhotoStructure currently imports tags from Google Takeouts, and ML-based search and tagging is coming soon.

I agree with your sentiment: most for-profit software will try to introduce some sort of "friction" that makes it a PITA to switch to a competitor.

However, PhotoStructure is not most for-profit software: I don't have any investors pushing for those sorts of tactics: it's just me doing everything.

I believe well written software (closed AND open source) should:

Leave you in a better place than you started if the software goes away
Should adopt standards whenever possible and avoid proprietary data stores

I bend over backward to not be the "authoritative store" for any action you do in PhotoStructure: everything is stored in sidecars (or the original files: it's configurable).

Metadata is also stored in a clearly designed and commented SQLite database, which is used to make the UI and syncs performant, but it's treated as a possibly-out-of-data data mirror, not the authoritative store.

I'm assuming you want multi-year options to get a bigger discount than the 30% annual discount from monthly subscriptions?

If that's the case, know that I'm <u>liberal</u> with additional discounts. If you're a student, or medical professional impacted by COVID, you get 50% off. If you've an open source developer or maintainer, I'll give you between 50% and 100% off (depending on if PhotoStructure uses your code). If you've given substantive feedback, or helped debug issues, I'll give you a discount.

Basically, just give me an excuse, and I'll give you a discount coupon.

Feel free to try out PhotoStructure: I wrote it to help me dig out from my digital mess of dupes and disorganization.

You'll need to set up a free trial to enable "automatic organization." Ping me if you need a longer trial or have any questions.

I started trying to organize my photos and videos into YYYY/YYYY-MM-DD or YYYY/YYYY-MM-${EVENT_NAME} subdirectories, and then add symlinks to those files that lived in other directory hierarchies, but wasn't able to stay on top of it.

I actually wrote software to do robust captured-at extraction, but ExifTool will get you 80% of the way there (including moving your photos and videos into these subdirectories). You may want to do deduping as well, and depending on how bad your current situation is (mine was bad), just de-duping files based on SHA may be "enough."

The only thing I'd caution against is a directory structure that results in more than several thousand files in a given directory. Most OSes won't outright crash from gigantic folders, but if you ever export that directory onto spinning rust or via a remote file share, it's certainly not fun to wait a while navigating through and between those directories.

I've been soaking in this for a while (and even have my own, but it's focused on personal use, not the industrial/professional space).

I think this community's the right place: ask away!

I'd get a solid backup (or two) squared away before she starts using any deduplication apps. I wrote this up after being asked this by several of my beta users. (If she doesn't want to get a NAS, at least get an external drive!) She said that Time Machine wasn't enough, but that's certainly better than nothing!

Deduping is tricky and filled with heuristics that might or might not match up with what she's expecting. I wrote up how PhotoStructure compares files here. No matter what tool she goes with, she should at least be passingly aware of these details.

She also mentioned concern of metadata loss when copying to an external drive. Time Machine will retain everything, as would something like SuperDuper. You'll lose file metadata like user and group ownership of the external drive is formatted as FAT or exFAT, so she should use Disk Utility to format it to AFS+ or APFS (as long as it doesn't need to be read by a Windows box).

maybe you'd like to try Eagle? https://en.eagle.cool/

It's a desktop-based file management tool that can easily sync through the cloud and provide tagging and annotations. tho in its website it states its build mostly for designers but i think it suits every requirements you mentioned as well.

You can categorize assets (image/videos/gifs/audios/docx...etc) across different dimensions very easily (e.g., by platform, by subject, by style, by shape, by color, by rating, by dates etc.), and can easily create custom queries, smart folders, auto-import and even use your tag label to create a custom folder (e.g. one photo with two different tags can appear in different tag folders without duplicating and taking two spaces, which is extremely thoughtful).

anyway, I've found it to be very useful in managing my workflows and optimizing my project process too. also, beautiful interface is a plus as well.

hope you find this helpful!

have you tried Eagle? https://en.eagle.cool/ it's a desktop-based file management tool that can easily tag and give annotations.

You can search, browse and categorize assets (image/videos/gifs/audios/docx...etc) across different dimensions very easily (e.g., by platform, by subject, by style, by shape, by color, by date, by stars rating etc.),

and can easily create custom queries, smart folders, auto-import and even use your tag label to create a custom folder (e.g. one photo with two different tags can appear in different tag folders without duplicating and taking two spaces, which is extremely thoughtful).

anyway, it's just a cool and amazing tool in managing workflows and optimizing project processes. also, beautiful interface is a plus too

If your open to using the website, Google photos has great image recognition.

I've recently came across digikam for photo tagging. It will require a lot of front end work though.

https://www.digikam.org/

Digikam is a free and open source photo and video organization and photo editing tool. It can find people/faces, find duplicate/similar photos, stitch panoramas, create albums, edit photos and do much more.

I'm using organization capabilities of it. It can manage large collections without effort.

It doesn't modify your photos if you don't want to. Keeps the folder structure intact and evolves around your organization scheme. It supports multiple catalogs and catalogs can be on your disk, on a removable drive or on network.

It creates a database when you add photos to it, so it can make pretty fast searches on that database according to tags, dates, etc.

I'm using it for ~10 years but, I'm using it seriously for 1-2 years and as I dig deeper, I find more and more features.

While Digikam's photo editing tools are pretty good, I prefer Darktable to edit my photographs.

Tiny Media Manager is a great tool that I've come to rely on for organizing and properly naming TV episodes for me: https://www.tinymediamanager.org/

It had a bit of a learning curve, but compared to looking the info up on TVDB and doing it manually - I'll never go back.

Music is stored on NAS with folders for each artist and subfolders for album. Folders are indexed by Roon and Plex. Some are copied to iTunes on NAS.

As a primary music program I'm using Roon. It's perfect except it runs only in LAN.

For the go and on mobile I'm torn between old iTunes which I hate and Plex which is good but not so nice as Roon.

This is the setup I can recomend.

Either you need to use an off-site solution like Google Play Music, or you need to set up a server (like Plex) at home and VPN in to your home network.

Setting up a personal VPN server isn't going to be easy for a layperson. You're going to wind up learning a lot in the process, like how to sign your own certificates and how DNS records work. You'll also need to get a domain and set it up with dynamic DNS, as it's very unlikely you have a static IP address at home.

GPM has a free option with ads, which is going to be the cheapest way to get this done. Next up is setting up a subdomain on a service like FreeDNS and standing up a server that hosts OpenVPN and Plex, which costs a bit in power and a lot in time. Finally, paying for GPM gives you an ad-free experience for the least amount of work.

We do more than Google play books ;). You can choose to keep your entire collection synced to the cloud or only the ones that you are interested in reading.Once synced to your devices you can access and handle your eBooks *offline*. No internet connectivity required.

Unfortunately, like most services an account would be required. Your library is private and is only seen by you.

You can take us for a spin with a throw away email or https://protonmail.com/

Thanks a lot for responding.

I use instapaper for webpages, the FOSS equivalents are wallabag and shaarli

For PDFs, Word docs, etc. I upload to an online file service and organize them into folders manually, which is serviceable but not optimal.

I highly recommend Zotero. It was designed to be used to collect and curate scientific articles, but it works just as well for websites, blogs, online magazines, newspapers, ebooks, etc. There's a web browser plugin for Firefox and Chrome/Chromium (and possibly others) that lets you save straight from the web, and a dedicated app that you can view your collection in. It saves everything locally and can optionally back everything up to the Zotero server. It's also free and open-source.

I have not tried to download Pocket articles to my computer locally, but the sorting and "reading view" options were a nice transition for me.

getpocket.com

To select files to make 1GB folder is a hard (NP) problem, so no one have implemented such program for this case.

But you can use something like DropIt to help you selecting your files so you can have meaningfull folders and then select 1GB of files.

Remember: 1GB of files in the folder will not mean 1GB zip sizes.

I sort a bucket of files by extension and then by first letter so I can select it more properly.

Hope it help you.

My man! I really should have the means to give you reddit gold or some sort of gratification for your work!!!!!!

I'm usually only in the hoarding phase of getting things, and use a little DropIt to sort in folders by extension and letter. To RPG I try to improve by keeping folders together and for GURPS I really try do to my best. I have an Excel sheet that I made from SJG Site, do some cleaning and try do have the pdf titles in the same manner (Name and SJG Code), But now i can put it in a greater context.

Thanks again.

PS. I you like we can do some /r/DHExchange/ posts to help our mutual collections.

Cheers!

You could try something like Advanced Renamer or Bulk Rename Utility. I've used Bulk Rename Utility quite a bit over the years and have heard good things about Advanced Renamer. They should both do what you want once you learn your way around the interface.

well if your using Linux GNOME I would see it isn't too hard as there is a setting in preferences of nautilus that allow you to expand folders. and if you get the zoom level right you can see the pictures. If your on windows then freecomander. Might be your fancy

If they're going to be web-accessible, I'd take the advice in https://www.w3.org/Provider/Style/URI ("Cool URLs Don't Change") and just store the file under a dated directory (YYYY/MMDD/your-filename).

Then use some type of file-tagging software to make it show up in as many places as you like. This looks interesting, but I haven't used it: https://www.tagspaces.org/

You can use Veracrypt and Upload an encrypted volume to the cloud. Downsides you'll have to upload the entire volume each time you modify the contents.

Or you can use rclone with the encrypt, benefits means you only need to upload what you change. Downsides is it's command line if that's a problem for you. But it's pretty easy to get the hang off

also I would recommend doom emacs or castlemacs (if you use mac and/or don't want vim keys). The learning curve isn't as bad as people say, and I think these configs you can learn in a day not a week. Emacs using lisp is another benefit.

Get a good 3/4 drawer filing cabinet. Of course, I've really started to change my thinking in the last 10 years that almost everything I want should be stored digitally (why keep discs), but for personal/sentimental items, and for essential paperwork you need the physical paper documents.

Get regular file folders and separators. Categorize by broad category, alphabetize, and then if you need more, by date. For the personal stuff, keeping it in photo albums and/or small archive boxes is the way to go... those can go in the filing cabinet too, if there's room.

All of these need to be scanned and kept digitally too. While you'd be upset if you lost the originals (and while officials might not like them if you needed the paperwork), the digital copies are better than nothing.

> Linking objects to computer files. For example, I have a scanned copy of the manual that came with my dishwasher. It would be neat to have an ID I could put in the file name.

Check the lower lefthand corner of that manual (front cover, or maybe every page, tends to vary). There's an internal document id that the manufacturer uses in their own document management system. Also, rarely a need to scan except for old stuff. Anything made post-year-2000 tends to be available as a digital pdf, and you only need to google that id number to find it usually.

Also, you need to check into inventory management for some of your stuff. There are software packages that do this stuff, they ask about them on r/selfhosted every few weeks. Something like https://snipeitapp.com/ maybe.

>https://dendron.so/

Dendron is cool, forgot to list it.

>But also, OP didn’t say what they wanted in a note keeping app. So it’s not like we can help cater for their needs.

True, I just found that dropping a link to a very different note taking tool and calling it Better without specifying why was a bit of a stretch.

> For a time, I could keep all of my important data in my 1TB Dropbox account, and life was good. I’ve now outgrown this, and I’m at the point of dividing my hoard into Current data which I wish to keep online in Dropbox, and Archive data which is on roomier local storage.

I've settled on Nextcloud. It's a substitute for Dropbox that runs on your own hardware. In effect, both my local and online data are the same.

> One problem with this is choosing exactly where to cut my top level folders. I’ve settled on having a flat A-Z top level

I'm a big proponent of A-Z. However, I don't like it for a top level set of directories. Instead, I'll have this in my movies or my music folder, so that I can break them up a bit. Better to have 150 movies in S, than 1600 in Films. (And for music, performers/bands go in A-Z, albums in those).

Please please please let me know if you find anything, looks like we have the same requirements and I feel so unfulfilled still after so much searching.

https://joplinapp.org comes close, but the markdown files it creates have hashed filenames so impossible to navigate outside of Joplin :(

https://danobot.github.io/notorious-landing/ recently appeared on my radar and looks pretty good, need to test it.

A local NAS might be nice, and they can run software like little computers.

https://www.amazon.com/Synology-DiskStation-DS220j-Diskless-2-bay/dp/B0855LMP81

https://play.google.com/store/apps/details?id=com.synology.moments&hl=en_US&gl=US

You should look into PhotosScan by Google Photos, it's an app made by Google that does a pretty decent job, by taking multiple photos and merging them together. It works for printed documents too.

I'm sure you could try, but the resolution may not be amazing - my DSLR takes pictures at a resolution of 6000x4000, while my phone's max is 4032x3024.

Regardless, I would definitely try to use the Open Camera app, if you're using android, so that you can lock in exposure & white balance. I've no clue if IOS has an equivalent

A similar app I use for clipping sites on my phone is Article Reader for Android. For the desktop, someone else mentioned Zotero.

The Complete New Yorker.

It has a Mac and Windows version on disc. Only way to view the 4100 issues (1925 through 2007ish). Mac versions no longer run on modern OSX. Windows versions still run. Two dlls, one is sqlite by the name, nothing interesting in it. The other is FMLoader. FM stands for "Fata Morgana" the name of the library for secure djvu files, done by a company called LizardTech. The FM header files are hidden in the Mac .pkg file, so that's a bonus. Debugging tools can't hook the software (at least on Windows, were there ever any comparable debugging tools on OSX?). Going for a primitive dll proxy approach, and now with the correct version of windows/visualstudio I can at least get it to not barf on loading my proxy dll. That's good. But it started complaining that it couldn't load a type... and I was sure that the proxy tool had faked everything in it (750ish functions, structs, etc).

Then I downloaded dotpeek and decompiled it, and there's .NET stuff in it too. And I have no idea how to proxy that. If I can get the syntax right I can just try to extend the baseclass (but have mine with the same name? can you do that?), but this is straining my level of competency. Pretty close to giving up.

The same publishing company, Bondi, also did a "complete Rolling Stones" with everything from 1967 up through 2006. And they've also done Playboy (but I can't tell if it's just the 50s issues, the 60s, or both, or even up until modern times).

Busting this open gives everyone access to quite a few works that you can't get otherwise. I think the New Yorker now gives online access to the back catalog, but only for you to look at through the glass case, can't have your own copy. And if you stop subscribing, you lose even that.

What are /r/datacurator's favorite Products & Services? From 3.5 billion Reddit comments

The most popular Products mentioned in /r/datacurator:

The Complete New Yorker: Eighty Years of the Nation's Greatest Magazine (Book & 8 DVD-ROMs)

StarTech.com Standalone Hard Drive Duplicator, Dual Bay HDD/SSD Cloner/Copier, USB 3.1 (10Gbps) to SATA III HDD/SSD Docking Station, Hard Disk Duplicator Dock, Hard Drive Cloner (SDOCK2U313R)

Synology 2 bay NAS DiskStation DS220j (Diskless), 2-bay; 512MB DDR4

Cell Phone Clip on Stand Holder - with Grip Flexible Long Arm Gooseneck Bracket Mount Clamp for iPhone X/8/7/6/6s/5 Samsung S8/S7, Used for Bed, Desktop (Black)

Scanner Bin - The Clever Document Scanning Solution

Keeping Found Things Found: The Study and Practice of Personal Information Management (Interactive Technologies)

Western Digital 4TB WD Blue PC Internal Hard Drive HDD - 5400 RPM, SATA 6 Gb/s, 256 MB Cache, 3.5" - WD40EZAZ

The most popular Services mentioned in /r/datacurator:

PhotoStructure

Everything

digiKam

TagSpaces

FreeFileSync

Czkawka

Buku

DropIt

jdupes

Advanced Renamer

Snipe-IT

Nextcloud

fman

VisiPics

Dendron

The most popular Android Apps mentioned in /r/datacurator:

Open Camera

PocketBook - PDF, EPUB reader

Termux

PhotoScan by Google Photos

Article Reader Offline

The most popular reviews in /r/datacurator:

What are /r/datacurator's favorite Products & Services?
From 3.5 billion Reddit comments