Thats fair, /u/GuybrushFourpwood. You want each data point visualized. Ultimately there are 3 dimensions, and we want to know how extreme each person is in FMK.
https://plot.ly/~datavistics/6/
I don't think you can do better than this (ternary plot). Let me know what you think.
You could write a python script to extract the data.
Or, if that sounds ominous to you, you could check out this nifty tool, tabula. With this you can just drag a window over each table and it'll save the table data in one csv, tsv, or json file. It'll take a little bit of manual labor but probably not much more than 20 minutes.
To actually plot that seems pretty advanced. You could try Sketchup, a free 3D drawing program. It's mostly used to sketch house layouts or architectural designs, but they have a lot of items to download. You can specify their dimensions also. So you make a box of roughly your backpack and then put items you get from the library into it.
For a more programming approach you could try Processing.
I guess if you know the dimensions of all your items (to plot them), you already have a good estimate whether their combined volume fits into your bag.
R is a programming language that is focused on statistics. It has a ton of packages that are written by the community to do various things (load csv files, load png files, save jpg files, create graphs, statistical analysis, etc) so it can be very powerful.
It can be used in two ways. 1) As a tool for exploration, by typing and running the commands one by one. Load up some data. Explore it. Create some rough graphs. See where the data leads.
2) As a programming language in a more traditional sense.
If you try it, I would recommend you download Rstudio to use as an IDE.
Great start.
My thoughts. I think stack bar should not have so many slices just like you should avoid a pie chart with many slices.
So I tried to flipped the dimensions and plotted the stack bar like this (the first chart in the link below) https://my.infocaptor.com/dash/mt.php?pa=expensive_diagnosis_561fbbc29a941
also see the second sankey, it gives a high level view and better than pie chart
You could use something like this to create a document with your texts in it. Then from there any word stat solution would be viable.
https://play.google.com/store/apps/details?id=com.smsconverter.lite
I'm really glad you like it.
I used R to load the data, create each individual image, and save each image to a .jpg file. (Credit to Robert Grant for the pictogram function).
I then converted the jpgs to a gif using Graphicsmagick
The one drawback is that the pictogram doesn't seem to play well when there are 0 people in any category.
ImageMagick has a few ways of rendering text to an image, and also does scaling (so you can make a thumbnail-style).
For example, you can make a series of images from my_file.txt
with something like:
convert text:my_file.txt my_file.png
Or do the whole lot in one go:
convert label:@my_file.txt my_file.png
You'll probably want to play with different combinations of -size
, -pointsize
, -density
, and even -font
to get the effect you want. Since your "mile-high" view isn't meant to be readable, don't be afraid of ridiculous numbers!
Starting from an HTML file you could extract the text (using one of the many html2text tools, or something like lynx -dump
) and then use text-based approach above. Alternatively you could use an HTML rendering tool (The wkhtmltopdf suite has a wkhtmltoimg tool, LibreOffice and friends have a --convert-to
option)
I don't think you need a stack chart or an overlapping line chart either
See the group chart example here group column chart kpi
It communicates both the categories and you can also see the year comparison very clearly
Here is the sample data
Here are few options
Stacked bar for percent values https://my.infocaptor.com/dash/i.php?viz=ndhkmtbj
Sankey for actual survey count
https://my.infocaptor.com/dash/mt.php?pa=harry_potter_house_vs_mbti_survey_55f2dc3cbb8a6
Have you tried any tools yet? Modern tools simply let you drag and drop and you can play with different visuals
For e.g If you visit this online tool https://my.infocaptor.com/free_data_visualization.php you can click on the "Data" tab on the top left, copy/paste your excel data and then hit "Analyze". Then in the visualizer tab, play with the columns, drag them into rows and columns buckets and there are various visualization options.
http://www.infocaptor.com/how_to_build_dashboard_using_excel.pdf
Very interesting data.. I just made this quickly with few variables. Although a complete survey with all these variables would make it awesome study sample
I've used plantuml (here's an online editorhere's an online editor) and yuml.meyuml.me.
The layout and options/colors are not always what I want so I've taken the SVG from plantuml and modified it in Inkscape before without much trouble.
No because the origin isn't special. Things could clump around (-5, +8), or whatever.
Randomizing over a very large set does have the same effect as averaging, because over a long-enough period of time every spot would be filled and ±∞ averages out to zero. But the origin is just the center point and things won't tend to clump around it.
Here, I did a Google search for something close then tweaked it: https://codepen.io/seancdavis/pen/PozMYaP then drop this into the JavaScript part: https://codeshare.io/2EvjWN
Feel free to play with the values at the top, set row/column each to 1000 but the dots will start overlapping each other. Parts will look gray because of an optical illusion.
That runs through an array and picks a random color (black or white) and doesn't pick a random place but the end result should still be the same.
Hey. This is a challenging one to make clear, since as you already said the activity depends on three different factors (exercise, diet, and type). In any case, I tried my hand at it to the best of my abilities, and reasoned as follows:
It's very obvious there is a big difference between pit bulls and german shepherds in terms of activity, so we can have a separate graph for each type.
The interesting part here is that german shepherds seem to be more active in non-morning hours, while pit bulls seem to be more active in mornings. With respect to diet, we see that pit bulls are more active with normal diets, while german shepherds show increased activity with a high fat diet.
I have attempted to create a bar chart in R, using ggplot, that attempts to convey the differences in exercise and diet between shepherds and pitbulls:
Let me know if this is alright :)
I agree @Taunk, Cole Nussbaumer Knaflic's books are awesome...
Storytelling with Data, A Data Visualization Guide for Business Professionals - Cole Nussbaumer Knaflic (Wiley 2015)
https://www.amazon.com/Storytelling-Data-Visualization-Business-Professionals-ebook/dp/B016DHQSM2