I kept things simple in the beginning to focus on the problem, but here are some more details @mfherman.
I am actually working with a CSV file that tracks my expenses. Each row has six columns:
- date
- category (I don't use it; it is always `Other`)
- amount
- currency (always `€`)
- note (I use that as a category, so notes are from a limited, but possibly growing set)
- tags (variable number of tags from a possibly growing set)
Here is an example of what could be in such a CSV:
"28/7/2018","Other","73,03","€","Ben Shop","Supermarket,Cash"
"5/8/2018","Other","15,08","€","Jon Shop","Market,Food"
"8/8/2018","Other","1,15","€","Baker","Cash,Food"
"25/8/2018","Other","35,69","€","Petrol","Car,CreditCard"
I want to analyse and chart the data, for example with:
- daily expenses as a bar chart, with moving-average lines
- bar chart of total expense amount per tag, with lines or dots for the median, the average and the number of expenses
- bar chart of total expense amount per note, ....
- possibly the same bar charts, but with the expense count rather than the total amount
- histogram of expenses (in 10 €-wide buckets, for example)
- tag cloud for tags and notes, based on count and total amount
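
To make the shape of the data concrete, here is a minimal base-R sketch of how the sample rows could be loaded and summarised. It is only an illustration under my own assumptions, not a settled approach: the column names are ones I made up (the file has no header row), the rows are inlined as text instead of being read from the real file, and the variable names (`expenses`, `by_tag`, `per_tag`, `daily`, `bucket`) are hypothetical.

```r
# The four sample rows, inlined so the sketch runs on its own.
# With a real file, `text = csv_text` would be replaced by the file path.
csv_text <- '"28/7/2018","Other","73,03","€","Ben Shop","Supermarket,Cash"
"5/8/2018","Other","15,08","€","Jon Shop","Market,Food"
"8/8/2018","Other","1,15","€","Baker","Cash,Food"
"25/8/2018","Other","35,69","€","Petrol","Car,CreditCard"'

# Read everything as text first; the column names are assumptions.
expenses <- read.csv(
  text = csv_text,
  header = FALSE,
  col.names = c("date", "category", "amount", "currency", "note", "tags"),
  colClasses = "character"
)

# Dates are day/month/year; amounts use a decimal comma.
expenses$date <- as.Date(expenses$date, format = "%d/%m/%Y")
expenses$amount <- as.numeric(sub(",", ".", expenses$amount, fixed = TRUE))

# One row per (expense, tag) pair, since the number of tags varies per row.
tag_list <- strsplit(expenses$tags, ",", fixed = TRUE)
by_tag <- data.frame(
  tag = unlist(tag_list),
  amount = rep(expenses$amount, lengths(tag_list)),
  stringsAsFactors = FALSE
)

# Total, median, mean and count of expenses per tag.
groups <- split(by_tag$amount, by_tag$tag)
per_tag <- data.frame(
  tag = names(groups),
  total = sapply(groups, sum),
  median = sapply(groups, median),
  mean = sapply(groups, mean),
  count = lengths(groups),
  row.names = NULL
)

# Total spent per day.
daily <- aggregate(amount ~ date, data = expenses, FUN = sum)

# 10-euro-wide buckets: [0,10), [10,20), ... up to the largest amount.
upper <- 10 * ceiling(max(expenses$amount) / 10)
bucket <- cut(expenses$amount, breaks = seq(0, upper, by = 10),
              right = FALSE, include.lowest = TRUE)

print(per_tag)
print(daily)
print(table(bucket))
```

From there, something like `barplot(per_tag$total, names.arg = per_tag$tag)` would give a first bar chart per tag. The same split-and-summarise step should work for the note column, which needs no splitting because each row has exactly one note.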
Suggestions are very welcome, as I'm only starting with R!