Skip to content

Latest commit

 

History

History
73 lines (66 loc) · 4.26 KB

File metadata and controls

73 lines (66 loc) · 4.26 KB

TODO

(split off from main readme to reduce clutter)

Future Plans

  • fully integrate with the functional competency map, and replace this syllabus with that when it is ready
  • split up the syllabus and the curriculum for this package (this makes it easier to look for new and better materials, and to make updates to the curriculum, while at the same time making sure the scope of the training package doesn't drift too much as stuff is added or removed)
    • the syllabus notes down things that need to be learned and outcomes that should be achieved
    • the curriculum collates and orders the material needed to cover the syllabus
  • cover deep learning in the ML module in a bot more depth
  • update ASR
  • notes on timeseries data
    • fb prophet
    • changepoint detection
    • model / data drift, and how to detect it
    • how to split training and test sets (and how NOT to)
      • split by time (or in some cases, some other time-dependent variable)
      • splitting randomly leaks labels into the training set
    • remember that near-100% results are suspicious
  • evaluating your results "by eye"
    • is the accuracy weird
      • is it too good? (98-100%, but 95+ is where you'd get suspicious)-> leaking labels, easy problems
      • is it too bad? -> low quality labels and test set?
    • rules of thumb
      • would the opposite finding be surprising?
      • if you were told that some other model predicted the opposite (or a different) result, would it be believable?
      • if you were instructed to come up with the opposite result (given your current data), would you be able to do so easily?

Excessively long list of resources we can consider for inclusion

data viz gallery