Odds and Ends #3
Kartothek, Jupyter on K8s, and more.

- My former colleagues at Blue Yonder (now part of JDA) have introduced Kartothek, software for managing tables stored as Parquet files.
- A great set of documentation on going from zero to JupyterHub with Kubernetes.
- A related piece from Jim Crist on installing JupyterHub on an existing Hadoop cluster.
- An interesting paper from a couple of years ago introducing PATE, which had passed me by. PATE stands for Private Aggregation of Teacher Ensembles, and is a method for semi-supervised transfer learning from private data.
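
For a flavour of the aggregation step in PATE, here is a minimal sketch, not the paper's code. It assumes each "teacher" model (trained on its own disjoint slice of the private data) has already voted for a class label, and it picks the winning label after adding Laplace noise to the vote counts. The function name `noisy_max_label` and the parameter `noise_scale` are my own hypothetical names, and the noise scale is just an illustrative knob rather than a calibrated privacy budget.

```python
import random
from collections import Counter


def noisy_max_label(teacher_votes, num_classes, noise_scale, rng=random):
    """Return the label with the highest vote count after adding Laplace noise.

    teacher_votes: list of int class labels, one per teacher (assumed input).
    noise_scale:   scale b of the Laplace noise (hypothetical parameter name).
    rng:           the random module or a random.Random instance.
    """
    counts = Counter(teacher_votes)
    noisy_counts = []
    for label in range(num_classes):
        # The difference of two exponentials with mean b is Laplace(0, b).
        noise = rng.expovariate(1.0 / noise_scale) - rng.expovariate(1.0 / noise_scale)
        noisy_counts.append(counts.get(label, 0) + noise)
    # The noisy winner is what a student model would see as the label.
    return max(range(num_classes), key=lambda label: noisy_counts[label])


if __name__ == "__main__":
    votes = [0, 1, 1, 2, 1]  # five hypothetical teachers, three classes
    print(noisy_max_label(votes, num_classes=3, noise_scale=1.0))
```

The student model is then trained on public, unlabelled data using these noisy labels, so it never touches the private data directly. A larger `noise_scale` hides individual teachers' votes better but makes the returned label less reliable.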