Joint Biostatistics, CS, and S&DS
BIN YU, University of California, Berkeley
Date: Monday, April 01, 2019
Time: 4:00PM to 5:15PM
Dunham Lab. see map
10 Hillhouse Avenue, Rm. 220
New Haven, CT 06511
Website
Title: Three principles of data science: predictability,
computability, and stability (PCS)
Information and Abstract:
In this talk, I’d like to discuss the intertwining importance and connections of three principles of data science in the title and the PCS workflow that is built on the three principles. The principles will be demonstrated in the context of two collaborative projects in neuroscience and genomics for interpretable data results and testable hypothesis generation. If time allows, I will present proposed PCS inference that includes perturbation intervals and PCS hypothesis testing. The PCS inference uses prediction screening and takes into account both data and model perturbations. Finally, a PCS
documentation is proposed based on Rmarkdown, iPython, or Jupyter Notebook, with publicly available, reproducible codes and narratives to back up human choices made throughout an analysis. The PCS workflow and documentation are demonstrated in a genomics case study available on Zenodo.
3:45 p.m. Pre-talk tea Dunham Lab, Suite 222, Breakroom 228
For more details and upcoming events visit our website at
http://statistics.yale.edu/ .