Monthly Archives: April 2018

Statseminars Stat & Data Science Seminar, Speaker Carl Zimmer 4/27 @ 11am-1pm

Title: The Library of Babel: On Trying to Read My Genome

Information and Abstract:

Applied Data Science Seminar. Not long ago, information about our DNA was virtually impossible to gain. Now, thanks to the falling cost of DNA sequencing and the growing power of bioinformatics, genetic information is undergoing a Gutenberg-scale explosion of popularity. Millions of people are paying for DNA tests from companies like 23andMe and Ancestry.com, and they are getting unprecedented amounts of information about their ancestry and hereditary diseases. For my latest book, “She Has Her Mother’s Laugh,” I got my genome sequenced and enlisted scientists at Yale and elsewhere to help me interpret it. In my talk, I’ll discuss the results of that exploration–at once enlightening and baffling

For more details and upcoming events visit our website at
http://statistics.yale.edu/ .

YINS Tomorrow – 4/18 Sanjeev Arora, Toward theoretical understanding of deep learning

“Toward theoretical understanding of deep learning”

Speaker: Professor Sanjeev Arora

Princeton University & Institute for Advanced Study

Tomorrow – Wednesday, April 18, 2018, 12:00-1:00pm

Location: Yale Institute for Network Science, 17 Hillhouse Avenue, 3rd floor

Abstract: This talk will be a survey of ongoing efforts and recent results to develop better theoretical understanding of deep learning, from expressiveness to optimization to generalization theory. We will see the (limited) success that has been achieved and the open questions it leads to. (My expository articles appear at
www.offconvex.org (link is external))

Bio: Sanjeev Arora is Charles C. Fitzmorris Professor of Computer Science at Princeton University and Visiting Professor at the Institute for Advanced Study. He is an expert in theoretical computer science, especially theoretical ML. He has received the Packard Fellowship (1997), Simons Investigator Award (2012), Goedel Prize (2001 and 2010), ACM-Infosys Foundation Award in the Computing Sciences (now called the ACM prize) (2012), and the Fulkerson Prize in Discrete Math (2012).

Upcoming:

4/25/18: Adam Auton (23andme)

5/2/18: Andre Levchenko

Bass 434 Zoom meetings with iPad annotation

Hi all,

Here are the instructions for setting up the iPad in Bass 434 to interface with the main screen, and to allow annotation during a Zoom meeting:

1) On both the iPad and the Mac mini, go to yale.zoom.us (the iPad has the Zoom app installed as well)

2) Click on "Join a Meeting"

3) In the box for "Join with Personal Link Name" enter (for both devices): lori.bass.434

4) Share the Mac mini screen with the meeting participants

5) On the iPad, click on the "Annotate" tool.

6) Scribble and annotate to thine heart’s content.

Note that the login times out after 24 hours, a relevant fact for those 25 hour meetings in our future.

Best wishes,

Prashant.

farnam disk usage

total 5.67534E+11 of 600TB
gg487 72172301056
sl857 46536808960
jx98 42877998336
tg397 41892482688
fn64 37383067136
cy288 35153647488
mg888 32460720000
jz435 28937379968
dl598 22267850752
sk972 21964532864
pse5 20727790208
sl2373 15560952448
cs784 14082331264
mr724 11791888768
sl847 9973112704
wum2 9466381952
ll426 8905029760
pmm49 8371290624
jad248 7989755008
yy222 6347266176
rrk24 6182451584
yf9 5816445952
hm444 5719887232
mihali 5459016704
meg98 4376345984
yy532 4123354880
lc848 4090249984
ah633 3367398912
bp272 2906803456
xk4 2478567296
jjl86 2104665728
rdb9 1763952640
msp48 1748680320
as2665 1596345472
ky26 1583088768
jw2394 1562731648
ml724 1557992448
jl56 1480538368
ha275 1467031936
sb238 1275168128
gf3 1189340928
jrb97 1012897664
zc264 1006521728
xs252 920548352
slw67 837290496
pdm32 789250432
lh372 671649152
mx55 662563200
dc547 648720384
jsr59 592016256
xc279 554618880
as898 506352512
gunel 499962624
mpw6 385383040
hz244 374372096
km735 337744640
nb23 324053504
ls926 314810880
keckadmins 265108480
aa544 249558400
xl348 237337088
yf95 198000512
simen 163574272
xz374 162198144
lr579 159751424
nmb38 115795456
jjl83 109213440
mas343 96425216
yk336 95688832
williams 95688832
zl222 68034176
wb244 63682432
rka24 59127808
jhq4 48107776
yy448 46536704
law72 45638912
aa65 44632832
shuch 39508992
gene760 33406080
spb63 26176512
zhao 25241600
ajf73 22082688
amg89 21919360
co254 21889920
an377 19965312
xm24 19335680
jc2296 17970560
jw72 17455616
njc2 16694016
root 9156608
jk935 6167936
cc59 4636672
yz464 1122176
gene760_2016 475520
bab99 387584
dw396 383872
tl444 326144
dr395 185472
mj332 60160
rm658 4096
jjp76 3968

Farnam disk usage

gerstein 5.59995E+11 of 600TB
gg487 72097902080
sl857 46536808960
jx98 42157560192
fn64 37383062784
tg397 36354959232
cy288 35153367680
mg888 32460140160
jz435 28900470144
sk972 21964532864
pse5 20415937280
dl598 19097782272
sl2373 15545731968
cs784 14082331264
mr724 11791422592
sl847 9972764800
wum2 9457952128
ll426 8905029760
pmm49 8371290624
jad248 7989755008
yy532 7351755648
yy222 6347266176
rrk24 6182451584
yf9 5816445952
hm444 5719887232
mihali 5459016704
meg98 4235523456
lc848 4090249984
ah633 3367398912
bp272 2906803456
xk4 2467884544
jjl86 2087435904
rdb9 1763952640
msp48 1748680320
as2665 1596345472
ky26 1583088768
jw2394 1562731648
ml724 1557992448
jl56 1480538368
ha275 1467031936
sb238 1275168128
gf3 1189340928
jrb97 1012897664
slw67 826952192
zc264 802075392
pdm32 789250432
lh372 671649152
dc547 648720384
mx55 635310720
jsr59 592016256
xc279 554618880
as898 506352512
xs252 499962624
gunel 499962624
mpw6 385383040
hz244 374372096
km735 337744640
nb23 324053504
ls926 314810880
keckadmins 265108480
aa544 249558400
xl348 237337088
yf95 197553536
simen 163574272
xz374 162198144
lr579 159751424
nmb38 115795456
jjl83 109213440
mas343 96425216
yk336 95688832
williams 95688832
zl222 68034176
wb244 63682432
rka24 59127808
yy448 46536704
aa65 44632832
law72 43699072
shuch 39508992
gene760 33406080
zhao 25241600
ajf73 22082688
amg89 21919360
co254 21889920
an377 19965312
xm24 19335680
jc2296 17970560
jw72 17455616
jhq4 17082240
njc2 16694016
root 9156608
jk935 6167936
cc59 4636672
yz464 1122176
gene760_2016 475520
bab99 387584
dw396 383872
tl444 326144
dr395 185472
mj332 60160
rm658 4096
jjp76 3968

Need to make a molecule? Ask this AI for instructions

Need to make a molecule? Ask this AI for instructions
http://www.nature.com/articles/d41586-018-03977-w #DeepLearning to do better #retrosynthesis. Perhaps other things in chemistry could be learned as well!

QT:{{”
“The tool, described in Nature on 28 March1, is not the first software to wield artificial intelligence (AI) instead of human skill and intuition. Yet chemists hail the development as a milestone, saying that it could speed up the process of drug discovery and make organic chemistry more efficient.

“What we have seen here is that this kind of artificial intelligence can capture this expert knowledge,” says Pablo Carbonell, who designs synthesis-predicting tools at the University of Manchester, UK, and was not involved in the work. He describes the effort as “a landmark paper”.”
“}}

Math-applied APPLIED MATH PROGRAM: Seminar & Refreshments Thursday, April 10, 2018

APPLIED MATH/ANALYSIS SEMINAR

Speaker Mauro Maggioni, John Hopkins University

Date: Tuesday, April 10, 2018

Time: 3:45p.m. Refreshments (AKW, 1st Floor Break Area)

4:00p.m. Seminar (LOM 206)

Title: “Learning and Geometry for Stochastic Dynamical Systems in high dimensions”

Abstract:

We discuss geometry-based statistical learning techniques for performing model reduction and modeling of certain classes of stochastic high-dimensional dynamical systems. We consider two complementary settings. In the first one, we are given long
trajectories of a system, e.g. from molecular dynamics, and we estimate, in a robust fashion, an effective number of degrees of freedom of the system, which may vary in the state space of then system, and a local scale where the dynamics is well-approximated by a reduced dynamics with a small number of degrees of freedom. We then use these ideas to produce an approximation to the generator of the system and obtain, via eigenfunctions of an empirical Fokker-Planck equation (constructed from data), reaction coordinates for the system that capture the large time behavior of the dynamics. We present various examples from molecular dynamics illustrating these ideas.

In the second setting we only have access to a (large number of expensive) simulators that can return short paths of the stochastic system, and introduce a statistical learning framework for estimating local approximations to the system, that can be (automatically) pieced together to form a fast global reduced model for the system, called ATLAS. ATLAS is guaranteed to be accurate (in the sense of producing stochastic paths whose distribution is close to that of paths generated by the original system) not only at small time scales, but also at large time scales, under suitable assumptions on the dynamics. We discuss applications to homogenization of rough diffusions in low and high dimensions, as well as relatively simple systems with separations of time scales, and deterministic chaotic systems in high-dimensions, that are well-approximated by stochastic
diffusion-like equations.
Mauro Maggioni 4-10 flyer.pdf