国外交流

国外交流

您当前所在位置: 首页 > 对外交流 > 国外交流 > 正文
报告时间 2022年9月6日下午14:00- 16:00 报告地点 腾讯会议:322-378-412
报告人 Ana Trisovic

“一带一路”创新人才交流项目特邀报告

报告人:Ana Trisovic,副教授,哈佛大学

邀请人:李伟

时间:2022年9月6日下午14:00- 16:00

腾讯会议:322-378-412

Speaker introduction: Ana Trisovic is a Sloan postdoctoral scholar at the Institute for quantitative Social Sciences (iqss) of Harvard University. Her research focuses on computational reproducibility, data protection and data science. Working with the dataverse team, she studied how to promote the reuse of research data and code through automation, metadata and encapsulation. Previously, Ana Trisovic was a CLIR postdoctoral fellow at the University of Chicago, where she worked with the Energy Policy Institute (EPIC) and the library. She completed her PhD in computer science at Cambridge University in 2018, and her doctoral thesis is entitled "data preservation and reproducibility of CERN lhcb experiment". During her work at CERN, she worked with lhcb, CERN open data and CERN analysis and preservation group. During her doctoral study, she was a member of Muir wood scholar at Newham college and a winner of CERN doctoral program and Google Anita Borg Memorial Scholarship.

Title 1:How to conduct a big data analysis on air pollution and health? The study design.

Abstract: The talk will present the logistics of planning, designing, and executing a big data analysis on air pollution and health. The talk will give an introduction to epidemiology and basic study design. First, we'll introduce basic terms, concepts and requirements for performing healthcare data analysis. Then, we will talk about the datasets required to undertake the study, in particular, exposure data describing air pollution; and confounders data such as population, geospatial data, weather and climate data, and others. We will talk about conducting descriptive and regression analysis and defending your decisions regarding model selection, interpretation, and presentation.

Title 2:How to conduct a big data analysis on air pollution and health? The computational execution.

The talk will present the logistics of planning and executing analysis on the analytic data set prepared from multiple data sources. It will focus on spatial and temporal data aggregation for statistical analysis on air pollution and health and automating these processes in computational workflows on the high-performance computing infrastructure. We will talk about interpreting the final model in context of your original hypothesis. In the end, we’ll present the best practices for code naming and arrangement, stepwise selection modeling, odd and prevalence ratios, and relative risk. We'll also talk about result dissemination, which is especially challenging when working with sensitive healthcare data.

上一篇:克罗地亚Kozak教授、塞尔维亚Trisovic教授、Rajic教授应邀访问西电开展教学与学术交流

下一篇:“一带一路”创新人才交流项目特邀报告----Yangyang Wang

关闭