class: center, middle, inverse, title-slide # Introduction to Multilevel Analysis ## PSYC 575 ### Hok Chio (Mark) Lai ### University of Southern California ### 2020/07/30 (updated: 2021-08-29) --- background-image: url(img/hierarchical_structure.png) ??? Hi everyone, welcome to PSYC 575, multilevel modeling. In the first week, you'll be hearing from me a quick introduction about multilevel analysis, and some basic concepts. Some of the concepts will be further elaborated in future lectures, so don't worry if some of the ideas may not be very intuitive to you---we'll come back to them. So what is multilevel analysis? A multilevel analysis is a statistical analysis to analyze data that have some forms of a hierarchical structure. A classic example is shown in this picture. Imagine you're collecting student data in Los Angeles, from multiple schools, and multiple school districts. Now, it is very likely that different schools will have different student demographics, so for example, these students here may be quite different from, say, these students here from another school. Now, at the school level, we may have these two schools from one school district, and these two other schools from another school district. Again, with the different geographic locations and policies of the school districts, these two clusters of schools may again be different. Traditional regression analysis is not suitable for analyzing data with kind of hierarchical structure, because it violates the "independence observation" assumption, which we will further discuss in week 3. Multilevel modeling, on the other hand, is well suited to analyze such data. --- # Learning Objectives ### Other names used for MLM ### Types of data structure MLM handles ??? In the next few videos, I'm going to talk about some history of multilevel modeling, abbreviated as MLM, and why it leads to some confusing names in the literature. I'm also going to provide more examples on the types of data MLM can handle. --- class: inverse, center, middle # Naming and History of MLM ??? The title of this course is multilevel modeling, but you may sometimes hear people referring to this kind of analysis by a different name. --- # Alternative names for MLM: - Hierarchical linear model: Education/Sociology/Psychology <sup>1</up> - Variance component model: Statistics <sup>2</up> .footnote[ [1]: [Raudenbush & Bryk, 1986](https://doi.org/10.2307/2112482) [2]: [Aitkin et al., 1981](https://doi.org/10.2307/2981826) ] -- - Mixed model/mixed-effect model: Statistics, biomedical - Random coefficient model: Econometrics ??? The reason that there are several different names. This is not uncommon in statistics; indeed, some have made the joke that the more names is given to a concept, the more important the concept is, so this really says that multilevel modeling is a very important technique. But really, the different naming is mainly because people in different areas of research call them differently. We've seen HLM, which is commonly used in education, sociology, and psychology. Variance component model is used in statistics. Another very popular name for multilevel modeling in statistics and biomedical research is mixed model or mixed-effect model; indeed, this is how multilevel modeling is called in many software, including R, SAS, SPSS, and Stata. Finally, in economics and econometrics, you may also see the term random coefficient model. In this course we're going to use the term multilevel modeling, which I think is easier to understand and is also becoming the standard name in social and behavioral sciences. However, do realize that some other researchers may call it differently even though they are actually using the same technique. --- # History of MLM MLM naturally handles data coming from different "levels" -- Robinson (1950)<sup>1</sup> - State level: Correlation between % illiterate and % foreign born: * `\(r = -.53\)` - Individual level: Within-state correlation between illiteracy and foregin born status: * `\(r = .12\)` [1]: p. 354, https://doi.org/10.2307/2087176 ??? MLM is a technique that naturally handles data coming from different "levels". And it is important to know what the level is when interpreting results. One of an early example of analyzing data across different levels come from Robinson (1950), who used data from the 1930 Census of the United States. The figure here shows the relation between illiteracy, in terms of American English, at the state level. Each point here represents one of the 48 states, as in 1930 Hawaii and Alaska are not part of the union. As you can see, there seem to be a negative association, in that a state with more immigrants would actually have higher literacy in American English, which would seem counterintuitive. I would encourage you to pause the video to think about why this is the case. What's interesting here is that when you talk about associations between variables, they can be very different depending on what levels you're looking at. At the state level, we're talking about proportion of people who's illiterate and proportion of people who's foreign born. And the correlation of that, based on Robinson's account, was -.53. However, we may also be talking about whether an immigrant is more or less likely to be illiterate, in which case we're talking about a person-level association. In this case, we need to look at the correlation within a state, in which case Robinson tells us that the correlation is actually positive, meaning that an immigrant is actually more likely to be illiterate. You will see many more examples whether the correlations are quite different across levels of analysis. These observations are part of what motivates people to develop methods to analyze such data. --- class: inverse, middle, center # Usage of MLM --- # Hierarchical Data Structure .pull-left[ Multiple units at a lower level nested within a unit at a higher level Level 1 | Level 2 --------| ------- Clients | Therapist Classrooms | School Employees | Organization People | Family Citizens | Country Measurements | Person ] ??? As we have seen earlier, a hierarchical structure is one where multiple lower level units are nested within a higher level unit. There are plenty of examples in social and behavioral sciences, and here are some examples. Notice that by convention, we call the units at the lowest level as level 1. Here are some other examples. Note that although the lower level is usually persons, sometimes it can be things like classrooms nested within multiple schools. And the last one here is one where a person gives multiple measurement, which essentially covers all within-subject experiments, and longitudinal studies. -- .pull-right[ ## Network Graph
] ??? A useful way to represent hierarchical data structures is to use what is called a network graph. Here there are eight lower level units and three higher level units, and the lines show which lower level units are related to which upper level units, like here A is related to unit 1. I would encourage you to take a moment to convince yourself that all the examples in the table on the left can be represented by a network graph like the one here. --- class: middle, center <a title="Salvor / CC0" href="https://commons.wikimedia.org/wiki/File:Vistfr%C3%A6%C3%B0ikenning.svg"><img width="512" alt="Vistfræðikenning" src="https://upload.wikimedia.org/wikipedia/commons/thumb/f/f0/Vistfr%C3%A6%C3%B0ikenning.svg/512px-Vistfr%C3%A6%C3%B0ikenning.svg.png"></a> ??? One way to think about MLM is to consider Urie Bronfenbrenner's ecological systems theory. It is a statistical technique that is geared to study how contexts, be it families, institutions, and culture, influences individuals, and also brings in the time perspective. --- class: center, middle # Once you know that hierarchies exist, you see them everywhere. [Kreft & de Leeuw (1998, p. 1)](https://dx.doi.org/10.4135/9781849209366) .footnote[ Kreft, I. G., & de Leeuw, J. (1998). _Introducing multilevel modeling_. Sage. https://dx.doi.org/10.4135/9781849209366 ] ??? Because individuals are always situated in some contexts, the use of MLM is extremely common. To use the quote by Kreft and de Leeuw's classic textbook on MLM, "Once you know that hierarchies exist, you see them everywhere." --- class: middle, center # When it comes to regression, multilevel regression deserves to be the default approach [McElreath (2020, p. 400)](https://doi.org/10.1201/9780429029608) .footnote[ McElreath, R. (2020). _Statistical rethinking: A Bayesian course with examples with R and Stan (2nd ed.)_. Chapman & Hall/CRC. https://doi.org/10.1201/9780429029608 ] ??? Because of how common it is used, there are also scholars that propose that multilevel regression should be the default method for statistical analyses. --- # Sampling the Literature PsycINFO, 2020 May-Aug, with keywords ``` "multilevel model*" OR "mixed model*" OR "mixed-effect model*" OR "hierarchical linear model*" ``` .center[ ![](img/PsycINFO_search_2020.png) ] ??? To show the broad range of studies that used MLM, I did a quick search on PsycINFO on studies in the past three months. As you can see, it has been used in many different areas of psychological research. --- Recent literature in 2021 class: middle - Threshold dose for intravenous nicotine self-administration in young adult non-dependent smokers - A cross‐level analysis of organizational knowledge creation: How do transformational leaders interact with their subordinates' expertise and interpersonal relationships? - Benefits of movement-integrated learning activities in statistics and research methods courses - Levels and instability of daily self-esteem in adolescents: Relations to depressive and anxious symptoms - Internet-delivered cognitive processing therapy for individuals with a history of bullying victimization: A randomized controlled trial - Problems of cross-cultural criminology no more! Testing two central tenets of self-control theory across 28 nations - Emotional well-being in covid-19 mass quarantine: The role of personal response and life activity: A 14-day diary study in china ??? Here are some of the titles, some of which may be relevant to your research. Indeed, you're encouraged to check out some articles in your area of research that use MLM, and you'll be doing exactly that in Homework 1. --- # Software .pull-left[ ## General stat software - R (**`lme4`**, `brms`, `nlme`) - SAS - SPSS - Stata ] .pull-right[ ## Specialized software - HLM - MLwiN - Mplus ] ??? The last two pages listed some software and resources that are useful for doing MLM. I encourage you to check out those resources. --- # Resources - Centre for Multilevel Modeling, University of Bristol http://www.bristol.ac.uk/cmm/learning/multilevel-models/ - Textbook examples (with syntax for different software), Institute for Digital Research and Education, UCLA https://stats.idre.ucla.edu/other/examples/ - GLMM FAQ by Ben Bolker https://bbolker.github.io/mixedmodels-misc/glmmFAQ.html - Curran-Bauer Analytics, Multilevel Modeling Archives https://curranbauer.org/category/news-and-updates/multilevel-modeling/ - Multilevel Modeling Discussion List (Listserv) https://www.jiscmail.ac.uk/cgi-bin/webadmin?A0=multilevel