86M 4.2% Executive Summary. Valuation Whitepaper. The purposes of this paper are threefold: At a Glance. Median absolute prediction error (MdAPE)

Similar documents
*Predicted median absolute deviation of a CASA value estimate from the sale price

AVM Validation. Evaluating AVM performance

Glossary of Terms & Definitions

Technical Description of the Freddie Mac House Price Index

Assessment-To-Sales Ratio Study for Division III Equalization Funding: 1999 Project Summary. State of Delaware Office of the Budget

Demonstration Properties for the TAUREAN Residential Valuation System

The TAUREAN Residential Valuation System An Overview

METHODOLOGY GUIDE VALUING LANDS IN TRANSITION IN ONTARIO. Valuation Date: January 1, 2016

Trends in Affordable Home Ownership in Calgary

What s Next for Commercial Real Estate Leveraging Technology and Local Analytics to Grow Your Commercial Real Estate Business

EVGN 11. The Valuer s Use of Statistical Tools

86 years in the making Caspar G Haas 1922 Sales Prices as a Basis for Estimating Farmland Value

METROPOLITAN COUNCIL S FORECASTS METHODOLOGY JUNE 14, 2017

REDSTONE. Regression Fundamentals.

Past & Present Adjustments & Parcel Count Section... 13

Introduction. Bruce Munneke, S.A.M.A. Washington County Assessor. 3 P a g e

MONTHLY HOUSE PRICE INDEX REPORT

PROPERTY TAX IS A PRINCIPAL REVENUE SOURCE

IREDELL COUNTY 2015 APPRAISAL MANUAL

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

7224 Nall Ave Prairie Village, KS 66208

American Community Survey 5-Year Estimates

Housing market and finance

SELECTED HOUSING CHARACTERISTICS American Community Survey 5-Year Estimates

next generation automated collateral risk solution for those who demand true valuation

An Introduction to RPX INTRODUCTION

METROPOLITAN COUNCIL S FORECASTS METHODOLOGY

The Accuracy of Automated Valuation Models

American Community Survey 5-Year Estimates

Hennepin County Economic Analysis Executive Summary

ANOVA Method (Gage Studies Variables)

Following is an example of an income and expense benchmark worksheet:

NCREIF Research Corner

Washington Department of Revenue Property Tax Division. Valid Sales Study Kitsap County 2015 Sales for 2016 Ratio Year.

PROPERTY DEVELOPMENT REPORT

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Sales Ratio: Alternative Calculation Methods

American Community Survey 5-Year Estimates

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Cook County Assessor s Office: 2019 North Triad Assessment. Norwood Park Residential Assessment Narrative March 11, 2019

Estimating National Levels of Home Improvement and Repair Spending by Rental Property Owners

EXPLANATION OF MARKET MODELING IN THE CURRENT KANSAS CAMA SYSTEM

Washington Market Highlights: Fourth Quarter 2017

CABARRUS COUNTY 2016 APPRAISAL MANUAL

Washington Market Highlights: Third Quarter 2018

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Using Hedonics to Create Land and Structure Price Indexes for the Ottawa Condominium Market

Value of Building Work Put in Place: March 2013 quarter

Cook County Assessor s Office: 2019 North Triad Assessment. Evanston Residential Assessment Narrative Updated: April 8 th, 2019

STATE OF OHIO FINANCIAL REPORTING APPROACH GASB 34 IMPLEMENTATION ISSUES TRANSPORTATION INFRASTRUCTURE

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

How to Read a Real Estate Appraisal Report

14 N O V E M B E R

Regression + For Real Estate Professionals with Market Conditions Module

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Bargara Property Factsheet

Washington Market Highlights: Fourth Quarter 2018

Definitions ad valorem tax Adaptive Estimation Procedure (AEP) - additive model - adjustments - algorithm - amenities appraisal appraisal schedules

Description of IHS Hedonic Data Set and Model Developed for PUMA Area Price Index

METHODOLOGY GUIDE VALUING MOTELS IN ONTARIO. Valuation Date: January 1, 2016

MONTHLY HOUSE PRICE INDEX REPORT

An Assessment of Current House Price Developments in Germany 1

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

An Assessment of Recent Increases of House Prices in Austria through the Lens of Fundamentals

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

ASSESSORS ANSWER FREQUENTLY ASKED QUESTIONS ABOUT REAL PROPERTY Assessors Office, 37 Main Street

Chapter 35. The Appraiser's Sales Comparison Approach INTRODUCTION

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

National Policy Statement on Urban Development Capacity Price efficiency indicators technical report: Price-cost ratios

Illinois Farmland Sales Database

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Summary Statistics. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Percent of Original List Price Received

A View Like Never Before

DIRECTIVE # This Directive Supersedes Directive # and #92-003

Initial sales ratio to determine the current overall level of value. Number of sales vacant and improved, by neighborhood.

Summary Statistics. Closed Sales. Paid in Cash. Median Sale Price. Average Sale Price. Dollar Volume. Median Time to Sale.

Creswick Property Factsheet

If you want even more information, look for the advanced training, which includes more use cases and demonstrates CU s full functionality.

2018 Profile of Home Buyers and Sellers

Cook County Assessor s Office: 2019 North Triad Assessment. Elk Grove Residential Assessment Narrative April 16th, 2019

Assessment Year 2016 Assessment Valuations / Mass Appraisal Summary Report

High-priced homes have a unique place in the

Course Residential Modeling Concepts

STEVEN J. DREW Assessor OFFICE OF THE ASSESSOR Service, Integrity, Fairness, Internationally Recognized for Excellence

Quarterly Market Detail - Q Single Family Homes Miami-Fort Lauderdale-West Palm Beach MSA

Quarterly Market Detail - Q Townhouses and Condos Miami-Fort Lauderdale-West Palm Beach MSA

Volume Title: Well Worth Saving: How the New Deal Safeguarded Home Ownership

Estimating Poverty Thresholds in San Francisco: An SPM- Style Approach

Alexandria Property Factsheet

Transcription:

Executive Summary HouseCanary is developing the most accurate, most comprehensive valuations for residential real estate. Accurate valuations are the result of combining the best data with the best models. This whitepaper provides transparency to our performance and methodology used to achieve the most accurate nationwide published real estate valuations. The HouseCanary valuation algorithm is designed to use all available market data to estimate the most likely value that a property would currently transact for at arm s-length. HouseCanary strives to put a value on every single family home, condo, and townhouse in the United States. The value range should be as narrow as possible while still providing an approximate 68% coverage probability (one standard deviation) on actual arm s-length sale prices. The purposes of this paper are threefold: (i) (ii) (iii) Identify the data included in the valuation algorithm Provide context for the logic behind the valuation algorithm Define key model outputs At a very high level, the valuation algorithm is composed of the following three steps: Query and clean relevant data from HouseCanary s database, build localized price indices to adjust all past prices to current, and train valuation models from time-adjusted historical prices. At a Glance 86M U.S. residential real estate properties 4.2% Median absolute prediction error (MdAPE) All 50 states Models for 3 property types Updated monthly Machine learning-based Third party-validated Performance reported online Copyright 2016 HouseCanary Inc. All rights reserved. Page 1 of 12

Table of Contents Executive Summary... 1 Modeling Principles... 3 Data... 3 Ensuring Data Quality... 4 Methodology... 5 Forecast Standard Deviation... 9 Testing and Validation... 10 Future Capabilities... 11 About the HouseCanary Research Team... 12 Contact... 12 Copyright 2016 HouseCanary Inc. All rights reserved. Page 2 of 12

Modeling Principles The key to accurate valuations = comprehensive data + machine learning Traditional valuation models only consider recently transacted nearby comparable properties in their determination of value. While recently transacted comparable properties are an important piece of the valuation puzzle, we can achieve better valuations by also considering all previous arm s-length sales for the subject property and neighborhood, as well as information from multiple other sources: Macroeconomic data, capital markets data, mortgage records, search and social data, and house/parcel data. The primary assumptions behind HouseCanary s model are the following: 1. Prices in a given neighborhood tend to move together through time. 2. It is statistically more efficient to use all known historical price events instead of only those occurring within the past few years. 3. After accounting for trends in price and changes in the condition and structure of the underlying asset, the best predictor of the value of a home is most often past price occurrences of that home. 4. Machine learning-based algorithms are better suited than classical models at recognizing and exploiting higher-order complex relationships among input variables and their relationship to the response variable. 5. Human effort should focus on enhancing existing datasets to generate better data that can be fed into the algorithm, further improving valuation accuracy. Data Nationwide County Assessor + County Recorder + MLS + Other Property Level Data Property level data entering the valuation algorithm are composed of public record data, Multiple Listing Service (MLS) data and other property level information. Public record data that includes property characteristics sourced from 3,100+ county assessor offices and historical recorded sales prices sourced from 2,700+ county recorder offices over the previous 20 years, where available. MLS data utilizes property characteristics, listed prices, and contract prices. Other property information includes data on mortgage balances, mortgage type, measures of financial distress, along with other details that impact value, such as proximity to a busy street, golf course, or view. For the purpose of non-disclosure states, contract prices from the MLS are used as a substitute for recorded sales prices if we can jointly verify that an arm s-length sale was actually recorded from the county recorder s office. Data from both MLS and public record are refreshed daily. The only potential delay in our dataset would be due to delays from the original source. As an example, if a particular county takes three months to add recorded transactions to its electronic data file, we will not have access to those transactions for three months after they were recorded. Market level summary data typically gets compiled and added into the database monthly or quarterly. Copyright 2016 HouseCanary Inc. All rights reserved. Page 3 of 12

Ensuring Data Quality HouseCanary achieves trusted data quality levels with systems and processes developed and supported by a team of data engineers, data analysts, and domain experts. Our system is designed for complete end-to-end visibility of data flows with layers of dynamic, intelligent controls. Our process starts with full profiling of any data source that will feed into our platform. For the profiling, we use multiple full data instances over time to get a complete picture of content and expected changes. We create fieldby-field, value-by-value rules that determine how the data maps into our content management system. Those rules, combined with intelligent monitoring, provide the first layer of control that flags suspicious data changes, anomalous and unexpected values, and inconsistent data use. Flagged content gets quarantined until approved as valid, the issue gets remedied by the source, or new handling logic gets implemented. Once validated data flows into the content managements system, it gets linked and normalized at the address, building, and census levels. With the content linked, a second quality control pass uses multi-source comparison to arrive at a consensus view of the correct and usable data for a given object. Only this data gets fed into products and models. We use a United States Postal Service Coding Accuracy Support System (CASS) certified service to validate, standardize and match all addresses that feed into our system. This includes all addresses from data feeds and user inputs. All matching and standardization gets handled by this service. To protect against degradation related to updates, any change of any component of a full address triggers a new validation, standardization, and matching event for the subject address and all previously matched and related addresses. Copyright 2016 HouseCanary Inc. All rights reserved. Page 4 of 12

Methodology Three steps i) query and clean data, ii) build price indices, iii) train models Step 1: Gather, combine, and filter data Prior to modeling, the first step involves gathering, combining, and filtering all data we have access to. This step includes, but is not limited to, identifying valid historical arm s-length sale prices and market clearing list prices, and determining valid property characteristics when data differs across sources. The filtered historical sales and list prices form the basis for our response variable and price indices. Each model is built and run at the census tract level. When a single census tract does not have enough data, data from neighboring tracts is brought in to supplement the target tract. If neighbors plus target still does not yield enough observations, we continue to pull in data from neighbors of neighbors until the required sample size threshold has been met. Neighboring census tract information is only used to train models, which are then used to value properties within the target tract. Separate models will be built for the neighboring tracts, each individually treated as a subject tract. Step 2: Build localized price indices to adjust all past prices to current values The second step is to create neighborhood home price indices by property type. To illustrate, the time scatter plot in Exhibit 1 shows all transactions within a census tract over the last 20 years. The vertical axis represents price, in dollars. Census tracts are further subdivided by the census into a series of blocks. Both tracts and blocks each form a non-overlapping partition of the United States. Exhibit 1: All transactions within a census tract over the last 20 years Copyright 2016 HouseCanary Inc. All rights reserved. Page 5 of 12

Valuation Whitepaper As an example, Exhibit 2 plots all identified price instances by property type for four individual census blocks within the target census tract. Census block is the smallest quantifiable group of properties in the US - there are approximately 10 million US census blocks. Property types are denoted by the color of the points. Single family home prices are shown as yellow dots, and condos are shown as green dots. The purple line represents the model estimated single family home price index for that particular block, and the red line represents the model estimated condo price index for that block. Machine learning methods are used to create smooth block-level price indices. These methods allow us to borrow information from surrounding blocks to estimate block indices, even in blocks with very small sample sizes. As an example, Block 4 only contains condos, and has a very limited number of historical price instances. However, the model is able to estimate an index over the entire 20-year time period using the behavior of condo prices from other similarly priced condos in the census tract. These indices are used to bring the price of all valid historical sale and list prices to current values. By controlling for both time and location prior to fitting the valuation models, we obtain a much larger model dataset than if we limited ourselves to only using closed sales prices over the previous one or two years. Exhibit 2: All price instances by property type for four individual census blocks Single family home prices Condo prices Estimated single family home price index for respective block Copyright 2016 HouseCanary Inc. All rights reserved. Estimated condo price index for respective block Page 6 of 12

To show the effect of time adjustment at the block level, Exhibit 3 shows the empirical distribution of the resulting time-adjusted current prices in Block 3. There are clearly two distinct price populations in this block. Condos with a median value of $210k (red distribution adjusted from red price index in Exhibit 2, Block 3), and single family homes with a median value of $445k (purple distribution adjusted from the purple price index in Exhibit 2, Block 3). Exhibit 3: Effect of time adjustment at the block level Exhibit 4: Time-adjusted price deltas from block median Once the time-adjusted prices are calculated, the final step in generating the dependent variable is to use a transformation to center the prices around their respective current block median price for each property type. Centering is done as a percentage deviation from the current block median estimated price for each property type. The distribution of the price deltas across the entire census tract is shown in Exhibit 4. The red vertical lines represent the empirical 16 th and 84 th percentiles, i.e., the bounds in which approximately 68% of the data resides (one standard deviation). In the example census tract, these values correspond to approximately +/-0.14. In other words, if we only used the estimated current median block price, the property type, and the date of occurrence to estimate all known historical prices in this census tract, we would be within 14% of actual historical price approximately 68% of the time. The quantity 14% is referred to as model forecast standard deviation (fsd), explained in-depth in the Forecast Standard Deviation section. The deltas in Exhibit 4 represent the remaining unexplained variance among all historical prices in the census tract after we have accounted for the median block price, property type, and the date in which the price occurred. The final step seeks to further explain this unexplained variance in terms of many other observable data inputs. Copyright 2016 HouseCanary Inc. All rights reserved. Page 7 of 12

Step 3: Train valuation models from time-adjusted historical prices The third and final step involves fitting machine learning-based models to explain the price deltas shown above. In this step we account for many more variables beyond property type, block median price, and time. Depending on the data available, these models can include property characteristics, neighborhood characteristics, macro and micro economic data, spatial relationships, repeat price observations and much more. Machine learning-based methods allow us to account for both high order interactions and nonlinear relationships with the response variable. We currently run four individual models, each designed to accommodate varying amounts of data available by property. Machine Learning Primer: Machine learning applications for real estate valuation Real estate valuation involves predicting an output from one or more inputs, also known as a supervised learning problem. The field of machine learning encompasses a set of algorithms designed specifically to tackle this type of prediction problem. Machine learning algorithms arrive at a solution by learning patterns from large amounts of data in order to make predictions. They seek to minimize the error from predictions by using information from many different inputs. The computer is presented with inputs and outputs. The goal is to learn a general rule, specified by the machine learning algorithm, which maps the inputs to the outputs very accurately for the entire population of homes. The algorithmic process is iterative and often performs many iterations of error minimization in order to produce a robust and highly accurate prediction. The patterns that exist in the data are not subject to the same parametric model restrictions as classic statistical models. The learning in the algorithm uncovers interactions among many different variables that cannot be functionally defined in traditional statistical models. The algorithms are naturally suited to capture both high-order relationships with the output variable and interactions among the inputs. Lastly, it is easily possible to consider many more inputs than actual observations. This means that a model may potentially contain thousands of inputs in order to achieve the most accurate prediction possible. So what does all this mean for real estate valuation? If there truly exist predictive signals within the input dataset, these methods will find them with enough iterations. It is worth pointing out that the improved predictive performance of these methods comes at the expense of the additional computational time required to train them. Copyright 2016 HouseCanary Inc. All rights reserved. Page 8 of 12

From these models, we generate an estimated price delta for every property in the census tract. Finally, the delta estimates are back-transformed to yield a historical price estimate. This is done for all four models within the algorithm. Coming up with a single value estimate involves weighting how well each of the four models was able to estimate the observed historical prices. Those models with better predictions receive higher weights. Exhibit 5 shows how the final weighted historical price estimates compare to actual observed historical prices in this census tract. Exhibit 5: Model fit versus all observed historical prices Forecast Standard Deviation Model Derived Measure of Uncertainty The HouseCanary forecast standard deviation is a measure of model uncertainty. It is a quantity used to create the upper and lower bounds on the value estimate. The value range represents the quantity such that the range will actually capture the realized market value in an arm s-length sale approximately 68% of the time (one standard deviation). As an example, if the property fsd is 0.07, then the upper bound on value is given by P*(1+0.07) and the lower bound on value is given by P*(1-0.07), where P is the estimated price for that property. If an arm s-length sale were to occur shortly after the estimate was generated, there is a 68% probability that the estimated range will cover the actual realized price. It is worth noting that the empirical error distribution is not necessarily normally distributed. Therefore, users cannot obtain 95% coverage intervals by simply multiplying the fsd by 2, and updating the formula above. HouseCanary will make intervals of differing coverage probabilities available in a future release. Another closely related quantity to the fsd is the confidence score. The confidence score is simply 1-fsd. If fsd equals 0.07 then the confidence score is 0.93. It is common to see both these quantities scaled up by a factor of 100, i.e., 7 and 93. Copyright 2016 HouseCanary Inc. All rights reserved. Page 9 of 12

The fsd in our models is largely derived from the tract level empirical error distribution. Recall that the empirical error distribution is the collection of percent deviations obtained by comparing estimated historical prices to actual historical prices. In the previous plot, actual historical prices appear on the vertical axis and estimated historical prices appear on the horizontal axis. The plot in Exhibit 6 shows the empirical error distribution for these same observations. Exhibit 6: Empirical error distribution The vertical red lines indicate the 16 th and 84 th error percentiles, i.e., 68% of all the deviations fall within these red lines. In this case, these red lines are approximately +/-0.045, and hence the average fsd for this census tract is approximately 0.045. Individual properties may deviate somewhat from 0.045 depending on individual characteristics, but 0.045 is a good approximation for the fsd for properties within this census tract. Testing and Validation Continuous Internal Testing + Quarterly Third-Party Testing HouseCanary runs continuous testing on our valuation accuracy and coverage. Test results are updated weekly and available at http://www.housecanary.com/our-method-valuation. All tests are composed of a six-month testing window. As an example, tests published on June 1 st, 2016, would be composed of a test sample over the period December 1 st, 2015 to May 31 st, 2016. The reason a six-month moving test window is used is due to data delay in certain states. It is not uncommon for certain states to have a three to four month delay in reporting sold prices via the county recorder and/or MLS. Test accuracy attempts to measure how close a value estimate produced before an actual arm s-length sale was to the subsequent sale price. As an example, lets say HouseCanary produced a set of nationwide property estimates using all known data through March 31 st 2016. If the next scheduled model run takes place on April 30 th, 2016, then all sales occuring for the month of April would be compared to the most recently available value estimates prior to their sale date, the estimated values as of March 31 st, 2016. HouseCanary s ongoing internal tests track all of the performance measures below. Copyright 2016 HouseCanary Inc. All rights reserved. Page 10 of 12

1. hit_rate The proportion of sold properties in which we had an estimate of value prior to the sale. 2. Median_Abs_Pct_Err The 50 th percentile of absolute error in percentage terms. In other words, if this value equals 6.0%, then half our estimates were within +/-6.0% of actual sales price, and half were outside +/-6.0% of actual sales price. 3. Median_Pct_Err The 50 th percentile of actual error in percentage terms (not absolute error). Values close to zero imply that the estimator is unbiased. 4. Within X% The percent of estimates that fell within +/-X% of actual sales price. HouseCanary produces this value for the 5%, 10%, and 20% bounds. 5. Within_HC_Prediction_Interval The percent of actual sales prices that fell within HouseCanary s upper and lower estimates of value. The coverage probability of HouseCanary s upper and lower bounds is 68%. Therefore, this value should fall somewhere close to 68%, and values near 68% indicate that our model is accurately measuring the unexplained variance in price. As error rates continue to decrease, the width of the intervals will get smaller while still maintaining the target 68% coverage probability. As of July 6 th, 2016, HouseCanary s continuous internal testing over the previous six months yielded a national MdAPE of 4.6% on 1,001,846 transactions. In addition to internal testing, HouseCanary undergoes quarterly thirdparty testing. On a blind sample, the most recently completed third-party test measured over the second quarter of 2016 yielded a national MdAPE of 4.2%, compared to actual contract price. Detailed test results, including the metrics above, are available by request at the national, state, and MSA levels. At the national level, HouseCanary s internal results are further available by property type and by the month in which the closed sale price occurred. Future Capabilities Real-Time Valuation and Updates to Existing Valuations The current algorithm produces static nationwide estimates of value monthly. These are stored and used as our best estimates of value over the next month. Values are replaced once the algorithm runs again the following month and produces a new set of value estimates. There are a two areas for improvement with this approach. First and foremost, new data received does not get considered until the next run. As an example, this could include additional sales transactions within the neighborhood, or updated property characteristics for one or more properties within the neighborhood. Second, we can only value properties that we have record of in our database. As an example, new homes may take 6+ months to be reported to us by the county assessor. In the current setup, we cannot value these until we learn of the property from the assessor 6+ months from now. Future development will enable real-time valuations once a minumum set of information is entered about an unknown property. Minimally required input from the user includes an address that we can validate and the property type. Optional customer input will include corrected and/or previously unknown property characteristics. With the customer input in hand, a backend service will send the new information to a set of stored model objects to generate an updated valuation. Copyright 2016 HouseCanary Inc. All rights reserved. Page 11 of 12

About the HouseCanary Research Team HouseCanary s research team is composed of PhD statisticians, economists and mathematicians. Full bios of our research team are available at http://www.housecanary.com/about. Contact Please contact us with any questions or comments at sales@housecanary.com or 855.218.9597. Copyright 2016 HouseCanary Inc. All rights reserved. Page 12 of 12