I want to get a function that predicts wheat yields on the basis of climate and soil. I have extracted (with Malcolm help at the GIS end) some basic climate data from an old map of Britain. I did a regression of wheat yields against elevation, amounts of rainfall and length of the growing season. The results are below. This is what the results mean: ELEV is negative, meaning the higher up the farm, the less the yield. Makes sense. GS is growing season....strangely the longer the growing season, the lower the yield. The two 'RAINFALL' variables show the effect of rainfall of 1000mm a year and 1250mm a year. The coefficient for the 1250 is greater (at -4.42) than the one for 1000 (at -1.34). Meaning is that the greater the rainfall, a large reduction in yield. See the adjusted R-squared on the right? That gives us the percentage of the variation in wheat yield explained by the model. Here it is 0.3094 or a tad over 30%. This isn't very good, but better than I had expected. Now I need to add soil data and more accurate climate data, such as hours of sunshine.

## Monday, January 31, 2011

### Market distance and wheat yields

I am still having problems sleeping because I can't understand why we have some positive signs for distance to market. The theory is that the sign should be negative, meaning that the further the farm is from the market, the lower the rent. Makes sense, doesn't it? So why am I getting a positive sign for some regions in the the south-west of England. Could be related to relatively high yields. Below is a map showing in the top panel our 715 parishes with the wheat yields. In the bottom panel is a map showing the magnitude of the coefficient of the variable 'market distance'. The areas I have circled in the top panel show high yields, in the bottom panel a positive sign. The two areas seem to correspond, don't they?

So it could be that the gains to the farmer of high yields more than make up for distance to market. I'll work on the math to get a function for this and then test it.

So it could be that the gains to the farmer of high yields more than make up for distance to market. I'll work on the math to get a function for this and then test it.

## Monday, January 24, 2011

### Climatic data found

Malcolm has scored what look's like a bulls-eye. I asked him to locate historical data that we could use to test my hypothesis about weather causing a positive sign in the distance to market regression. Today he found some weather maps which look just the thing---I am particularly interested in July rainfall and August/September sunshine. Wheat does best with good rain in July and then a hot dry couple of months. Somewhere also in his find is some historical time-series data. I need that to calculate the variance. The weather maps give an average which is really useful, but it would be nice to know the variance. The two sequences {1,2,3,4,5} and {3,3,3,3,3} have the same mean but very different variances. If you were a farmer, the historical variance in your rainfall and sunshine might alter your cropping pattern and therefore the rent that you might bid for the use of the land.

## Saturday, January 22, 2011

### Possible interesting solution to distance to market mystery

I haven't been able to sleep these last two nights for thinking about why distance to market might

*increase*rent (ie has a positive sign in the regression). I posted something about this mystery a few days ago. I think the solution might be connected with perceptions of risk. Farmers who face stable long-term conditions with regard to climate can generally out-bid farmers who are concerned only about the short-term. Folks with deep pockets can wait out the troughs because they aren't so worried about bringing home food to their family every single day. They can store food or buy it. So it is possible that the unusual pattern of a positive sign might be caused by highly variable weather conditions in that location. So I need to go back over metereological records and calculate the coefficients of variation for temperature and rainfall in various parts of the southwest. I'll put the numbers into the regression and see what happens. This is fun!### Track, rent and causality

I have been working on the mathematical model for the relationship between the laying of railway track and changes in rent. A bit of a problem is showing that there is 'causality' between the two. How can we prove that track caused change in rent? The answer is we can't, and the whole area of causality is frankly speaking a philosophical minefield. It is extraordinarily difficult to show that one thing 'causes' another. For our purposes, the only tool we can use is Granger causality which tests the relationship using time. What we want to see is that rent changes AFTER a change in track, not simultaneously or even worse, before. Malcolm has just given me the track for our eighth estate and I have done the Granger test on all of the them. I'm pleased to say that they all scraped through, some only just. I limited the range of years from 1832-1882 which gives us a half-century. We don't have thorough track measurements for the period after 1872, and wheat and cattle prices were highly volatile in the 1880s. That's OK...we've made the point.

I've been using an interesting form of regression, called Vector Auto-Regression or VAR to get out the stats. It is simply beautiful! Life doesn't get better than this!

I've been using an interesting form of regression, called Vector Auto-Regression or VAR to get out the stats. It is simply beautiful! Life doesn't get better than this!

## Tuesday, January 18, 2011

### Wheat flows within Britain

Mi has helped me with the data for wheat flows within Britain towards the end of the 19th century. The map shows whether a county was in surplus or deficit. The calculations are for production by county minus consumption within the county. What is left over could be carted and sold to another county. As you can see, the pattern is predictable: the counties in blue had a surplus. These counties are on the arable lands towards the east and south of the country. The areas most in deficit were the sheep raising and also industrialising counties, in red and orange. The country as a whole had a net deficit which was covered by imports from Ireland and also Prussia, and later on, the United States. The point of the map is to show that not much grain moved within Britain by rail. It went by coastal steamer.

Calculating net flows is a useful step in analysing the agricultural structure of a country. We could if we wanted expand the scope to include European countries and North America. Here we had to make an assumption that the per capita consumption of wheat (in the form of bread) was the same across counties. There is evidence for and against that assumption.

Calculating net flows is a useful step in analysing the agricultural structure of a country. We could if we wanted expand the scope to include European countries and North America. Here we had to make an assumption that the per capita consumption of wheat (in the form of bread) was the same across counties. There is evidence for and against that assumption.

## Monday, January 17, 2011

### Malcolm's comment about railway shareholding

Malcolm made an interesting comment on my last post. We were discussing whether the fact that the owner of a large estate invested in railway shares was interesting or not. Malcolm pointed out that the very substantial investment by the Earl of Leicester, owner of the very large Holkham Hall estate, in railway shares, was just prior to the expansion of railway track near his estate. This would indeed be interesting if:

- we could see a pattern, such as other estate owners also making large investments
- we could infer from this that the estate owners knew that they would be able to increase rents as a result of 'extracting' the savings from their tenants. If thus was the case, the estate owners were getting a free ride: their investment in railway shares would be (probably) be profitable AND they trousered the extra rents. Nice work if you can get it!

## Friday, January 14, 2011

### Connectivity in the railways paper

Some interesting advances today in the railways paper.

1. Malcolm has been calculating the length of track available cumulatively on an annual basis from 1832 within a 40km radius of each of our six estates. So far the regressions of rent against track, controlling for wheat and cattle prices for the period 1832-1899, have been excellent. Today I was writing up the mathematical model for the paper and I am not sure that just length of track is enough. How about if there was a very long track but only one station? Of course that didn't happen, but we need to have a variable that we can prove mathematically. While we'll stick with track lengths for the moment, let's consider other options. We want a measure of connectivity: how easy was it for a farmer to get his cattle on a train? How about number of stations? From this I thought about network theory.....and nodes. I found there is a useful add-in to Excel called

I tried this for Norfolk. Here is the map of modern stations in Norfolk and a bit of the Nodexl output. There is obviously a lot I need to learn about Nodexl but it does have the advantage of having some math theory to back up the output stats.

3. I think I've found somewhere equally interested in Victorian railways, found

1. Malcolm has been calculating the length of track available cumulatively on an annual basis from 1832 within a 40km radius of each of our six estates. So far the regressions of rent against track, controlling for wheat and cattle prices for the period 1832-1899, have been excellent. Today I was writing up the mathematical model for the paper and I am not sure that just length of track is enough. How about if there was a very long track but only one station? Of course that didn't happen, but we need to have a variable that we can prove mathematically. While we'll stick with track lengths for the moment, let's consider other options. We want a measure of connectivity: how easy was it for a farmer to get his cattle on a train? How about number of stations? From this I thought about network theory.....and nodes. I found there is a useful add-in to Excel called

**NodeXl**that will draw a graph of connections and also give us a statistic of how connected a network is. So we could update the network year by year and then use the annual statistics of connectivity as the variables in the regression. We could get names of stations and years when they opened from railway timetables, the famous**Bradshaw**.I tried this for Norfolk. Here is the map of modern stations in Norfolk and a bit of the Nodexl output. There is obviously a lot I need to learn about Nodexl but it does have the advantage of having some math theory to back up the output stats.

Nodexl output of some of the map. I got bored before I put in all the stations |

Modern rail network |

Holkham Hall is close to Fakenham towards the north of the map. When I was a kid we went on family holidays at Hunstanton on the coast to the north-west. And we went on the railways!

2. I was looking for involvement by estate owners in railway companies. Mi found me a book on Holkham Hall and guess what: the Earl of Leicester, the owner of the Holkham Estate, spent a year and a half of the estate's profit in buying railway shares in the 1880s. That was a lot of money. AND he campaigned for a branch line to run near the estate. Mi is getting me more books on this. I'd like to know whether the owners of our other estates also bought railway shares. But is this actually **useful**data? What do you think?3. I think I've found somewhere equally interested in Victorian railways, found

**his blog**by chance this morning. Beautifully laid out and all the sources meticulously included.## Thursday, January 13, 2011

### Railways paper: six panels up

We now have six 'panels' each representing an estate's rent for the period 1832-1899. That is a total of 337 observations allowing for missing data and the 'instruments' needed to adjust for temporal effects. Malcolm just sent me the track records for the last estate, Badminton....I held my breath while the computer did the work (takes a few minutes). I was blue in the face when the answer came out---still significant! So we have a model that looks like this:

Note the positive sign for track, which means that an increase in railway within 40 km of the estate increases the rent. The effect isn't large, probably because the difference in costs between using the railway and 'droving' the livestock along a road wasn't a large part of the total farm budget. BUT it is statistically significant, and that is what counts.

One of the six estates is Thorndon, pictured here. Think of the heating bill! No wonder they had to raise the rents of all their tenant farmers!

Next I am going to work on a separate regression for Holkham Hall, a big estate in Norfolk for which Mi has been getting me the data. Holkham is one of the six, so we already have some knowledge of this estate. It is in Norfolk, on the east coast, about a hundred miles north of London.

Family home for the owners of the Thorndon Estate |

One of the six estates is Thorndon, pictured here. Think of the heating bill! No wonder they had to raise the rents of all their tenant farmers!

Next I am going to work on a separate regression for Holkham Hall, a big estate in Norfolk for which Mi has been getting me the data. Holkham is one of the six, so we already have some knowledge of this estate. It is in Norfolk, on the east coast, about a hundred miles north of London.

### Soils

One of the variables that has a considerable impact on yields, both for arable and livestock, is the type of soil. It is clear why for crops such as wheat, but not so obvious for livestock. Don't they just eat grass? Well, yes, but grass won't grow everywhere, or perhaps not as luxuriantly as the livestock would prefer. Malcolm has identified some soil data, and on the map below I've layered the soil data with the 648 parishes. The next step is to add more soil layers (water, slope etc) and then extract the readings for each parish. Then build a model of yields using regression. We'll also add metereological data, and Mi is helping me with that.

Just from a glance at the map you can see that most of our parishes lie on fairly sandy soil (2) while those to the north-east lie on more clayey soil. Usually clayey soil is better for arable. It will be interesting to test this.

You'll see that there is a gap in the extreme south-west. That is the county of Cornwall, which I wasn't originally going to include. But it looks as though the thesis of 'market integration' is going to be something we'll run with, so Malcolm is working on that data now. That will bring our total number of observations to nearly eight hundred.

Just from a glance at the map you can see that most of our parishes lie on fairly sandy soil (2) while those to the north-east lie on more clayey soil. Usually clayey soil is better for arable. It will be interesting to test this.

You'll see that there is a gap in the extreme south-west. That is the county of Cornwall, which I wasn't originally going to include. But it looks as though the thesis of 'market integration' is going to be something we'll run with, so Malcolm is working on that data now. That will bring our total number of observations to nearly eight hundred.

### Pasture yields

This concerns the 'Devon Rents' paper. The analysis we have done for 648 parishes in six counties in south-west England has used only the arable rent and the wheat yield. But many of the parishes were right in the middle of prime sheep and cattle-raising areas, and in fact livestock farming would have been more important in 1836 than wheat farming. I haven't analysed the livestock sector for the parishes because a lot of the data is missing for livestock: we have livestock yields and livestock rents only for one county, Devon. We don't have livestock yields for the other counties. It is very tricky to calculate livestock yields, so that is probably why the Inspector didn't bother. But we do have livestock rents, which for the Inspector would have been much easier to record; he would just have written them down.

So we aren't currently making use of the livestock rent data...which seems a shame. Hate to waste data! This morning I was out running and I thought of a way that we would use the livestock rent data. We know the livestock yields AND the livestock rent for Devon (n =96). Could we use the mathematical relationship that we know for Devon yields and rents to construct a simulation for the other counties? We know the soil, the rainfall, distance to market town----perhaps this is enough to calculate the relationship between rent and simulated yield for our other 600+ parishes? This would be a significant contribution to the literature. And the applications to developing countries are obvious.

So we aren't currently making use of the livestock rent data...which seems a shame. Hate to waste data! This morning I was out running and I thought of a way that we would use the livestock rent data. We know the livestock yields AND the livestock rent for Devon (n =96). Could we use the mathematical relationship that we know for Devon yields and rents to construct a simulation for the other counties? We know the soil, the rainfall, distance to market town----perhaps this is enough to calculate the relationship between rent and simulated yield for our other 600+ parishes? This would be a significant contribution to the literature. And the applications to developing countries are obvious.

## Tuesday, January 11, 2011

### A distance to market mystery!

I have been working on the geographically-weighted regression tool, exploring the spatial distribution of the various explanatory variables in the 'Devon rents' paper. One variable is distance to the nearest market town of the parish. When I was doing 'ordinary' regression this particular variable caused me some grief because it wasn't always significant. It kept changing its sign depending on the size of the dataset. The theory holds that the sign should be negative. Greater distance to market should lower the rent. Below is a map of the signs and coefficients of the market distances for the parishes. Some areas, notably to the west and the northeast (Devon and Herefordshire) are respectably negative, but Somerset and Dorset are positive. How can being

Red and ochre colour are positive for market distance, blue and dark blue are (respectable and what we like!) negatives. I can only think that this result is connected somehow with transport to market over longer distance, or integration into the wider market. If you are putting cattle and wheat onto railways and canals, the distance to the nearest market town doesn't matter. Next step is to control for soil and climatic conditions. Fascinating!

*further*from the market increase your rent?Red and ochre colour are positive for market distance, blue and dark blue are (respectable and what we like!) negatives. I can only think that this result is connected somehow with transport to market over longer distance, or integration into the wider market. If you are putting cattle and wheat onto railways and canals, the distance to the nearest market town doesn't matter. Next step is to control for soil and climatic conditions. Fascinating!

### Five railway 'panels' significant

Malcolm just sent me the lengths of railway track for the fifth estate---Tavistock. Tavistock is away down in the south-west of England in Devon. I was interested to test for this estate because it took longer for the railway to reach down into this relatively remote part of England. Below is a graph showing track mileage for Tavistock and for Dalemain.

I'm pleased to say that the results remain highly significant, regressing rent per acre against wheat and cattle prices, and length of track. This regression is 'longitudinal' over the years 1832-1899 with five 'panels', one for each estate. Next step is to add one further estate (Badminton) and then adjust the track amounts with some new data for 1872. I also want to include some data specific to each estate, such as yields, but there are no consistent and reliable time-series for this. It might be possible to 'control' for climatic conditions, such as average rainfall, but this might not be worth the work. With at least one more estate we already have a good outcome.

Now I am working on the structure of the paper and I will post a draft on Google docs for you to read and comment on very shortly.

Track mileage for Dalemain (in the north-east) and Tavistock (in the south-west) |

I'm pleased to say that the results remain highly significant, regressing rent per acre against wheat and cattle prices, and length of track. This regression is 'longitudinal' over the years 1832-1899 with five 'panels', one for each estate. Next step is to add one further estate (Badminton) and then adjust the track amounts with some new data for 1872. I also want to include some data specific to each estate, such as yields, but there are no consistent and reliable time-series for this. It might be possible to 'control' for climatic conditions, such as average rainfall, but this might not be worth the work. With at least one more estate we already have a good outcome.

Now I am working on the structure of the paper and I will post a draft on Google docs for you to read and comment on very shortly.

### Geographically Weighted Regression

I finally found out how to do geographically weighted regression! This allows us to look at local changes in a statistical relationship. I tried out the technique by regressing the natural log of arable rent against the natural log of wheat yield. The coefficient on wheat yield will give us the elasticity: the percentage that arable rent changes for a certain percentage of wheat yield. Elasticity is a commonly-used measurement in economics....often used for price and demand. So if a 100% percentage decrease in price for some item caused the demand for that item to double (go up 100%) , we could say that the elasticity was one, or unitary. I have used elasticity to try to show how much the landlord keeps from the rent. Here is a map indicating the coefficient for the natural log of wheat yield, or the elasticity. To the west, the elasticity is lowest, but rises gradually as we move to the east, towards London. Just why we should see this very clear trend is highly interesting. Any ideas?

## Monday, January 10, 2011

### Cartogram of arable rents and wheat yields

I have been learning how to use some free software called Geoda. Frankly, it is not easy! So far I have constructed two cartogram of arable rent and wheat yields. Cartograms aren't really maps: they represent quantities of interest. Here they are:

The top one is wheat yield and the bottom one is rent. I find it interesting that the red dots don't match: red represents a large positive outlier, green represents normal. So we have quite a few instances of high rents, but rather fewer of high yields. So some landlords were extracting high rents when the yields were only normal. Now, let's not get too carried away with blaming landlords: there might have been other factors, such as closeness to the market. Anyway that is something to test.

The top one is wheat yield and the bottom one is rent. I find it interesting that the red dots don't match: red represents a large positive outlier, green represents normal. So we have quite a few instances of high rents, but rather fewer of high yields. So some landlords were extracting high rents when the yields were only normal. Now, let's not get too carried away with blaming landlords: there might have been other factors, such as closeness to the market. Anyway that is something to test.

## Friday, January 7, 2011

### The parishes and Thorndon added to panel data

Two advances today:

1. Finally worked out how to join the dataset that Malcolm has been so patiently preparing to the map of the locations of the parishes. The result is below. Looks a bit as though south-west England is suffering from chicken-pox! We have the data on arable rents, yields, distances to nearest market-town and elevations for these parishes, nearly 800 of them. We have already done the basic statistics and found that there is a very strong relationship between rent, yields, distance and elevations. We also found that the elasticity of rent to what the farmer took home increased markedly as we moved east towards London. I don't know why that should be, but I suspect it might be connected to the amount of 'enclosure' that went on in the area. I'll work on that idea. Next step is to use some free software called Geoda to calculate the 'spatial lag', which is the grouping together of rents. Here's the map and then below some notes on Thorndon.

2. The 'railways' paper: Malcolm calculated the amount of track laid on an annual cumulative basis for Thorndon, the fourth of the estates. Thorndon is in Essex, right over on the east coast, not far north of London. As a result, they had railway track early on. I added Thorndon to the other three estates in the panel data set, and I'm delighted to say that the results remain highly significant. It is clear that landlords were extracting the savings from their tenants----but hey! what else is new? Mi has found me useful information on yields which is part of the estate-specific information I will begin adding to the dataset. Think of it this way: we want to isolate the impact of just one factor---track---so we need to hold steady anything else that might have an effect on rents. This is what we can do with panel-data regression and that's why it is such a powerful tool. So much is done with regression....learn it whenever you get a chance. It will be really useful to you. I'll teach you if you like.

1. Finally worked out how to join the dataset that Malcolm has been so patiently preparing to the map of the locations of the parishes. The result is below. Looks a bit as though south-west England is suffering from chicken-pox! We have the data on arable rents, yields, distances to nearest market-town and elevations for these parishes, nearly 800 of them. We have already done the basic statistics and found that there is a very strong relationship between rent, yields, distance and elevations. We also found that the elasticity of rent to what the farmer took home increased markedly as we moved east towards London. I don't know why that should be, but I suspect it might be connected to the amount of 'enclosure' that went on in the area. I'll work on that idea. Next step is to use some free software called Geoda to calculate the 'spatial lag', which is the grouping together of rents. Here's the map and then below some notes on Thorndon.

The 800 (or so!) parishes in south-west England: data from the 1836 Tithe Files |

### Population growths

Mi has helped me to calculate the annual population total for three counties: Cumberland (where the Dalemain estate lies), Norfolk (Holkham Hall) and Sussex (Petworth). There is a graph below. Since the area of the county, didn't change, we can use the population figures as representing population densities. I tried including population density in the panel data regression, but it wasn't significant. I think we'll try the population size of the nearest market town to the estate. That worked well for the 'Devon' data. Take a look at the graph: can you see that the population of Sussex more than doubled in half a century? This huge growth meant many more mouths to feed and so required agriculture to increase its yields. The area of cultivatable land is fixed and so the only way to increase output is to increase the yields. This is a close parallel to the world situation today: food prices are climbing because the global population is swelling. That's one reason why the work we are doing has relevance.

I'll spend today at UBC getting more maps of railway construction: the Victorians helped solve their food problem by transporting agricultural output more efficiently. Lessons to be learned!

I'll spend today at UBC getting more maps of railway construction: the Victorians helped solve their food problem by transporting agricultural output more efficiently. Lessons to be learned!

## Wednesday, January 5, 2011

### Track around 3 estates significant and elasticities

Two interesting advances today:

1. For the 'railways and rents' paper: I have built a 'panel' dataset using annual total railtrack in a 40km radius of three estates: Dalemain, Holkham Hall and Petworth. I regressed the rent against the track, cattle and wheat prices. The three independent variables---track, wheat and cattle---are all significant and with the 'right' signs. This is very satisfying. I'd like to get more data specific to each estate, such as yields etc. Mi is working on this. Soon we will develop a clear picture of how agricultural rents were set in the 19th century. This is something no one has tried before. The techniques will have analytical uses in developing countries where data is --- like 19th century Britain --- a bit sparse.

2. For the 'Devon rents' paper: I built a variable which represents the amount of money a farmer would get after he had paid his farming expenses. I regressed this, together with population of nearest market town and elevation, against rent. The results are highly significant. Then I built four 'windows' moving from west to east, so that I selected only the farms inside the windows. For each window I calculated the 'elasticity', which is the percentage change in rent for a percentage change in farmer's take-home money. I think (!) that this is a measure of the 'surplus extraction' of the landlord: how much he can 'squeeze' out his tenant. What is remarkable is that the elasticity changes as we move east towards London. It more than doubles over two hundred miles. This is a fascinating result, but at the moment I am at a loss as to how to explain it! I have put the elasticities into the relevant counties in the map below....not quite the same as the moving window but I can't see how else to show you.

It has been a good day. Thank you!

1. For the 'railways and rents' paper: I have built a 'panel' dataset using annual total railtrack in a 40km radius of three estates: Dalemain, Holkham Hall and Petworth. I regressed the rent against the track, cattle and wheat prices. The three independent variables---track, wheat and cattle---are all significant and with the 'right' signs. This is very satisfying. I'd like to get more data specific to each estate, such as yields etc. Mi is working on this. Soon we will develop a clear picture of how agricultural rents were set in the 19th century. This is something no one has tried before. The techniques will have analytical uses in developing countries where data is --- like 19th century Britain --- a bit sparse.

2. For the 'Devon rents' paper: I built a variable which represents the amount of money a farmer would get after he had paid his farming expenses. I regressed this, together with population of nearest market town and elevation, against rent. The results are highly significant. Then I built four 'windows' moving from west to east, so that I selected only the farms inside the windows. For each window I calculated the 'elasticity', which is the percentage change in rent for a percentage change in farmer's take-home money. I think (!) that this is a measure of the 'surplus extraction' of the landlord: how much he can 'squeeze' out his tenant. What is remarkable is that the elasticity changes as we move east towards London. It more than doubles over two hundred miles. This is a fascinating result, but at the moment I am at a loss as to how to explain it! I have put the elasticities into the relevant counties in the map below....not quite the same as the moving window but I can't see how else to show you.

It has been a good day. Thank you!

### Distances to Market Decreasing With Longitude

I have been analysing data from about 700 parishes in the southwest of England, looking for patterns of how rent was set in the 1830s. The relationship between arable rent and wheat and barley yields is very strong, as we would expect. I started off the analysis with 96 parishes in Devon, in the far southwest of England. Here the relationship between rent and distance to market is very clear....further from market the lower the rent. Over the holidays, Malcolm helped me to increase the dataset, moving east towards London. The larger dataset was initially puzzling, because distance to market was no longer significant and had a positive sign. This morning I regressed distance to market against longitude and found that distance to market decreases as we move east towards London. In other words, there is a higher population density and so the farmer doesn't have to cart his produce so far to sell it. Here is a scatter plot with the regression trend line:

Now, I'll be the first to agree that this looks pretty much like wasps around a honey jar: BUT the relationship is statistically highly significant although with very little explanatory power (r-squared is small). I think this negative relationship goes towards explaining my initially confusing results. [Obvious when you think about it, which is (probably) what Newton thought after the apple landed on his head. ]

I did some more probing and found that the sign for market distance changes from negative and significant to either positive or not significant at about longitude= - 3.25. This is close to the eastern borders of our two western counties. Next step is to go to the 1841 census files and get population densities for the six counties we have been analysing. Clearly it wouldn't be a huge surprise if distance to market correlated with population density.

Now, I'll be the first to agree that this looks pretty much like wasps around a honey jar: BUT the relationship is statistically highly significant although with very little explanatory power (r-squared is small). I think this negative relationship goes towards explaining my initially confusing results. [Obvious when you think about it, which is (probably) what Newton thought after the apple landed on his head. ]

I did some more probing and found that the sign for market distance changes from negative and significant to either positive or not significant at about longitude= - 3.25. This is close to the eastern borders of our two western counties. Next step is to go to the 1841 census files and get population densities for the six counties we have been analysing. Clearly it wouldn't be a huge surprise if distance to market correlated with population density.

## Tuesday, January 4, 2011

### Dalemain Results

Malcolm has calculated the track for Dalemain, an estate in the north of England noted for sheep and cattle raising. The results look significant: the graph below shows the actual rent and the rent predicted by the regression model. Not bad....but I need to use some deflated prices and more localised variables, such as population density.

### Google ngram as a research tool

There is a highly useful Google tool here:

http://ngrams.googlelabs.com/

which allows you to search for a word or phrase through all those books that Google has been scanning and also track the rise and fall of the word or phrase over time. You can select by language and bracket by years. And at the bottom of the graph there is a clickable link to the original books. Naturally I immediately tried 'agriculture','railway' for 1800-1870 and found some useful source documents. Might help you.

http://ngrams.googlelabs.com/

which allows you to search for a word or phrase through all those books that Google has been scanning and also track the rise and fall of the word or phrase over time. You can select by language and bracket by years. And at the bottom of the graph there is a clickable link to the original books. Naturally I immediately tried 'agriculture','railway' for 1800-1870 and found some useful source documents. Might help you.

### Holkam Track Results

I've done a time-series regression of amount of track within a 40km radius of Holkham Hall against land rent, controlling for the price of wheat and cattle. I deflated the rent and the commodities to adjust for changes in the cost of living. The result is statistically significant, and I'd love to show you the output but can't work out how to paste it into the blog. Track and cattle have positive signs, but wheat is negative. Why should a drop in the wheat price result in increased rent..I don't know yet! I'll get there. The positive sign for Track is what we had expected to see: more track means increased rent. The savings are being extracted by the landowner. Recall that the equation for locational rent is

where m is the market price of the commodity, c the cost of production, E the yield, f the cost of transportation per unit distance and d the distance. As the distance increases, the right hand side gets smaller, so the rent gets smaller. Eventually the rent would be zero right on the edge of the cultivatable land. By increasing the amount of track, in effect the distance is getting smaller...so the rent goes up.

Encouraged by this, we're going to build a larger dataset. Malcolm is calculating the track for three more estates: Petworth, Thorndon and Dalemain. A graph of their rents is here:

You can see that there is a bump in the rents in the period around 1850----what a coincidence! Once we have the track data in, I'll do the same type of regression, but this time it will be a panel-data longitudinal regression. This is a very powerful technique, which I'd urge you to learn if you see the chance.

where m is the market price of the commodity, c the cost of production, E the yield, f the cost of transportation per unit distance and d the distance. As the distance increases, the right hand side gets smaller, so the rent gets smaller. Eventually the rent would be zero right on the edge of the cultivatable land. By increasing the amount of track, in effect the distance is getting smaller...so the rent goes up.

Encouraged by this, we're going to build a larger dataset. Malcolm is calculating the track for three more estates: Petworth, Thorndon and Dalemain. A graph of their rents is here:

You can see that there is a bump in the rents in the period around 1850----what a coincidence! Once we have the track data in, I'll do the same type of regression, but this time it will be a panel-data longitudinal regression. This is a very powerful technique, which I'd urge you to learn if you see the chance.

## Sunday, January 2, 2011

### Track in 40km radius of Holkham Hall

Malcolm has done a wizard job of calculating the amount of track on an annual basis within a 40km radius of Holkham Hall in Norfolk. The graph is here:

Our hypothesis is that the availability of track would reduce the costs of farming to the tenant farmer, but that the landowner would grab the savings in the form of higher land rents---the 'resource extraction' theory.The next step is for me to test this using a time-series regression, controlling for other variables such as the price of wheat and the price of livestock. We need to hold steady the other variables so that we can isolate the effect of the reduced transport cost. Rent is the dependent variable and then the amount of track and the various prices are the independent (or 'explanatory' variables). The equation looks like this:

The regression is a time-series, and so we have to remove the effects of auto-correlation over time. I've omitted all the subscripts for time for clarity. We can use ARIMA for the regression. This is exciting and a great way to spend the holiday! Thanks Malcolm for your speedy work. I'll be back with some statistical output shortly.

Our hypothesis is that the availability of track would reduce the costs of farming to the tenant farmer, but that the landowner would grab the savings in the form of higher land rents---the 'resource extraction' theory.The next step is for me to test this using a time-series regression, controlling for other variables such as the price of wheat and the price of livestock. We need to hold steady the other variables so that we can isolate the effect of the reduced transport cost. Rent is the dependent variable and then the amount of track and the various prices are the independent (or 'explanatory' variables). The equation looks like this:

The regression is a time-series, and so we have to remove the effects of auto-correlation over time. I've omitted all the subscripts for time for clarity. We can use ARIMA for the regression. This is exciting and a great way to spend the holiday! Thanks Malcolm for your speedy work. I'll be back with some statistical output shortly.

## Saturday, January 1, 2011

### New Year's update

So, we have four papers and a book (yes!) to finish off this year. Here is a little run-down job by job:

1. The 'political' paper is under peer review at the moment. Let me know if you want a copy. We showed that there was a strong statitistical relationship between the type of crops grown in a political constituency, the attendance at church of the residents, and how the MP for that constituency voted. The parliament of 1841 was very much about 'church and state' and so these findings make sense. We used some novel statistical techniques to get round the 'missing voter' problem. Many MPs didn't turn up for divisions and so our sample size is small. But we know all we need about the MP except for how he would have voted had he turned up. We want to keep this information, not toss it out. We used Anne Sartori's improvement on the Heckman procedure (he won a Nobel for that). Special kudos to Hugh (Salway) from the University of York in the UK for volunteering a huge amount of time and helping us with hard to get data. Come and visit us, Hugh!

2. The 'Devon rents' paper. Here we are using very old data: from the 1836 Tithe Commission. We find that in the county of Devon, there is an amazingly robust relationship between arable rent, wheat yields, elevation and distance to market town. This fits all the 'locational rent' theories: von Thunen, Ricardo etc. So not content with that, we want to see whether this relationship holds in neighbouring counties. Malcolm has built up a large dataset (n>600) of parishes in six contiguous (look it up!) counties. What is interesting is that some parts of the relationship change as we move east, seeming at first sight to indicate less reliance on local markets as we get closer to London. Makes sense. To test this, we'll be using a relatively new statistical technique called Geographically Weighted Regression. In regular regression, we are trying to estimate some 'global' parameter that fits throughout the statistical population. In GWR, we allow the parameter estimates to vary according to local conditions. I am keen to measure the elasticity between rent and what the tenant farmer could take home to his wife and kids: in other words, were some landlords greedier than others, and if so, why? I am half-hopeful that we'll see the hand of the Anglican Church behind all this, but mustn't get my hopes up.

3. The 'railways and rents' paper. A huge amount of track construction went on in 'our' period, see the graph below. This must have had some impact on farming. James Caird, writing in 1851, makes an intriguing reference to a Norfolk farmer saving four hundred pounds a year ( a lot. You could have bought a Bentley if they had made them then) because his cattle didn't lose weight when they went by rail; when he walked them to market they ended up quite thin. But amazingly there isn't much published on this. And I can see why. Getting the data is like pulling hen's teeth. Malcolm is figuring out the total amount of track in a 40km radius of Holkham Hall, Norfolk for the years 1836 to 1866 on an annual basis. And Mi is scouring the libraries of the world for any sort of references that might help. Here is the graph:.

Growth like this must have had consequences! We hypothesise that the tenant-farmer's savings would have been transferred to the landlord via an increased rent. This is the phenomenon of 'surplus extraction'. Generally you want to be the extractor, not the extracted. But the tenants were in a weak position....so we would expect to see their rents going up in tandem with better transportation. Mi has got us the rents, now we await Malcolm's annual track data. Then I'll use a time-series analysis procedure called ARIMA to test for linkage. If the Norfolk estate gives us a YES, we'll extend the procedure to other estates and use a panel-data approach. Nice cutting edge stuff.

4. The 'supply response' paper. In the 1870s, the price of wheat fell dramatically because those blasted Americans opened up their railroads and shipped in wheat below the domestic price. See the graph of wheat prices below. Quite shocking: halved in price in less than two decades.

Look what has happened: wheat has halved but livestock numbers have shot up. This look like a structural shift in agriculture. But this is at the national level....individual farmers won't all have been able to shift out of wheat and into livestock. We hypothesise that those estates which were more flexible with what the tenants did with their land probably didn't need to drop their rents as much as those estates which were more rigid. Farmers are not (always) stupid and can adapt pretty well to changing market conditions. Different matter if they can't adapt because of regulations on land use. We will test for a 'breakpoint' in rents, and then use that year as an indicator. Going to borrow from medical statistics, normally for use in working out how long a patient has got to live. Again, nice cutting edge stuff.

5. A book! New idea, inspired by Sarah, one of my two fantastic sisters. This is historical fiction, in other words a story based on real people. The working title is 'Breaking Free From Dr Malthus' and the theme is how the heck did the farmers increase yields enough to allow enough folk to escape from the hard scrabble of agriculture to start off the Industrial Revolution. A new format...left hand side page is fiction, the facing right hand side is economic analysis and commentary on the the fiction. Including the highly exciting new field of neuro

1. The 'political' paper is under peer review at the moment. Let me know if you want a copy. We showed that there was a strong statitistical relationship between the type of crops grown in a political constituency, the attendance at church of the residents, and how the MP for that constituency voted. The parliament of 1841 was very much about 'church and state' and so these findings make sense. We used some novel statistical techniques to get round the 'missing voter' problem. Many MPs didn't turn up for divisions and so our sample size is small. But we know all we need about the MP except for how he would have voted had he turned up. We want to keep this information, not toss it out. We used Anne Sartori's improvement on the Heckman procedure (he won a Nobel for that). Special kudos to Hugh (Salway) from the University of York in the UK for volunteering a huge amount of time and helping us with hard to get data. Come and visit us, Hugh!

2. The 'Devon rents' paper. Here we are using very old data: from the 1836 Tithe Commission. We find that in the county of Devon, there is an amazingly robust relationship between arable rent, wheat yields, elevation and distance to market town. This fits all the 'locational rent' theories: von Thunen, Ricardo etc. So not content with that, we want to see whether this relationship holds in neighbouring counties. Malcolm has built up a large dataset (n>600) of parishes in six contiguous (look it up!) counties. What is interesting is that some parts of the relationship change as we move east, seeming at first sight to indicate less reliance on local markets as we get closer to London. Makes sense. To test this, we'll be using a relatively new statistical technique called Geographically Weighted Regression. In regular regression, we are trying to estimate some 'global' parameter that fits throughout the statistical population. In GWR, we allow the parameter estimates to vary according to local conditions. I am keen to measure the elasticity between rent and what the tenant farmer could take home to his wife and kids: in other words, were some landlords greedier than others, and if so, why? I am half-hopeful that we'll see the hand of the Anglican Church behind all this, but mustn't get my hopes up.

3. The 'railways and rents' paper. A huge amount of track construction went on in 'our' period, see the graph below. This must have had some impact on farming. James Caird, writing in 1851, makes an intriguing reference to a Norfolk farmer saving four hundred pounds a year ( a lot. You could have bought a Bentley if they had made them then) because his cattle didn't lose weight when they went by rail; when he walked them to market they ended up quite thin. But amazingly there isn't much published on this. And I can see why. Getting the data is like pulling hen's teeth. Malcolm is figuring out the total amount of track in a 40km radius of Holkham Hall, Norfolk for the years 1836 to 1866 on an annual basis. And Mi is scouring the libraries of the world for any sort of references that might help. Here is the graph:.

Growth like this must have had consequences! We hypothesise that the tenant-farmer's savings would have been transferred to the landlord via an increased rent. This is the phenomenon of 'surplus extraction'. Generally you want to be the extractor, not the extracted. But the tenants were in a weak position....so we would expect to see their rents going up in tandem with better transportation. Mi has got us the rents, now we await Malcolm's annual track data. Then I'll use a time-series analysis procedure called ARIMA to test for linkage. If the Norfolk estate gives us a YES, we'll extend the procedure to other estates and use a panel-data approach. Nice cutting edge stuff.

4. The 'supply response' paper. In the 1870s, the price of wheat fell dramatically because those blasted Americans opened up their railroads and shipped in wheat below the domestic price. See the graph of wheat prices below. Quite shocking: halved in price in less than two decades.

Look what has happened: wheat has halved but livestock numbers have shot up. This look like a structural shift in agriculture. But this is at the national level....individual farmers won't all have been able to shift out of wheat and into livestock. We hypothesise that those estates which were more flexible with what the tenants did with their land probably didn't need to drop their rents as much as those estates which were more rigid. Farmers are not (always) stupid and can adapt pretty well to changing market conditions. Different matter if they can't adapt because of regulations on land use. We will test for a 'breakpoint' in rents, and then use that year as an indicator. Going to borrow from medical statistics, normally for use in working out how long a patient has got to live. Again, nice cutting edge stuff.

5. A book! New idea, inspired by Sarah, one of my two fantastic sisters. This is historical fiction, in other words a story based on real people. The working title is 'Breaking Free From Dr Malthus' and the theme is how the heck did the farmers increase yields enough to allow enough folk to escape from the hard scrabble of agriculture to start off the Industrial Revolution. A new format...left hand side page is fiction, the facing right hand side is economic analysis and commentary on the the fiction. Including the highly exciting new field of neuro

Subscribe to:
Posts (Atom)