Thursday, February 24, 2011
Railways and rent
So far we have twelve estates with kilometres of railway track counted up. In the regression we get very satisfactory results, which go to show that railway construction did indeed cut costs for farmers---but those savings were transferred to their landlords in the form of higher rents. However, in my readings I am finding some interesting differences between landlords. Some landlords had political ambitions and wished to be able to direct the votes of their tenants. As a result they under-rented...so the next research step is to try to find out which of our landlords had political ambitions, and then include this variable as a dummy variable. Then we can see the effect of politics on rent.
Wednesday, February 16, 2011
Coastal counties got cheaper wheat
I coded the counties with a dummy variable (0 = interior, 1 = coastal) so that I could test whether there was
any difference in the 'mark-up' according to whether a county importing wheat was on the coast or not. A lot of wheat was shipped by coastal routes because that mode was often easier than using a horse and cart. I want to test for that. So
was the regression equation I used. On the left-hand side we have the dependent variable, the difference in wheat price between any one county and the source of the wheat, which was either Cambridgeshire or Lincolnshire. On the right-hand side is the intercept, then the coefficient for distance between the counties; the coefficient for amount of surplus or deficit in wheat for that county; and finally the dummy variable for 'coastal'.
As you can see from the output below, all the explanatory variables are statistically significant at the 95% level (p < 0.05). The coefficient for distance is positive, meaning that the further away, the higher the price difference. In effect this is giving us the transport rate for hauling wheat. It should be positive, makes sense doesn't it? Surplus/deficit has a negative sign. That means that the more surplus the county has, the smaller the price difference. Again, that makes sense. Why would a county pay more for wheat if they already have lots of it? The coastal dummy is the most interesting. It has a negative sign. So if you lived in a county on the coast, you paid less for wheat "other things being equal". We are 'controlling for" distance and surplus. So if we happened to have two counties, both the same with respect to distance and surplus, the one on the coast would pay less for wheat. Because they got more of their wheat through coastal shipping would seem to be a reasonable explanation! Can you see the power of these techniques? We can learn a huge amount just with a few bits of data scraped off the floor.
any difference in the 'mark-up' according to whether a county importing wheat was on the coast or not. A lot of wheat was shipped by coastal routes because that mode was often easier than using a horse and cart. I want to test for that. So
was the regression equation I used. On the left-hand side we have the dependent variable, the difference in wheat price between any one county and the source of the wheat, which was either Cambridgeshire or Lincolnshire. On the right-hand side is the intercept, then the coefficient for distance between the counties; the coefficient for amount of surplus or deficit in wheat for that county; and finally the dummy variable for 'coastal'.
As you can see from the output below, all the explanatory variables are statistically significant at the 95% level (p < 0.05). The coefficient for distance is positive, meaning that the further away, the higher the price difference. In effect this is giving us the transport rate for hauling wheat. It should be positive, makes sense doesn't it? Surplus/deficit has a negative sign. That means that the more surplus the county has, the smaller the price difference. Again, that makes sense. Why would a county pay more for wheat if they already have lots of it? The coastal dummy is the most interesting. It has a negative sign. So if you lived in a county on the coast, you paid less for wheat "other things being equal". We are 'controlling for" distance and surplus. So if we happened to have two counties, both the same with respect to distance and surplus, the one on the coast would pay less for wheat. Because they got more of their wheat through coastal shipping would seem to be a reasonable explanation! Can you see the power of these techniques? We can learn a huge amount just with a few bits of data scraped off the floor.
Tuesday, February 15, 2011
Difference in wheat market prices by distance
I came across a really special dataset that has wheat (and other grain) prices by the week from 1760 to 1820 for every one of the 40+ English counties. Think of the work some dedicated soul did in copying out all those numbers! I used this dataset in a recent post to show that wheat prices varied quite a lot, especially by 'demand', ie whether the destination market was in surplus or deficit.
Malcolm has kindly provided me with a set of measurments from the two major wheat producing counties (Cambridgeshire and Lincolnshire) to each of the counties. See Malcolm's map on the right. We know the price difference between the wheat exporting county and county of importation; the distance between the two; the mean elevation of the journey, and the standard deviation of the elevation of the journey. The last measurement is to proxy the 'roughness' of the journey. I figured that greater changes in the standard deviation would be more expensive for the horse and cart operators of that time, and so they would increase the prices accordingly. We only have about 35 observations, but the results are interesting. After some experimentation, I found that regressing the natural log of the price difference against the natural log of the distance produced an acceptable result (p=0.02). The scatter plot of log price difference on the x axis with log distance on the y axis is shown here, together with a trend line of predicted values.
If we use logs in the regression, then we also have a measure of the elasticity. The coefficient from the regression is 0.47, which means that a one percent increase in distance increases the price difference by 0.47 per cent. (See how useful logs and elasticities are!). This is quite a lot, and more than I had expected. The other measurements, such as roughness of the journey, didn't seem to matter as much. Now I am going to move on to 'control' for other variables, such as whether or not the county was industrial/agricultural, or on the coast. Why should coastal be interesting? Quite a lot of wheat was shipped by coastal vessels, probably a lot cheaper than by land. More later.
Malcolm's map showing wheat origins |
Malcolm has kindly provided me with a set of measurments from the two major wheat producing counties (Cambridgeshire and Lincolnshire) to each of the counties. See Malcolm's map on the right. We know the price difference between the wheat exporting county and county of importation; the distance between the two; the mean elevation of the journey, and the standard deviation of the elevation of the journey. The last measurement is to proxy the 'roughness' of the journey. I figured that greater changes in the standard deviation would be more expensive for the horse and cart operators of that time, and so they would increase the prices accordingly. We only have about 35 observations, but the results are interesting. After some experimentation, I found that regressing the natural log of the price difference against the natural log of the distance produced an acceptable result (p=0.02). The scatter plot of log price difference on the x axis with log distance on the y axis is shown here, together with a trend line of predicted values.
Regression of log price diff against log distance |
Thursday, February 10, 2011
Wheat price differentials surplus/deficit areas
Earlier I calculated the 'wheat flow' from counties where they grew more than they ate to counties which grew less wheat and more livestock. Recently I found extraordinarily detailed data which lists the wheat price by county only by year but by week within the year! I thought that the difference in wheat price between counties might be explicable just by some function of the distance between them. Turns out not to be so simple as that, but more interesting. The difference between counties is much more pronounced when there is a bad harvest. When the harvest is bad, the wheat exporting counties really take advantage and the prices in the importing counties surge. Sounds like the sort of behaviour we see over tickets to hockey games, doesn't it! So I worked out which counties had the most surplus (Lincolnshire and Cambridgeshire) and which had the biggest deficits (Lancashire and Middlesex). Then I subtracted Lincolnshire from Cambridgeshire...because if my theory os correct, the gap between the two surplus counties shouldn't change much. Then between the biggest surplus county (Lincolnshire) and the highest deficit county (Lancashire). If things are going my way, then the difference between these two should be accentuated in years of bad harvests. Sure enough, the red line really jumps in years when we know from old books that the harvest was poor. The years 1816/17 are a good example of this. 1800 looks dramatic, but we were at war with the French then (remember Napoleon?) and so there were all sorts of other factors involved. These results have a modern-day significance in terms of food security. Especially now as the world looks like it is running out of food. The events in Eqypt were basically triggered by high food prices. Learn from history! Next step is to try to quantify the effect. This is fun!
Tuesday, February 8, 2011
Relationship between flow and price in wheat
Negative relationship between surplus/deficit |
Wheat price betrween farm and market
Wheat movements |
difference in prices between two counties = intercept + beta1*distance
beta 1 would be the transport costs.
(Now, I've linked the word 'regression' here to a Youtube video I've done in case you need a little revision). Take a look!
Remarkable price correlations |
Monday, February 7, 2011
Wheat Prices and Geography
Mean wheat price 1760-1820 |
Sunday, February 6, 2011
Wheat price coordination
Wheat price yearly averages for four counties |
Thursday, February 3, 2011
Sunshine and wheat yields
Malcolm has been plugging away at data from the UK Meteorological Office, calculating mean hours of sunshine in August for our 715 parishes. Now, the data comes from earlier last century and not from 1836, when the wheat yields were recorded, but that’s the best we can do.
August sunshine is a critical factor in wheat yields: best is hot of course, allowing the ear to ripen fully before harvest. I’ve done a regression of wheat yields against hours of sunshine in August, as well as the other data we have: elevation, amounts of rainfall in different groupings. There is a map of sunshine to the right and the regression output is below. The map is interesting, because you can see the parishes in bright yellow, indicating the most sunshine. Along the coasts, and down to the tip of Cornwall. And of course this matches up with where people like to go on holiday. The best wheat yields come from Somerset, inland and with plenty of August sunshine.
Here is the regression output. Elevation has a negative sign, meaning less yield with height. Sunshine is positive...as expected. Rainfall is negative above a certain amount. More than 1000 mm in a year waterlogs the wheat plant. This is not a bad result at all, and it will improve once I get data for soil types and amounts of available water into the regression.
Statement of the theory!
We are trying to apply a combination of von Thunen’s rent and distance theory [1] and Ricardo’s rent and land fertility theories, using Dunn’s equation[2] from 1954:
where R is the rent for crop j at location i. E is the yield of crop i, and a is the production cost. k is the cost per unit distance of transporting crop j and d is the distance from the farm to the next commercial transit point (let’s call it CTP). P is the price of the crop at the market. This might be the mill, or a dealer’s yard---wherever the production next changes hands and where its value needs to be calculated.
The rent right next to the CTP will be highest because the distance is shortest. In the same way there will eventually be a ‘margin of cultivation’ where the cost of transport exactly equals the revenue from production. So at this point the rent will be zero. Farmers will bid for the use of the land at any point along a bid curve we construct just by joining up rent at the CTP and rent at the margin of cultivation:
Farmers won’t bid for land if the rent is set at any point to the right of the red line: it is too expensive. Likewise, landowners won’t accept any offer to the left of the red line, because they think they can get a better offer.
In the empirical testing we have been doing, we found that Dunn’s equation worked very well for Cornwall, Devon and Dorset. So far so good. But when I tried to include counties to the east, such as Somerset, the results were no longer statistically significant. In particular, the regression results gave a POSITIVE sign for Ekd, on the right hand side whereas it should (the theory goes) be negative.
In the literature I found several theoretical reasons for why this might happen. First: worker wages are less further out from the CTP. So this would mean a reduction in ‘a’ in the equation, making the net revenue greater the further away from the CTP. (We see this happening in contemporary business: that’s why firms ‘outsource’ to China and India). Second: the perception of the farmer as to future harvests might differ from place to place. If he/she thinks that the weather will make yields highly variable, then he/she probably won’t bid as much for the land [3]. This second reason is why I have been getting you working on meteorological data these last few weeks.
But actually----I think the solution is simpler (they usually are). I have been mis-specifying the model. Instead of just distance to nearest market town (which worked well enough for Devon), I should have been thinking about the ‘connectivity’ of the towns and villages. The network density of the county/area in which the farm was. If it was highly interconnected with several different routes to the next town, then we would expect the rent to be higher, because the flow of goods in any direction would be less. Just have to figure out how to do that. Less reliance on distance to market town and more on how the towns were all connected together.
[1] von Thunen, J.H. Von Thunen's' Isolated State': An English Edition Pergamon, 1966.
[2] Dunn, E.S. The location of agricultural production University of Florida Press, Gainesville, FL, 1954. [3] Cromley, R.G. "The von Thünen model and environmental uncertainty." Annals of the Association of American Geographers 72 (1982):404-10.
Malcolm's comment on Cornish railways
Old Cornish tin-mine pumping station (Wikipedia, Tim Corsler) |
Wednesday, February 2, 2011
Market distance signs: a clear break
I used geographically-weighted regression to get location specific coefficients for market distance in the regression: Rent = wheat yield + market-town population + distance to market-town. As I have gone on and on about before, the theory is that there should be a negative sign for distance, because the further you are from your market, the more it costs you to truck your produce in on market day. The fact that this sign changes has been perplexing me...and there is something interesting going on here. Look at the map below. Can you see that the locations to the west of that thick red line have a negative sign, those to the right a positive sign? I did put a marking on the map but it is a little faint. Wow! Such a clear boundary. But why is it there and not a few kilometres to the west or east for that matter? What are factors that decide the location of this line? Interesting!
Mountains and food security
Just been reading a really interesting article about food supply chains in mountainous areas. The ideas behind the article...in mountainous areas you rely more on your neighbours etc, could apply to any time period. The article is a bit light on theory and math, but it gave me the idea that perhaps Devon was unusually reliant on local food markets. That is why the sign for distance to market is so resolutely negative. So it is not Somerset that is the odd county out for having a positive sign for distance to market....it is Devon because Devon was a bit cut off (in more ways than one) and relied on its local food markets. Here is a map of population density and slopes. The population density I got by looking at the 1831 census and then dividing the number of inhabitants by the area of the county. Having less than one person per square kilometre seems unimaginably empty to us now.
You can see how fewer people and big slopes go together. So in Devon they would have felt pretty isolated, and so relied on their local market towns. I think I'll work on this theory for a bit. I've circled Devon in the map of southwest England below.
You can see how fewer people and big slopes go together. So in Devon they would have felt pretty isolated, and so relied on their local market towns. I think I'll work on this theory for a bit. I've circled Devon in the map of southwest England below.
Density (in green) and slope (red for small, blue for big) |
Tuesday, February 1, 2011
Water and wheat yield
I've been mapping topsoil water availability throughout the year and wheat yield. The relationship is clear: more water gives more yield. A more reliable indicator than rainfall. Here is a map with high water availability matched by high wheat yield.
Subscribe to:
Posts (Atom)