Tragsa has developed the CAPAS service which integrates multi-sectorial data for better Common Agriculture Policy (CAP) funds assignments to farmers and land owners. Several external datasets – as LiDAR , Copernicus Sentinel2 and Protected Sites from the Spanish Environment and Agriculture Ministry, among others- has been used to improve the Spanish Land Parcel Identification System.
Products from LiDAR data
LiDAR files are a collection of points stored as tuples which represent longitude, latitude, and elevation. This data is provided by the Spanish National Geographic Institute (IGN). This data was processed using automatic algorithms to detect landscape elements (copses and isolated trees) within agricultural parcels.
Protected sites and ecological value report
On one hand, the density of isolated trees and the presence of copses were evaluated with the Landscape Elements Value. On the other hand, the presence/absence of protected areas that intersects subplots was evaluated with a score named Protected sites Value. The result of the sum of Protected Sites Value plus Landscape Elements Value is an Ecological value.
The full description of these products and how they were generated and their validation is explained here.
Products from Sentinel2 data
The Sentinels are a fleet of satellites for land monitoring which is part of the European Copernicus program. The products generated from satellite data were explained in a previous blogpost.
Every week, the images with low cloud cover percentage were downloaded and processed to generate three single products (true colour image, false colour image and NDVI). For the pilot area, Castile and Madrid regions, a total amount of 168 tiles were processed during the year 2017 (until the 31st of August). The irrigation maps were generated in two pilot areas. They were evaluated and they proved to be helpful to identify the crops in control tasks.
Other products generated by CAPAS have been used to update the LPIS database. For example, the grassland layer displays actual grassland areas. The change detection layer highlights the changes happened since the last updating of LPIS and it is focused in changes between agricultural land, forests, and grassland areas.
Many innovative products were generated by CAPAS business case leveraging previously under-used data. The different methodologies and derived products proved a high success ratio after several tests and all the resulting data can be obtained and visualized on the ProDataMarket platform.
During a project meeting in Sofia on September 21, 2016, Cerved teamed up with TRAGSA to brainstorm ideas of re-using the TRAGSA methods for processing satellite imagery to analyse green areas in urbanized cities.
Fundamentals of Tragsa Processing
A common feature in Vegetation Spectra is the high contrast observed between the red band and the Near Infrared (NIR) region. The optical instrument carried by Sentinel 2 satellites samples 13 spectral bands, including high resolution bands in the red (bands 4, 5 & 6) as well as bands in the NIR (8 & 8A). Refer to this blog post for more details about processing Sentinel 2 data.
Using the TRAGSA methodology it is possible to isolate and enhance the vegetation, to locate green areas in urban areas. Green areas are important input to the Cerved’s innovative real estate evaluation model (which is being developed within one of the Cerved’s business cases in the project, as introduced in this blog post). Cerved uses open data, to generate indicators of green areas defined for the model: green area coverage and distance to the wood. Operations that Cerved performs to compute these indicators are similar to those that TRAGSA does on satellite data, such as clustering of green areas into big areas and isolating trees and group of trees. This motivated us to experiment with satellite data and TRAGSA’s methodology, to see whether we could potentially use more complete, structured and up-to-date source of green areas information as input to our real estate evaluation model.
We identified a highly urbanized Italian city but with particular attention to green areas, which is the city of Turin.
The steps that we followed:
extraction of city boundaries of Turin in GeoJSON format by SPAZIODATI
selections of good quality imagery for Turin from the Sentinel data repository by TRAGSA
processing S2 imagery in order to get a vector layer which indicates the presence or absence of a green area in each pixel (1/0) by TRAGSA
display of the green areas of the tiles (see the screenshot below) prototype Amerigo visualisation service, under development by SPAZIODATI
data processing and aggregation of the tiles into census cells areas, in order to develop green areas indicators for each census cell, by CERVED
integration and testing of the score dedicated to green areas within the business model CCRS (Cerved Cadastral Report Service) by CERVED
The result of this experiment was extremely surprising; the detail and accuracy of this new score in identifying the green areas (not only public green areas) is far greater than accuracy of the other scores, developed on public and open green areas of datasets.
TRAGSA, as a business case provider in the project, is developing the CAPAS service which aims at publishing and integrating multi-sectorial data from several sources into an existing data-intensive service, targeting better Common Agriculture Policy (CAP) funds assignments to farmers and land owners. The goal is to leverage the data integration facilities offered by proDataMarket, to better define the funds assignments features in parcels and subplots.
CAPAS is working on an improvement of the efficiency and competitiveness of the existing Spanish CAP (Common Agriculture Policy) service by integrating more datasets, underused at the beginning of the proDataMarket project. To use them as a powerful tool, it was necessary to create and develop new data processing algorithms. Therefore, CAPAS is not only an end-user application. Indeed, it involves data collection, data modelling and data processing techniques.
The CAPAS Business Case is oriented towards the replacement of human-generated (subjective) data with more objective data that can be collected and integrated from different cross-sectorial sources in an automated way.
At least two external datasets (LIDAR and Copernicus SENTINEL2) are being used to improve the agricultural cadastre Spanish database. The economic value generated by this process and its relation to CAP funds assignment will be evaluated during the next year, in the final phase of the project.
Managing LIDAR data
LIDAR files are a collection of points stored as x, y, z which represent longitude, latitude, and elevation, respectively. This data is hard to process for non-specialists. To use them as a powerful tool to define objectively the parameters of agricultural use of parcels and the presence of landscape elements, a new data processing and treatment algorithm has been created.
This algorithm classifies and groups the cloud of points in order to simplify the huge amount of data. The clouds of points are topologically processed to obtain connected areas as polygons or to maintain them as single points. In conclusion, LIDAR datasets are transformed into new raster and vector files, more popular data types, and easier to be dealt with. The overlaps and intersections of the new datasets produced (as Landscape elements) will define the CAP parameters for a specific subplot or parcel.
Managing Satellite data
The Sentinels are a fleet of satellites designed specifically to deliver the wealth of data and imagery that are fundamental to the European Commission’s Copernicus program. The use of satellite images in CAPAS has already been explained in this blog entry.
Description of the source datasets and result dataset
The main source datasets of Business Case CAPAS and main processes used to obtain output datasets are explained below:
LIDAR files can be available under two different formats: .las and .laz. The LAS file format is a public file format commonly used to exchange 3-dimensional point cloud data between data users, being LAS just an abbreviation of LASER. LAZ files, due to the big size of LAS files, is the zipped version of the LAS format.
Although developed primarily for exchange of LIDAR point cloud data, LAS format supports the exchange of any 3-dimensional x,y,z tuples. This format maintains information specific to the LIDAR nature of the data while not being overly complex.
In the context of the ProDataMarket Project, LAS files used in the CAPAS business case will just be a collection of points (latitude, longitude, elevation).
The information to be used in CAPAS business case is the Image Data (JPEG2000) provided by Copernicus at Sentinels Scientific Data Hub (https://scihub.copernicus.eu/). The description of JPEG2000 format is beyond the aim of this blog entry but some general ideas will be described.
The following data workflow, as shown in the diagram below, illustrates the evolution of the different datasets, their transformations and their integration to generate the final result datasets.
The Grouping process gathers the LIDAR points using the following rules:
Errors, noise and overlaps are not taken into account (Classifications 1, 4, 7 and 12). As a consequence, more than 50% of points are removed from the process.
Soil, water and buildings have their own groups
Classification 19 is considered as short trees
Classification 20 is considered as medium trees
Classification 21 are 22 are grouped as tall trees
The result of this process is still a LAS file. The following image shows how LIDAR points (green points) have been processed and classified (Green points as trees, red points as soil, orange and yellow as bushes).
The next steps, such as Rasterization or Vectorization, involve topological rules in order to group the points to generate squares (raster) that would be processed to obtain the final vector shapefile.
The following image shows how LIDAR points have been grouped to create topologically connected surfaces. In the image below, yellow areas are Soil, orange are Bushes, green are Trees. Grey areas and blue surfaces (not present in this image) are Buildings and Water, respectively.
Once the trees class is defined in a raster format by LiDAR data, it wasrefined thanks to Sentinel Data which has more updated information. RGB and NDVI products help to identify which pixels have an NDVI value over 0.5 and it could be detected by RGB product in order to check which pixels represent vegetation areas.
Finally, trees auxiliary layer refined by Sentinel is processed to obtain different configurations:
The final result of the process is a vector ESRI shape file, where the copses layer is a polygon feature type and the isolated trees layer is as point feature type. All of them have a direct correspondence with the landscape elements.
The overlaps between detected landscape elements, currently protected sites of Natura 2000 network and the Land Parcel Identification System allows performing an accurate ecological value report for Spanish crops areas.
LiDAR algorithm allows to obtain more detailed information because the landscape value helps to identify which subplot has more value per parcel, obtaining the following benefits:
Farmers will get an economical profit through fund-assignments to maintain these trees forms, and
the ecosystem and its species will be preserved.
This Ecological value report has been developed regarding the following queries:
Query 1: Surface of Sites of Community Importance (LIC) / subplot area.
Score between 0 and 1.
Query 2: Surface of Special Protected Areas for Birds (ZEPA) / subplot area.
Score between 0 and 1.
Query 3: Protected Sites Value = Sum of query 1 + query 2. Score between 0 and 2.
Query 4: Number of Isolated tree / subplot area. Score between 0 and 1.
Query 5: Surface of copses area / subplot area. Score between 0 and 1.
Query 6: Landscape Elements Value = Sum of query 1 + query 2. Score between 0 and 2.
Query 7: Ecological Value = Sum of query 3 + Query 6.
Sentinel Products generation
In the first place, Sentinel 2 (S2) imagery has to be downloaded from the ESA server. In the automatic download process developed, selection parameters were incorporated in order to download only the imagery that satisfies our quality criteria. Two kinds of products are generated from S2 imagery.
Simple products: Those which have been generated with one-date imagery. By an automatic process, TRAGSA is generating RGB products for supporting photo interpretation. Another simple product generated is the Normalized Difference Vegetation Index (NDVI) which is widely used for vegetation monitoring.
Complex products: Those which are generated with imagery from different dates. The following four thematic layers are going to be created.
Permanent grassland: This layer will be useful to determine photosynthetically active vegetation and non active (unproductive or bare soil) areas. Therefore it will help to monitor the maintaining of existing permanent grassland, which is an agricultural beneficial practice for the climate and the environment (REGULATION (EU) No 1307/2013).
Herbaceous and woody crops: By using decision algorithms, different crops can be identified. The results will be displayed in two different layers, one for herbaceous crops and other for woody crops.
Change detection layer: This layer will highlight areas where changes have happened. The layer will be focused on forests and grassland areas in order to detect dramatic changes, such as those caused by logging or forest fires, as well as to detect more subtle changes associated with AIS (Alien Invasive Species), diseases and reforestation.
Hitherto, only one of the twin S2 satellites (Sentinel 2A) has been launched. When the second satellite (Sentinel 2B) is on orbit, the revisit time at the equator will be 5 days which results in 2-3 days at mid latitude. This high revisit time will offer a quicker updating of SigPAC database in comparison with current updates that are based on low precision data (LANDSAT and SPOT5 satellites) or ortophoto flights generated by each Autonomous Community.
As stated previously, Common Agriculture Policy funds Assignments Service (CAPAS) is a set of tools that improves the existing Common Agriculture Policy service (CAP), in order to innovatively manage and upgrade the CAP database provided by Spanish Administration to farmers and land owners. It is important to note that this CAP database is one of the main pillars of the CAP funds calculation systems. As mentioned earlier, the improvement process is based on the leverage of new cross-sectorial data sources from different fields and geographical areas, and the result datasets will be also available at the proDataMarket marketplace.
To use these new datasets as a powerful tool to define objectively the parameters of agricultural use of parcels, presence of landscape elements or temporal evolution of crops, the explained data processing and treatment algorithms have been, at the moment, partially developed.
As a summary, the usage of LIDAR files modifies some Parcel and Subplots features, and SENTINEL images will improve the definition of Parcel and Subplots land use and its temporal evolution.
The new datasets produced by CAPAS using those external sources will be RDFized and incorporated to proDataMarket platform. Therefore, Spanish rural property data, improved using new and underexploited datasets, will be accessible through proDataMarket platform providing the users with advanced visualization and querying features.
 JPEG 2000 (JP2) is an image compression standard and coding system. It was created by the Joint Photographic Experts Group committee in 2000