2. Vector Data in QGIS
2.1. Introduction
QGIS can open around 54 different file-based data formats (using the OGR library), including ESRI shapefile and MapInfo TAB, and can save around 23 of these. It can also connect to four spatial database sources natively (PostgreSQL, SpatiaLite, Oracle, MSSQL), with others available using ODBC, and to Web Feature Service (WFS) and Point Cloud data among other formats.
The default file format used by QGIS when creating data OGC GeoPackage, a file-based database. For databases the default connection is to PostgreSQL/PostGIS.
Most QGIS functionality is available for all vector data sources, though some formats can only be used in read-only mode and need to be saved to another format for editing.
2.2. File-Based Vector Data
GeoPackage
GeoPackage is relatively new - it was first published as an OGC standard in 2014. It has a .gpkg file extension, and although it can be used for single layers, it is implemented using the SQLite database standard, so can hold multiple layers in a single file as well. It is supported by current versions of most desktop GIS packages, and although it has not been widely used until now, it should be set to take over from shapefile as the default file-based storage and interchange format across applications and platforms. It will hold raster as well as vector data.
To load a GeoPackage layer:
Click Open Data Source Manager
Browse to the .gpkg file you want to open (this will be in .../Data/Vector under root directory for the course)
Double-click or drag and drop to open the layer in QGIS
Layer metadata
Right-click > Properties on the layer in the browser to see its metadata - CRS, number of features, fields and so on.
ESRI Shapefile
Shapefile is a widely-used format, developed by ESRI (used in for example ArcGIS), and is used on many download sites. Shapefiles comprise four or more individual files on disk - all must be available for the file to open.
To filter directory listings to shapefile (or any file vector format):
Click Open Data Source Manager
Click the Vector tab on the left of the dialog
Click the three dots (...) to the right of the Vector Dataset(s) field, then use the dropdown to the right of the File name: field to filter to ESRI Shapefiles (*.shp)
The list of files will be filtered to .shp only
MapInfo
QGIS can open both MIF and TAB files - these will appear in the browser with other file-based data. Note that MapInfo files can contain multiple geometry types, so QGIS may present a dialog with a choice of geometry types - i.e. line, polygon, point - to load. You can filter to MapInfo files in the same way as described for shapefiles above.
Editing MapInfo files
QGIS can edit .tab but not .mif files - if you need to edit these, you will need to save them to a different format (for example GeoPackage), then save back when you are finished.
Other
The process above can be used to load most vector data from file, including for example GML and GeoJSON. QGIS can also open ESRI GeoDatabases, though these will be read-only.
2.3. Spreadsheet Data
QGIS will open an Excel file as vector data, and for the most part correctly recognise data types (for example string, integer and so on). If the data includes coordinates, creating a geometry column which QGIS can use to represent the data spatially requires an extra step, described under Excel below. Alternatively, use the Spreadsheet Layers plugin, or load the data as CSV using the Delimited Text option in the Data Source Manager described in Delimited Text Data below.
Excel
To load data from Excel:
Click Open Data Source Manager
Click Vector and browse to the Excel file
Click Add to add the layer to QGIS
Select the worksheet(s) to add
At this stage, the layer will show in the Layers panel as a table (for example ), but it won't appear on the map. If the data doesn't contain coordinates, this may be what you want. However if the data represents points, and you have an x and y column, you will need to add a geometry column to the data so that QGIS can add it to the map.
In the search bar, search for the Create Points from Table process
Open the process, and complete the dialogue, making sure to set the X and Y fields correctly, and to use the CRS that the coordinate data is in
Click Run
The layer will appear in the Layers panel as Points from table, and the points will appear on the map
Delimited Text Data (e.g. CSV)
If you need more control over the process of loading spreadsheet data, or if it's already held in delimited text such as CSV, you can use the Add delimited text layer tool.
Click Open Data Source Manager
Select the Delimited Text tab to bring up the Delimited Text dialog
Adjust the settings shown below if necessary - note that in this instance, only the Geometry CRS setting was changed from the default - QGIS derived all other settings from the data
Click Add to add the layer to the map
Saving CSV in a vector format
At this point you may want to save the CSV as a standard vector format (for example GeoPackage) - there are some things which can't be done while it's still held as CSV. On the other hand, if you are using a CSV which is regularly updated from an external source, you may want to keep it as CSV, so that QGIS is always looking at the latest version of the data.
If the data has no coordinate data, and is just going to be used in QGIS as a table, check the Geometry definition > No geometry option in the dialog above. This is useful, for example, if there is a layer that needs to be joined.
If the file has a column containing Well Known Text (WKT) data (e.g. "SRID=27700;POINT(507417 163699)" ) instead of coordinates, check the Geometry definition > Well known text option.
2.4. Database Layers
Loading database vector layers follows a slightly different process to loading a file-based layer, as you first need to establish a database connection. The process is similar for all databases, but in this example we will use data in a PostGreSQL database.
Database layers
You can load either spatial or non-spatial data from a database - the non-spatial data will be shown in QGIS as a table, which you can join to spatial data as shown below.
To load data from a PostGreSQL database:
Click Open Data Source Manager
Click the PostgreSQL tab
If you haven't connected to the database before, click New to create a new database connection
Complete the dialog:
Name: your name for the connection
Host: the name or IP address of the server the database is running on
Port: the port the database is running on - by default in PostGreSQL this is 5432, but it could be different
Database: the name of the database you want to connect to
Under Authentication, click the Add button to add your credentials
The first time you do this, you will be prompted for a Master password - make sure this is something you can safely store or remember, as it cannot be retrieved later
Once you have done this, you will be able to set the credentials for this connection:
Click Save to save these credentials, then click Test Connection to make sure the connection is OK
If all is well, click OK, then click Connect to establish the connection with the database
Expand the database schemas, and select a table and click Add to add it to the map
Check Also list tables with no geometry to view non-spatial data tables
Once the data is loaded in the Layers panel, it will behave in the same way as file-based vector layers. You only need to set up the connection once - after that, you can connect to the database from the PostGreSQL table in the Data Source Manager, or from the Browser tab.
2.5. Web Feature Service (WFS) Layers
Data available in OGC Web Feature Service (WFS) format can be loaded in QGIS and used in a similar way to any other vector layer. However as the data is online, an Internet connection is required, although once the connection has been made it can be saved locally. WFS data is supplied as GML, and features can be selected, styled and queried in the same way as any other vector data.
Create a new saved WFS Connection
The first time you use a WFS source, you will need to create a connection:
Click Open Data Source Manager, then the WFS tab
Click New, then in the Create a New WFS Connection dialog, add the connection details.
Name: your name for the connection
URL: the URL of the WFS, for example http://essex.astuntechnology.com/getows.ashx?mapsource=Essex/Inspire_Colchester&TypeName=allotment&version=1.1.0&request=GetFeature&service=WFS
Authentication: complete as required
Click OK.
Connect to the WFS
Select the saved WFS
Click Connect to show the layers available in the WFS
Select the layer(s) required - you can select multiple layers
Click Add to load the layer(s) into the map
Note that with large datasets, it can take some time for the layers to load
WFS Queries
By default, only features within the current view extent will be downloaded - uncheck Only request features overlapping the view extent to load all features from the source
To load only a subset of features (other than by extent), click Build query to bring up the SQL Query Composer, and add a query to select the features you need
Once WFS layers are loaded, they can be managed in the same way as other vector layers.
Transactional Web Feature Service (WFS-T) layers, which are editable, are also supported in QGIS.
2.6. Navigating Vector Layers
There are different approaches to navigating around the layers you have loaded, for example using toolbars, mouse actions and menus - some of the key ones are listed here.
The buttons in the Map Navigation Toolbar will also get you around the map, and change the action of your mouse and cursor - these will help you identify the data and views you need.
View Layer Extents
When a layer has been added to the Layers panel, the features will be visible in the map frame on the right. To see the whole layer in the frame:
Click on the layer to select it
Right-click > Zoom to Layer (or click the icon on the toolbar)
Attribute Table
You can browse the attribute data for each layer by right-clicking on the layer and selecting Open Attribute Table, (or click the icon in the toolbar).
Manage the Layer List
Normally, layers at the top of the list in the Layers panel appear on the map in front of layers further down the list. You can change the order of layers in the list by dragging and dropping layers to a new position.
Group Layers
Layers can be grouped to create a hierarchical layer list, or table of contents.
Click the Add Group button at the top of the Layers panel
Right-click and Rename the new group
Drag and drop layers into the group
Remove Layers
To remove a layer from the list, select the layer, then Right-click > Remove. This does not delete the data, just removes it from the map. However you will lose any styling or queries you have applied to the layer.
2.7. Joining Layers
If you want to use data in your map which has no spatial references, you will need to join this to a corresponding spatial dataset. For example, you might have population data for census areas, and a polygon layer of the census areas themselves, and you want to join the two in order to show the population figures for each areas as a label.
Add the non-spatial data table, and the spatial layer you are joining it to, as described earlier
The new table will appear in the Layers panel, but not on the map
Double-click the spatial layer to bring up the Properties dialog, and select the Joins tab
Click the Add button bottom left to add a join
In the Add Vector Join dialog, use the dropdowns to specify:
Join layer: the layer you want to join to the current one
Join field: the field in the target layer (the one you are joining to) that is going to be used for the join (this might be ID, for example, or name, or in the case below, neighbourhood)
Target field: the field in the current layer (the spatial one) which is going to be used for the join - this must contain the same data as the join field
Click OK and OK again, then check the attribute table of the spatial layer - it will have an expanded number of columns to include the data from the non-spatial table
Join definition
The join definition is saved in memory, and is saved in the QGIS project - it does not affect the source data. It will therefore reflect the current data each time the project is loaded. You can edit the join after it has been created - for example to exclude fields from the joined data - in the same dialog you used to create it.
2.8. Exporting Vector Data
To export a layer or part of a layer:
Right-click > Export > Save Features as from the layer
In the dialog, you can simply save the file as a different name - however you can also use other options:
Format: save the data in a different vector format
File name: browse to the location where you want the file to be saved, and give it a name
Layer name: for a GeoPackage, by default this will be the same as the file name, but you can change it (GeoPackages can contain many layers)
Save only selected features: save a subset of the features
CRS: reproject the data on save to a different CRS
Select fields to export: save a subset of fields from the attribute table
Extent: save only features visible in the current extent
Exporting a joined layer
If you export a layer with a join, the exported data will include the fields in the join - this is a way of creating a permanent new dataset from a joined layer.