Digital Image Classification for Land Use Land Cover (LULC) Assessment - Part 1

1. Raster Images and Digital Imagery

Raster images, also known as digital images, consist of a matrix of individual pixels arranged in rows and columns. Each pixel contains a digital number (DN), which quantifies the intensity of electromagnetic energy detected by a remote sensing sensor at a specific location. This pixel-based structure is essential for remote sensing applications, as it enables detailed statistical analyses at the pixel level across various spectral bands. By leveraging raster data, analysts can examine and interpret patterns, trends, and characteristics within imagery, facilitating the extraction of valuable information for geographic information systems (GIS) and spatial analysis.

 

Sample Raster Image
Figure 1 Sample raster image.
Dr. Qassim Abdullah

2. Digital Image Classification

Image classification involves analyzing each pixel within a raster image and assigning it to a specific land cover category, such as forest, water, agricultural fields, or urban areas. This procedure transforms the raw spectral data collected by remote sensing sensors into practical and interpretable datasets that can be integrated into geographic information systems (GIS) and used for spatial analysis. The overarching aim is to generate thematic maps or information layers that reveal land cover patterns, distribution, and changes, enabling informed decision-making in resource management, urban planning, environmental monitoring, and other geospatial applications.

 

Figure 2 Sample image classification results (Left: Resulted classified image Right: Near Infrared Image before classification)
Figure 2 Sample image classification results (Left: Resulted classified image Right: Near Infrared Image before classification)
Dr. Qassim Abdullah

 

Suggested additional readings on image classification: https://gisgeography.com/image-classification-techniques-remote-sensing/

The process utilizes one or more of the following recognition types:

  1. Spectral pattern recognition: When decision rules are based on spectral radiance characteristics of the scene.
  2. Spatial pattern recognition: When decision rules are based on geometric characteristics of the scene (i.e. shape, size, patterns)
  3. Temporal pattern recognition: uses time as an aid in feature identification
  4. Object-oriented classification: involve combined use of both spectral and spatial recognition

3. Pattern Recognition in Classification

Image classification utilizes a variety of pattern recognition methods to accurately categorize land cover types. These methods include analyzing spectral information, such as pixel intensity values; examining spatial characteristics like shapes and textures within the image; evaluating temporal patterns by observing how pixel values change over time; and employing object-based strategies that assemble individual pixels into coherent, meaningful groups or objects. This multi-faceted approach enhances the ability to distinguish and classify diverse features present in digital imagery.

4. Spectral Signatures

Spectral signatures characterize the statistical properties of a land cover type by examining its response in multiple spectral bands. These signatures typically summarize features such as the average (mean) pixel values, the degree of spread (variance), and sometimes the relationships between bands (covariance). By capturing these patterns, spectral signatures provide a foundation for distinguishing different land cover categories within digital imagery, enabling accurate classification and analysis.

5. Informational vs Spectral Classes

Informational classes are categories defined by the user based on specific interests or objectives, such as types of land cover or land use. In contrast, spectral classes are groups of pixels that have been clustered together solely based on their statistical properties in the image data, without regard to their real-world meaning. One of the primary difficulties in digital image classification is establishing a clear correspondence between these statistically determined spectral classes and the user-relevant informational classes, as the relationship between them is not always direct or obvious. 

6. Spectral Variability

It is common for a single land cover category to exhibit several distinct spectral subclasses within a raster image. This diversity arises from factors such as varying angles of sunlight (illumination), differences in the density of vegetation cover, the presence of multiple species within the same category, and fluctuations in moisture levels. As a result, pixels representing the same informational class can display a wide range of spectral responses, making it more challenging to accurately assign them to the correct category during the classification process. Figure 2 illustrates an example on hierarchy tree of spectral subclasses within an information class.

 

Hierarchy tree of spectral subclasses within an information class
Figure 3 Hierarchy tree of spectral subclasses within an information class
Dr. Qassim Abdullah

 

7. Unsupervised Classification

Unsupervised classification operates by automatically sorting image pixels into distinct groups based on their shared statistical characteristics, without prior knowledge of land cover types. Figure 3. This process relies on clustering algorithms, such as ISODATA, which repeatedly analyze and adjust pixel groupings to improve the internal consistency of each cluster. After the algorithm has established these preliminary clusters, an analyst reviews the results and assigns meaningful land cover labels to each group, linking them to real-world categories. This approach is particularly useful when no reference data is available, but it requires careful interpretation to ensure accurate correspondence between statistical clusters and actual land cover classes.

 

Two band data set of three spectral classes
Figure 4 Example on unsupervised classification.
Dr. Qassim Abdullah

 

Pros:

  • No extensive prior knowledge of the region required
  • Opportunities for human error are minimized
  • Unique classes are recognized as distinct units
  • Logistically less cumbersome

Cons: 

  • Natural groupings do not necessarily correspond nicely with desired information classes
  • No control over the menu of classes and their specific id
  • Spectral properties of informational classes vary over time, and relationships between information and spectral classes change, making it difficult to compare unsupervised 
  • Classes from one image/date to another

8. Supervised Classification

Supervised classification involves utilizing labeled training data from areas of land cover that have been accurately identified on the image. From these reference sites, the classifier computes statistical descriptors—such as means and variances—for each class. These statistics serve as a model to evaluate and assign class membership to every unknown pixel in the image. Among the various supervised classification techniques, the Maximum Likelihood classifier is widely adopted due to its effectiveness at considering both the center and spread of the class distributions when determining the most probable category for each pixel.

Pros

  • An analyst controls the selected menu of informational classes or categories tailored for a specific purpose and geographic region 
  • Tied to specific areas of known identity 
  • Can evaluate results with additional training areas

Cons:

  • An analyst imposes a classification structure on the data (which may not match the natural spectral clusters that exist)
  • training data defined based on informational categories and not on spectral properties (may have important variation in the forest)
  • Careful selection of training areas is time and labor-intensive
  • training areas may not encompass and subsequently represent special or unique categories that don’t fit the information classes

9. Training Data Requirements

To ensure reliable classification outcomes, training sites must be carefully chosen to be internally consistent (homogeneous), well distributed across the study area, and large enough to include an adequate number of pixels that accurately capture the statistical properties of each land cover class. If the training data are poorly selected—such as being too small, unrepresentative, or clustered in a limited area—the resulting classification will suffer in accuracy and may misrepresent the actual distribution of land cover types in the imagery.

10. Classification Workflow

The standard process for image classification generally begins with the development of spectral signatures, where representative samples are selected to capture the statistical characteristics of each land cover class. Following signature generation, the classification algorithm assigns each pixel in the image to the most likely land cover category based on these statistical models. Once classification is completed, post-classification filtering is applied to smooth out noise, reduce isolated misclassifications, and enhance the spatial coherence of the results. The workflow concludes with an accuracy assessment, where the classified map is systematically compared against reference data to evaluate its performance. It is important to recognize that every stage in this workflow—signature development, classification, post-processing, and assessment—carries the risk of introducing errors, which can accumulate and influence the overall reliability of the final classification outcome.

11. Improving Classification Accuracy

Classification accuracy can be further enhanced through several strategies. Segmenting the image into meaningful regions prior to classification helps reduce within-class spectral variability and improves the coherence of mapped classes. Integrating supplementary GIS information—such as elevation data, soil maps, or land use records—provides valuable context that supports more precise class assignments. Utilizing imagery captured at different times (multitemporal data) allows the detection of seasonal or phenological changes, which aids in distinguishing between land cover types that may appear similar in a single image. Additionally, employing sophisticated classification algorithms—including artificial neural networks and fuzzy logic techniques—enables the modeling of complex, non-linear relationships in the data, thereby increasing the robustness and reliability of the classification results.

12. Spatial Resolution Effects

When the spatial resolution of an image is increased, it becomes possible to distinguish much smaller details and individual features within the landscape. However, this enhanced detail also means that elements like shadows, surface roughness, or minor variations in texture are more likely to be captured within a single land cover class. As a result, pixels that are supposed to represent the same class—such as a forest or an urban area—may exhibit greater differences in their spectral signatures. This added variability within the class can complicate the classification process, making it harder to achieve consistent and accurate grouping of similar land cover types across the image.