Quantification of Colon Content from MRI Using a Semi-automatic Approach Group 774

Worksheets for the paper:
Quantification of Colon Content from MRI
Using a Semi-automatic Approach
Group 774
December 21st, 2012
School of Medicine and Health
Aalborg University
Worksheets for the article:
Quantification of Colon Content from MRI Using a
Semi-automatic Approach
1. semester master, Biomedical Engineering and Informatics
Group 774
Authors
Astrid Munk
Thomas Holm Sandberg
Søren Sohrt-Petersen
Daniel Vyskovsky
Lin Wang
Supervisors
Lasse Riis Østergaard,
Associate Professor
Jens Brøndum Frøkjær,
Staff Specialist and Head of Research
50 pages
8 copies
Number of Pages:
Number of Printings:
21st of December 2012
The content of this report is freely available, but publication is allowed only with complete reference.
Abstract
Introduction: Constipation is a common problem in the Western world [1]. No golden standard to
assess constipation exists [2]. The aim of this paper was to develop an algorithm for semi-automatic
segmentation of the colon content from magnetic resonance imaging (MRI) in order to estimate
the volume of the colon content.
Methods and Materials: T1 weighted MRI data from liver acquisition with volume acquisition
(LAVA) sequence was used. The initial seed points for region growing were selected using a semiautomatic method based on fuzzy c-means clustering. Growing criteria for a seeded region growing
featured edge detection and intensity thresholds. Total and partial volumes of the segmentation
were calculated. The accuracy was validated by comparison to manual expert segmentations using
the Dice Similarity Coefficient (s) (7.1). The reliability was validated by segmenting continuous
scans.
Results: Five series were segmented with total volumes of 0.8667, 0.0772, 0.6542, 0.8799, and
0.6772 litres, respectively. The comparison with ground truth produced Dice coefficients of s = 0.40
and s = 0.63, respectively. In single coronal slices Dice coefficients of s = 0.60 − 80 were obtained.
The reliability of the algorithm was expressed in a standard deviation of σ = 0.0234 litres from a
mean of µ = 0.5394 litres.
Discussion: The proposed algorithm shows successful and robust results in segmenting bright
colon content. The algorithm proposes a way to semi-automatically quantify volume of colon
content from MRI. Segmentation results show differences between automatic segmentation and
ground truth. Differences are mainly due to the nature of the algorithm being able to segment
content with differences in contrasts between bright colon content and surrounding tissues. Manual
outlining of colon content integrates prior anatomical knowledge. This is not incorporated in
the algorithm that relies on universal segmentation criteria. This results in some parts of the
segmentation leaking to unwanted areas. The empirically selected criteria could be learned by
training the parameters of the criteria. The proposed algorithm may have a clinical application by
helping more objectively assess constipation in patients.
Astrid Munk
Thomas Holm Sandberg
Søren Sohrt-Petersen
Daniel Vyskovsky
Lin Wang
Preface
This is the worksheets describing the development of an algorithm for quantification of colon content
from magnetic resonance imaging (MRI) using seeded region growing. The algorithm utilises acknowledged methods from the fields of image analysis and pattern recognition, and is able to segment colon
content and calculate one total volume and two partial volumes.
The project uses MRI data of human patients acquired from Department of Radiology, Aalborg Hospital. The project group would like to thank Staff Specialist and Head of Research Jens Frøkjær for
providing the MRI data and manually segmenting two series of MRI.
Present project suggests an algorithm for quantification of colon content from magnetic resonance
imaging. In order to provide the reader with requisite knowledge to understand the underlying construct in this project, terms as constipation, diagnostic criteria, and the anatomy of the colon are
described in Chapter 1. Afterwards a review on medical imaging techniques of the colon follows in
Chapter 2. Finally, the aim of this project is set in terms of a problem statement in section 2.3.
The considerations and complications that came up during the development of the proposed algorithm is described in Chapter 3-6. Chapter 3 contains an overview of the image analysis system and
a description of the data set. Chapter 4 deals with an initial segmentation, and Chapter 5 is about
the actual segmentation. In Chapter 6 parameters for clinical use are extracted, while Chapter 7
contains the results of the segmentation and a validation of the algorithm with regard to accuracy and
reliability. Finally, a discussion on the segmentation and other issues is conducted in Chapter 8.
The term colon content is used throughout this project to describe solids in the colon. Colon content
comprises chyme in cecum and colon ascendens, while in the colon transversum and colon descendens
the content is faecal.
v
Contents
Contents
vi
List of Figures
ix
List of Tables
xi
1 Constipation
1.1 Anatomy . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2 Medical Imaging of the Colon
2.1 Radiography . . . . . . . . .
2.2 Magnetic Resonance Imaging
2.2.1 MRI Parameters . . .
2.2.2 MRI Artifacts . . . . .
2.3 Problem Statement . . . . . .
1
2
.
.
.
.
.
7
7
9
10
11
11
3 Image Analysis System
3.1 Data Set . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
13
13
4 Initial Segmentation
4.1 Purpose . . . . . . . . . . . . . . . . . . . . . . .
4.2 Different Methods for Initial Segmentation . . . .
4.2.1 Filtering . . . . . . . . . . . . . . . . . . .
4.2.2 Region Filling . . . . . . . . . . . . . . . .
4.2.3 Edge Detection . . . . . . . . . . . . . . .
4.2.4 Global Thresholding Based on Intensities
4.2.5 Manual Intervention . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
15
15
15
15
15
15
17
21
5 Segmentation
5.1 Purpose . . . . . . . . . . . . . . . . . . . . . .
5.2 Texture analysis . . . . . . . . . . . . . . . . .
5.2.1 Micro-features . . . . . . . . . . . . . .
5.2.2 Co-occurrence matrix . . . . . . . . . .
5.2.3 Difficulties Regarding Texture Analysis
5.3 Region Growing . . . . . . . . . . . . . . . . . .
5.3.1 Common Approaches . . . . . . . . . .
5.4 Region Growing Algorithm . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
23
23
23
23
24
24
24
25
25
6 Post Processing
6.1 Purpose . . . . . . . . . . . . . . . . . . . . . . . .
6.2 Estimation of Cross-sectional Areas . . . . . . . . .
6.2.1 Approach . . . . . . . . . . . . . . . . . . .
6.2.2 Difficulties Regarding Cross-sectional Areas
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
29
29
29
29
32
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
vi
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
CONTENTS
6.3
.
.
.
.
.
.
.
.
.
32
32
33
34
34
35
36
37
37
7 Results and Validation
7.1 Results from Segmentation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.1.1 Test of Accuracy: Comparison to ground truth using Dice Similarity Coefficients
and a Qualitatively Validation . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.2 Results from Volume Estimation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.2.1 Test of Reliability: Comparing Consecutive Scans of one Subject . . . . . . . .
7.2.2 Computational Times . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
39
39
8 Discussion
8.1 Segmentation . . . . . . . . . . . . . . . .
8.2 Automatic Versus Manual Segmentation .
8.3 Sensitivity to Selection of Seed Point . . .
8.4 Optimising the Algorithm for Clinical Use
45
45
46
46
46
6.4
6.5
6.6
Diameter of Inner Joined Spheres . . . . . . . . . . . . . . . . .
6.3.1 Approach . . . . . . . . . . . . . . . . . . . . . . . . . .
6.3.2 Difficulties Regarding Diameter of Inner Joined Spheres
Model-based Approach . . . . . . . . . . . . . . . . . . . . . . .
6.4.1 Approach . . . . . . . . . . . . . . . . . . . . . . . . . .
6.4.2 Difficulties Regarding the Model Based Approach . . . .
Downgrading the Complexity of the Measurements . . . . . . .
Estimation of Volumes . . . . . . . . . . . . . . . . . . . . . . .
6.6.1 Approach . . . . . . . . . . . . . . . . . . . . . . . . . .
Bibliography
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
39
42
43
44
49
vii
List of Figures
1.1
1.2
1.3
1.4
2.1
2.2
2.3
2.4
2.5
3.1
3.2
3.3
4.1
4.2
4.3
4.4
4.5
4.6
4.7
4.8
5.1
5.2
5.3
The human colon comprises four parts; colon ascendens, colon transversum, colon descendens, and colon sigmoideum. The transitions between the ascending, transverse, and
descending segments are flexura coli dextra and flexura coli sinistra, respectively [3]. . . .
Interpatient variations in shape and location of colon. The variation in distribution, shape,
texture, location and intensities of colon content in between patients are high, and even
within patients, due to time of scan, degree of constipation, and type of food intake [4, 5].
The colon wall [6]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Peristalsis something something. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
3
4
5
A speeding electron hits a tungsten atom. Energy is released in form of a X-ray photon [7]. 7
Abdominal Radiography. The air bubbles in the colon appears as black spots [8]. . . . . .
8
Precession: A rotation of an axis on rotation [9]. . . . . . . . . . . . . . . . . . . . . . . .
9
Left: Gradient coils creating the static fields [10]. Right: RF coil used for abdominal MRI
[11]. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
Differences in signal intensity between water and fat with varying repetition time, TR [12]. 11
Traditional division of an image analysis system [13]. The components are processed in the
following chapters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Two coronal MR images from a LAVA series. The colon is labelled with green arrows while
artifacts are labelled with red arrows. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Axial representation of the data set. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Left: Edge detection using a Sobel kernel. Right: Edge detection using a Gaussian smoothing filter and a Laplacian filter kernel. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Edge detection using a more sophisticated method, the Canny edge detector. . . . . . . .
Left: Coronal slice from a LAVA series. Right: The associated histogram. . . . . . . . . .
Left: Cropping colon content in a coronal slice. The cropped image is labelled with white.
Right: The histogram from the cropped image. . . . . . . . . . . . . . . . . . . . . . . . .
Histogram of one complete series. The lines indicate the cluster centers, and the calculated
threshold. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Above: Histogram of original image (coronal slice of data set). Below: Histogram of liver
tissue. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Left: True positives from FCM using 7 clusters. Right: True positives from FCM using 5
clusters. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Left: Candidate areas for initial seed point selection based on FCM clustering and morphological filtering. Right: Areas of colon content inside the manually drawn region of
interest. These areas are used for final seed points for region growing. . . . . . . . . . . .
Co-occurrence matrix, [14] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4- and 8-connected neighborhoods for region growing [15]. . . . . . . . . . . . . . . . . . .
Crop of the right flexure at the transition from the ascending to the transverse colon. The
contrast between bright colon content and the colon lumen and surrounding tissue is quite
clear. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ix
13
14
14
16
17
18
18
20
21
21
22
24
25
26
List of Figures
5.4
5.5
Crop of the resulting gradient filled image after applying a sober filter and morphological
flood-fill operation. The colon content of areas of high gradient peaks results in high
contrast between content and surrounding tissue. . . . . . . . . . . . . . . . . . . . . . . .
Segmentation after region growing has been applied. The pink color shows the initial
seed points segmented based on FCM clustering, and the blue color shows the expanded
segmentation from region growing. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.1
27
28
Cross-sectional area of a component (yellow). The centroid is marked with a dot, and the
direction of the component is marked with an arrow. . . . . . . . . . . . . . . . . . . . . .
6.2 The direction of each component depends on the components of the eigenvector having the
largest absolute eigenvalue: When the absolute value of the component in the x-direction
was largest, the component was classified into the transverse segment (red). Conversely,
when the absolute value of the component in the y-direction was largest, the component
was classified into either the ascending or descending segments (purple). . . . . . . . . . .
6.3 Segmented colon content. The red lines indicate the plane through each component. . . .
6.4 Axial view of a disk (blue) in a component (not shown). The interslice area is calculated
assuming that the colon does not change in shape in between the slices. . . . . . . . . . .
6.5 Left: Distance map of the segmented colon content. Right: Enlarged segment of the
distance map. Peak points are brightest resulting in a rough skeleton of the shapes. The
values of the distance map (not shown) are radii of inner joined spheres. . . . . . . . . . .
6.6 From left to right: Different estimates of the diameter of the colon content. 1) A reliable
estimate. The diameter of the inner sphere is close to that of the colon content. 2) Less
reliable estimates. The irregular shape limits the size of the diameter of the inner sphere.
3) Not reliable estimates. 4) Less reliable estimates. If this is the situation in one of the
flexures, the reliability might increase. . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6.7 Since the components of the flexures can be composed of both ascending/descending and
transverse part, the majority of the δ points will be located in the flexures. In these cases
the distance map will lack reliability as an estimation of the diameter of the colon content.
6.8 Estimating a rough distribution of the colon using lines created by choosing flexure points.
6.9 Two dividings of colon according to different placements of corner points. . . . . . . . . .
6.10 Problematic position of transverse segment for use of this method. . . . . . . . . . . . . .
34
34
35
36
7.1
7.2
7.3
7.4
7.5
7.6
7.7
7.8
39
40
40
41
41
42
43
44
Manual segmentation in ImageJ indicated by yellow. . . . . . . . . . . . . . . . . . . .
s values in each slice, series 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
s values in each slice, series 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Bright intensity terminal ileum segmented as part of colon content . . . . . . . . . . . .
Manual (green) and automatic (red) segmentation resulting in s = 0, 74 . . . . . . . . .
Manual (green) and automatic (red) segmentation resulting in s = 0, 39 . . . . . . . . .
Total volume segmented from data set # 5 . . . . . . . . . . . . . . . . . . . . . . . . .
A leak in the right side of the patient was seen in all 3 series used to test the reliability.
x
.
.
.
.
.
.
.
.
29
30
31
32
33
33
List of Tables
4.1
*) The FCM clustering function in Matlab has a (default) maximum number of iterations.
**) Proportion of false positives to the total number of connected components. . . . . . .
6.1
Percent of total volume in each part of colon. µ is the mean, and σ is the standard deviation. 35
7.1
7.2
7.3
Estimation of volumes. *) Manual segmentation, ground truth. . . . . . . . . . . . . . . .
Test of reliability. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Total computational times for each series. . . . . . . . . . . . . . . . . . . . . . . . . . . .
xi
20
42
43
44
1
Constipation
Constipation is a complex of symptoms of many different pathophysiological processes. Due to a variety of symptoms and risk factors associated with constipation, the etiology might be multifactoral, and
this calls for both a determination of the epidemiology of constipation and a measurement to assess
constipation. Many definitions of constipation exist and this is directly reflected in the prevalence of
constipation. According to a review made by Connel et al the prevalence ranges from 3.4-27.2%. The
wide range is also due to different methods of ascertaining constipation. [16]
The first objective definition of constipation dates back to the 1960s where the bowel frequency was
used as an indicator for constipation. This definition was used for decades until Sander and Drossman
in 1987 suggested that also other symptoms should be taken in consideration. They also found that
physicians and patients had different understandings of constipation. This lead to the definition of the
Rome Criteria with the intention to be able to compare patients. Today the Rome II criteria which
comprise two or more of the following abnormalities for at least three months is used. [2, 17]
The Rome II Criteria:
• Fewer than three bowel movements (BMs) per week
• Hard stool in more than 25% of BMs
• A sense of incomplete evacuation in more than 25% of BMs
• Excessive straining in more than 25% of BMs
• A need for digital manipulation to facilitate evacuation
In the literature three subgroups of pathophysiology regarding constipation exits: Functional constipation (slow-transit constipation), Irritable Bowel Syndrom (IBS), and Outlet Obstruction. Being
able to distinguish between these subgroups often improves the outcome of treatment.
The known etiologies of constipation include mechanical obstruction, metabolic disturbances, neurological disorders, and medication side effects. Furthermore, some patients with constipation lack a
known cause, that is they suffer from idiopathic constipation. A differential diagnosis is Hirschsprungs
disease (Megacolon). Due to a lack of nerve cells in the myentheric plexus in the sigmoid colon,
this part becomes spastic. This results in an accumulation of feces proximally to this point. The diameter could expand to 8-10 cm. [18] A description of the anatomy of the colon is found in Section 1.1.
How is constipation being diagnosed in the clinic? According to World Gastroenterology Organization, no golden standard for assessing constipation exists. However, a recommendation suggests that
history taking including checking the Rome Criteria, physical examination including checking for palpable faeces and gas, and finally diagnostic techniques should be utilised. The latter is often done
by stool analysis or medical imaging of the colon to assess seriousness, or an analysis of colon transit
time depending on the outcome of the history taking and physical analysis.
1
CHAPTER 1. CONSTIPATION
Treatment of constipation is symptomatic. Change of diet and life style and stopping of medication
causing constipation are often the first steps. Further steps are use of fibers and different laxatives
and pelvic floor physiotherapy. [2]
1.1
Anatomy
This section serves to provide the reader with anatomy of the colon (Intestinum Crassum) and its
position and function. The digestive tract is divided into the upper and lower gastrointestinal tracts.
The upper part is composed of the mouth, esophagus, stomach and duodenum, the latter being the
first segment of the small intestine. The lower part is composed of the jejunum and ileum segments of
the small intestine and the entire large intestine. In the small intestine all nutrients from chyme are
digested, and in the large intestine water, sodium and some fat soluble vitamins are absorbed.
The large intestine is composed of the cecum, colon, rectum and anal canal. Hence, the colon is
a segment of the large intestine, and its function is storage of fecal matter. Colon is composed of
four further segments; colon ascendens, colon transversum, colon descendens, and colon sigmoideum.
These segments encircle the small intestine as shown in Figure 1.1. [19]
Fig. 1.1: The human colon comprises four parts; colon ascendens, colon transversum, colon descendens, and
colon sigmoideum. The transitions between the ascending, transverse, and descending segments are flexura coli
dextra and flexura coli sinistra, respectively [3].
Cecum is 6 cm in length and has a diameter of 7 cm. Cecum receives chyme from ostium ileocaele,
from here it continues to colon ascendens. This segment is 0.15 m and has a relatively large diameter,
though smaller than that of cecum. The uppermost part lies close to the lower part of the right
kidney. Flexura coli dextra is the transition from colon ascendens to colon tranversum. This flexure
is often obtuse-angled and is also termed flexura hepatica, which indicates that it lies closely to the
liver. Colon transversum is approximately 0.5 m and is a U-shaped sling with the point of folding
just reaching umbilicus (navel). It can also be V-shaped with the tip reaching the pubic symphysis.
Different manifestations of colon in terms of shape and location is shown in Figure 1.2. Flexura coli
sinistra is the transition from colon transversum to colon descendens colon. This flexure has an acute
angle and is also termed Flexura splenica, which indicates that it is close to the spleen. Colon descendens is 0.25-0.3 m and continues to crista iliaca (wing on the hip bone). This segment is often
contracted with a small diameter. The last segment of the colon is the sigmoid colon. This segment
is 0.4 m in average but can vary from 0.15-0.8 m. Colon sigmoideum has many appearances. It can
wind in various ways in the abdominal cavity.
2 of 50
1.1. ANATOMY
Fig. 1.2: Interpatient variations in shape and location of colon. The variation in distribution, shape, texture,
location and intensities of colon content in between patients are high, and even within patients, due to time of
scan, degree of constipation, and type of food intake [4, 5].
3 of 50
CHAPTER 1. CONSTIPATION
Fig. 1.3: The colon wall [6].
Figure 1.3 shows the physiologic anatomy of the intestinal wall. It comprises four layers. The innermost layer is Tunica mucosa, which epithelial lining amongst others contains Goblet cells. These cells
produce mucus, which provides lubrication for the transportation of chyme and protects the intestinal
mucosa. Tela submucosa is loose connective tissue, storing the submucosal plexus that controls
secretion. Tunica muscularis is smooth muscle characterised by a inner circular layer and three
longitudinal bands, teaniae coli. Because teaniae coli are shorter than the intestinal wall, Haustrae
coli emerges. On the surface of the circular muscle layer is the myentheric plexus, which contributes
to peristalsis. This phenomenon is explained in the next section. The outermost layer of the intestinal
wall is Tunica serosa. [18]
4 of 50
1.1. ANATOMY
Fig. 1.4: Peristalsis something something.
For transportation of the chyme different movements are performed. The peristalsis has already been
mentioned. This is a series of wave-like muscle contractions as shown in Figure 1.4. Some medicaments are known to inhibit the peristalsis resulting in reduced bowel movement which can lead to
constipation [20]. Another important type of movement is mass movement. This movement usually
occurs 1-3 times a day and very often shortly after breakfast has been ingested. In response to a irritated point in colon transversum, a constricted ring appears. 0.2 m distal to this point the haustrae
contract as one unit. This contraction lasts 30 seconds with increasing intensity. After a relaxation
lasting 2-3 minutes, another mass movement will occur. Mixing movements is a category of different
movements. Local intermittent constrictive contractions, which last for 5-30 seconds, are examples
of mixing movements. Due to cocontraction of coli teaniae and the circular muscle in the Tunica
muscularis haustrations proximal and distal to the contraction occur. These haustrations expose the
chyme to the epithelial lining, which enhances the resorption of fluids and dissolved substances. For
a healthy person it takes 8-15 hours for the chyme to pass from ostium ileocaele through the colon. [21]
The next sections are concerned with two methods of how to perform imaging of the colon as a
technique for diagnosing constipation.
5 of 50
2
Medical Imaging of the Colon
2.1
Radiography
A diagnostic technique to assess the degree of constipation is abdominal radiography (AR). Radiography is obtained by X-rays, which are ionising radiation. X-rays are produced by movements of
electrons in atoms. Orbitals are energy levels around an atom’s nucleus, and when an electron goes
to a lower orbital, energy is released as a photon. When this photon collides with another atom,
the energy of the photon is absorbed by the new atom. This forces an electron to a higher energy
level. When the electron falls back to its original energy level, energy is released in form of a photon. Hence, X-rays are electromagnetic energy carried by photons. The energy levels of X-rays bring
shorter wavelengths than visible to the human eye. The equation below describes the relationship
between wavelength λ and photon energy hω:
hω · λ = hc = 1239.842eV · nm
where h is Planck’s constant, h is the reduced Planck’s constant, and c is the speed of light. [22]
Fig. 2.1: A speeding electron hits a tungsten atom. Energy is released in form of a X-ray photon [7].
In practice X-rays are made from a X-ray machine by means of two factors. The first factor is the
voltage difference between a cathode and an anode. By heating the cathode, electrons are discharged.
The anode is made of tungsten and attracts the electrons. When a speeding electron collides with a
tungsten atom, an electron is forced to a lower orbital. As shown in Figure 2.1 As aforementioned,
energy is then released as a photon. When the energy drop is large enough, the photon has a high
energy level making it a X-ray. [23, 24]
When X-rays hit a material a proportion of the X-rays is absorbed, depending on the density of the
material. If placed on a film sensitive to X-rays, the proportion of X-rays passing through the subject
is detected and can be used for a 2D representation of the subject. Since X-rays travel in straight
lines, only small deformations occur in X-ray images. [23, 24]
7
CHAPTER 2. MEDICAL IMAGING OF THE COLON
In the case of assessing constipation from radiography, the amount of colon content is interpreted from
the air bubbles on the images. The fact that colon content is not visible implies some difficulties in
interpreting these images. A study made by Reuchlin-Vroklage et al proved that using plain AR to
assess the amount of colon content is highly subjective and has high interobserver and intraobserver
variability. This impairs the diagnostic accuracy. [25]
To improve the readings from plain AR, different rating tools have been developed, e.g. the Barr,
Leech and Blethyn methods. The rating tool described by Barr et al. evaluates the amount and nature
of content in the ascending, transverse and descending colon, and the rectum. Leech et al. divide the
abdomen into three zones and then evaluate the amount of colon content and bowel dilation in each
zone. Finally Blethyn et al. base their rating tool on the amount of colon content and whether it
is continuous or discontinuous. Moylan et al. evaluated these tools in terms of reliability and found
a poor reliability among multiple examiners [26]. A study made by Bongers et al. agrees that plain
AR has limited value in diagnosing children with functional gastrointestinal disorders [27]. When a
patient is referred, an objective method is required, when the knowledge of ground truth is absent [13].
The biological effects of exposure to ionising radiation are caused by deterministic and stochastic
effects. Deterministic effects occur when a certain threshold has been exceeded. Consequences are
cataract, sterility, or radiation sickness. Stochastic effects occur with no threshold limits but the risk
of an effect occurring is linearly increasing with the dose. Consequences are cancer or hereditary
defects. During an examination the patient is exposed to a radiation dose corresponding to the dose
of background radiation during a couple of days to a couple of years [28]. A principle of radiation
protection is specified as the As Low As Reasonable Possible-principle, which implies that the risks and
benefits of radiography should be compromised. Therefore, a method to avoid unnecessary exposure
should be identified.
Fig. 2.2: Abdominal Radiography. The air bubbles in the colon appears as black spots [8].
8 of 50
2.2. MAGNETIC RESONANCE IMAGING
2.2
Magnetic Resonance Imaging
MRI utilise magnetism and radio frequencies (RF) to generate sectional images of the body. Hence, no
exposure to radiation for the patient. The nucleus of an atom has a mass and an angular momentum
due to its spin. The mass of the nucleus gives rise to a moment of inertia. This inhibits the nucleus
from aligning in a magnetic field. This factor, the ratio of the magnetic moment to the moment of
inertia, is termed the gyromagnetic ratio. Specific gyromagnetic ratios for different isotopes allow a
differentiation during MRI. [23, 24]
MRI is accomplished through various measurements of the rotational precession of the nucleus and
the precession of the magnetic field caused by the nucleus. The precessions are shown in Figure 2.3.
Because the gyromagnetic ratio is fixed for all elements, the precessional frequency is fixed for a given
strength of a magnetic field.
Fig. 2.3: Precession: A rotation of an axis on rotation [9].
The nuclear magnetic resonance signals are produced by elements having nuclear angular momentum.
Hydrogen atoms are of major importance due to their high gyromagnetic ratio. The magnetic resonance emerges from protons alternating from parallel to antiparallel in proportion to a static field.
The alternation is caused by an electromagnetic RF alternating at the Larmor frequency being applied
to the static field. Excessive energy is emitted from protons antiparallel to the static fields, which is
emitted from the protons as electromagnetic radiation.
The precessional frequency is also known as the Larmor frequency. The Larmor equation below expresses what RF will be emitted by a specific element. ω is the rate of precession, γ is the gyromagnetic
ratio, and B0 is the strength of the static magnetic field. When this RF is detected by a radio receiver
turned to this frequency, MRI can occur.
ω = γ · B0
Summed up, applying a Larmor frequency RF and then detecting the RF emissions from the protons,
produce the magnetic resonance signal. Afterwards techniques for digital reconstructions can create
sectional images. [23, 24]
The appropriate pulse sequence and spatial localisation within the subject are obtained by means of
three sets of gradient coils in the MR scanner, one for each of the x, y, and z axes. Radiofrequency
coils serve as both the transmitter and the receiver of the magnetic resonance signals. Preferably these
coils are placed as close to the requested body part, and that is why they are shaped for their purpose.
Figure 2.4 shows the gradient coils and a radiofrequency coil often used for abdominal MRI.
9 of 50
CHAPTER 2. MEDICAL IMAGING OF THE COLON
Fig. 2.4: Left: Gradient coils creating the static fields [10]. Right: RF coil used for abdominal MRI [11].
2.2.1
MRI Parameters
The pulse sequence being used determines the parameters controlling the MRI process. The following
section briefly outlines the parameters starting with three intrinsic parameters followed by four parameters controllable by the operator.
Proton spin density (ρ) is the quantity of resonating spins in a tissue. It determines the magnetic
resonance strength corresponding to the brightness in the image. When an image is proton density
weighted, a long repetition time (TR) between the pulses has been used to detect RF signal from a
larger number of protons. Due to small differences between the percentages of hydrogen in various
tissues, these images have relatively low contrast. Longitudinal relaxation time (T1) is a measure
of the time required for the protons to realign with the external magnetic field to 63% of the maximum
possible strength. The T1 varies significantly for hydrogen in different tissues. The molecular motion
in fat nuclei is rather slow, the T1 is equivalently short resulting in a rapidly regained longitudinal
magnetisation. In water it takes longer to regain the longitudinal magnetisation due to the high mobility of water molecules. The water nuclei are more adversative to give up energy to surrounding
tissues. In a T1 weighted image tissues with short T1 appear bright, e.g. fat and protein-containing
fluid. Transverse relaxation time (T2) is a measure of the time required for the protons to lose
63% of their alignment with each other (spin-spin) after a RF pulse has cause them to align at an angle
to the external magnetic field. When the transverse magnetisation of the nuclei occurs, a transient
MR signal is produced. This signal will decay to zero during T2. This phenomenon is called the free
induction decay (FID). Fat is efficient at exchanging energy resulting in a short T2. The converse is
true of water, which is less efficient at exchanging energy resulting in a long T2. Since, the efficiency
of energy exchange is a major factor, T2 is temperature dependent. In a T2 weighted image water
appears brighter than fat. Repetition time (TR) is the time interval between successive pulses
sequences. Variations in this value have great impact on the characteristics of the image contrast.
Figure 2.5 shows differences in signal intensity with varying TR. Echo time (TE) is the time interval
between a 90 ◦ pulse and the peak of the echo signal in a specific pulse sequence, the spin-echo pulse
sequence. Inversion time (TI) is the time between a 180 ◦ pulse and a 90 ◦ inversion pulse in a so
called inversion recovery pulse sequence. Controlling this factor allows water or fat suppression. Flip
angle α is used in echo imaging. The longitudinal magnetisation is maintained by using a flip angle.
[23, 24]
10 of 50
2.3. PROBLEM STATEMENT
Fig. 2.5: Differences in signal intensity between water and fat with varying repetition time, TR [12].
2.2.2
MRI Artifacts
To secure a high image quality and optimal use of the image, knowledge about artifacts is important.
Image artifacts are structures not normally seen, but they occur at images due to different sources of
limitations. Two major groups of artifacts exist. The first group of artifacts is caused by limitations
of the MRI devices (hardware and software). For instance, the coils can produce artifacts which are
seen as ringing in the images. The second group of artifacts is caused by the physiology of the subject.
This group can be subdivided into motion artifacts and inherent physics. Respiratory artifacts is a
type of motion artifacts impairing the image quality due to long scan times. Motion artifacts can be
reduced by reducing scan time, cardiac gating, or use of a breath-holding technique. An example of
artifacts caused by inherent physics is chemical shifts. Chemical shift artifacts are caused by a slight
difference (parts per million) between resonating frequencies of similar protons. E.g. the resonance
frequency of hydrogen in fat is slightly higher than that of hydrogen in water. This can result in a
bright band next to a dark band at fat-water interfaces. This kind of artifacts can be reduced by
using higher gradient, even though this also can contribute to a higher amount of noise in the image.
[23, 24]
2.3
Problem Statement
From the above sections it is evident that the state of the art to assess the degree of constipation is
inadequate. Plain AR is highly subjective and has a high degree of variability in the readings. In
addition to this the patient is exposed to ionising radiation, which can induce some undesirable effects.
MRI stands out among imaging techniques for soft tissues. In order to assess the degree of constipation
in terms of eliminating subjectivity, an algorithm could be developed to quantify constipation from
11 of 50
CHAPTER 2. MEDICAL IMAGING OF THE COLON
MRI. This gives rise to the problem statement below:
How can an algorithm for automatic segmentation of the colon content from MRI be
developed?
The objective is to develop an image analysis system that comprises an algorithm for semi-automatic
segmentation of the colon content. The output of this image analysis system are parameters that
could support the physician in assessing the degree of constipation.
The scope of the project applies to physicians responsible for the management of assessing constipation. A compromise between a perfect segmentation and a minimum of operator intervention is
assigned a high priority. The limitations in this projects are mainly due to the limited amount of time
and limited amount of data.
12 of 50
3
Image Analysis System
This section serves to create an overview of the algorithm. It consists of an input, three blocks of
image processing, and finally an output. A graphical representation of the overall algorithm is shown
in Figure 3.1.
Fig. 3.1: Traditional division of an image analysis system [13]. The components are processed in the following
chapters.
The input to this image analysis system is a volume data set described in Section 3.1. The first block
is processed in Chapter 4. The segmentation block is processed in Chapter 5. The output of the image
analysis system is a set of parameters that are calculated in the rightmost component in Figure 3.1.
This process is described in Chapter 6. The image analysis system is tested in Chapter 7.
3.1
Data Set
Five sets of MRI were supplied acquired at the Department of Radiology, Aalborg Hospital. The
images were provided in the Digital Imaging and Communications in Medicine (DICOM) format.
Conformance to this standard claims amongst more a protocol to be followed for network communications, including syntax and semantics when using this protocol, and also a set of media storage
services to be followed for instance file format and medical directory structure.
Different series were investigated in order to visually extract features for the segmentation. All series
were from a GE Medical System scanner using a magnetic field strength of 1.5 T. Series with Liver
Acquisition with Volume Acquisition (LAVA) sequence applied were chosen. This sequence is derived
from a 3D spoiled gradient echo pulse sequence. The gradient echo is achieved by dephasing and then
rephasing the spins with negatively and positively pulsed gradients, respectively, creating the echo.
The first RF pulse in the sequence tilts the magnetization by a flip angle α. The contrast in the image
is controlled by α. For the data sets, TR = 3.784-3.852 ms, TE = 1.776-1.804 ms, TI = 7 ms, α =
12 ◦ . The rather small value of α indicates that the longitudinal magnetization is slightly decreased
resulting in a faster recovery time. This allows for the short TR. As shown in the Figure 2.5 this short
TR time results in a preferable contrast.
Additional, prospective scans of two subjects were performed using a field strength of 3 T. For these
scans, TR = 4.508-4.512 ms, TE = 2.1128-2.128 ms, TI = 5 ms, α = 12 ◦ . These scans were used to
test for reliability.
Short TE and TR enhances T1 in the data set. The colon content is fatty and high in protein, which
results in this tissue appearing bright compared to surrounding tissue. This indicates that an untrained eye could distinguish colon content from other tissues explicitly, which appears to advantage
13
CHAPTER 3. IMAGE ANALYSIS SYSTEM
when the segmentation algorithm is to be continuously validated.
Figure 3.2 shows two images from a LAVA series:
Fig. 3.2: Two coronal MR images from a LAVA series. The colon is labelled with green arrows while artifacts
are labelled with red arrows.
The images in Figure 3.2 demonstrate more features:
• Colon content is distinct/high in intensity in images from a LAVA series
• Colon has a wall and the edge appears quite clear
• The colon is apparent in more slices
• Artifacts are as prominent as colon content which necessitates a technique for removal of artifacts
Figure 3.3 shows an outline of an axial view of the data set.
Fig. 3.3: Axial representation of the data set.
The terms slice thickness (ST) and spacing between slices (SBS) are explained in figure 6.8. The total
volume (TV) is dependent of the number of slices (NS), by:
TV = (ST + SBS) · NS
For the given data set, ST = 3.60 mm and SBS = 1.80 mm.
14 of 50
4
Initial Segmentation
4.1
Purpose
To select and extract features for the further segmentation. The selection and extraction will result
in an initial segmentation. Based on unfortunate features in the data set, methods for suppression of
or removal of artifacts are desired.
4.2
Different Methods for Initial Segmentation
Many ways of extracting features from an image exist [13]. The given data set and applications are
decisive factors. In broad outlines the features can be based on pixel intensities or on calculations of
these. The selection and extraction of features are emphasised since the purpose is to maximise tissue
contrast differentiation. The following section contains considerations regarding selection of method.
4.2.1
Filtering
To filter out noisy single high-intensity pixels and to be able to retrieve statistic measurements on
single connected components for later determination of region growing criteria, all small connected
components (< 50 voxels) were removed.
4.2.2
Region Filling
The symptoms of constipation are due to increased content in the colon. The factors contributing to
the symptoms are both solids and air bubbles. Hence, the segmentation should include both solids
and air bubbles inside solids. In the implementation air bubbles were filled using a flood fill algorithm
in Matlab.
4.2.3
Edge Detection
As stated in section 3.1 the colon content has three distinct features. One feature is concerned with
the colon having a wall. This implies that edge detection could be useful as a method for initial segmentation. An edge in an image is considered as a gradient within the image. Hence, edge detection
extracts the features of an image where an abrupt change in the intensities is present. To detect these
changes edge detection makes use of differential operators. Different edge-detector filter kernels have
been proposed to approximate derivatives of the image gradients. However, using edge-based features
can result in incorrect edges due to noise, or in an over- or under-segmentation.
The following Figure 4.1 shows an image from the data set with different kernels applied.
15
CHAPTER 4. INITIAL SEGMENTATION
Fig. 4.1: Left: Edge detection using a Sobel kernel. Right: Edge detection using a Gaussian smoothing filter
and a Laplacian filter kernel.
The image to the left in Figure 4.1 uses the Sobel kernel, which performs a 2D spatial gradient measurement, the first spatial derivative. The Sobel kernel consists of a pair of 3x3 kernels. The kernels
are identical except that the second is rotated by 90 ◦ . This design assures that the edges horizontally
and vertically relative to the pixel grid are measured. The Sobel kernel is not as sensitive to noise as
other first derivative edge-detectors, since it differentiates in one direction and implements a Gaussian
average filter in the other direction. This reduces the impact from noisy pixel. A first order derivate
filter gives the strength of a gradient but does not provide the precise onset of the edge. The latter
can be obtained by using a second order derivative filter. The result of using a Sobel kernel is a broad
but not continuously outline of the colon content.
The image to the right in figure 4.1 uses the Laplacian of Gaussian method. The Laplacian is a measure of the second spatial derivative of gradients. A second order derivative filter identifies the precise
onset of the edge, but leaves only little information on the strength of the edge and also amplifies
noise. Since this method is very sensitive to noise, a Gaussian smoothing filter has been applied to
suppress noise in the image prior to computing the Laplacian, as the title also indicates. It is evident
that the result of using the Laplacian of Gaussian is an over-segmentation.
The Canny edge detector is acknowledged as one of the best edge detection methods. This method
includes a smoothing by use of a Gaussian kernel, calculations of edge strengths and directions,
suppression of pixel not identified as edge pixels, and finally thresholding the remaining pixels. Figure
4.2 shows the result of applying the Canny edge detector. Despite of acknowledgment of the method,
the result includes too many false positives to base an initial segmentation of this method.
16 of 50
4.2. DIFFERENT METHODS FOR INITIAL SEGMENTATION
Fig. 4.2: Edge detection using a more sophisticated method, the Canny edge detector.
Edge detection as a method for initial segmentation was not satisfying, mainly because no clear border
between colon content and surrounding tissue is apparent. Even though three very different kernels
were applied, indications of an over-segmentation were obtained.
4.2.4
Global Thresholding Based on Intensities
Another way of extracting features is intensity thresholding. The basic idea is that pixels having a
value above a given threshold T are assigned to one region. The output b(x,y) from thresholding an
intensity image I(x,y) is a binary image:
(
b(x, y) =
1
0
if I(x, y) > T
otherwise
This method is really applicable when the input image is divisible into a foreground of objects of the
same intensity and and a background having intensities distinct to the foreground. I.e. this method
is limited to rather simple cases.
The threshold can be estimated in many ways, the most simple being empirically. Operator selection
of thresholds has been reported in a study [13]. However, in the task of segmenting the colon content, the need of manual intervention should be minimised in order to stress the physician minimally.
In addition to that inter- and intra variation might influence the selected threshold. An automatic
method for choosing the thresholding is necessitated.
To identify a proper threshold for the data set described in Section 3.1, histograms were made. An
image and its associated histogram from the LAVA series are shown in Figure 4.3:
17 of 50
CHAPTER 4. INITIAL SEGMENTATION
Fig. 4.3: Left: Coronal slice from a LAVA series. Right: The associated histogram.
To extract features characteristic for the colon content, a cropping was done. The result is shown in
figure 4.4:
Fig. 4.4: Left: Cropping colon content in a coronal slice. The cropped image is labelled with white. Right:
The histogram from the cropped image.
Two histograms in Figure 4.3 and Figure 4.4, respectively, indicate that colon content has higher intensities compared to surrounding tissue. Please note that the resolutions on the abscissas are different.
It is assumed that colon content covers the same intensity spectrum for all slices from the same series.
This property yields that globally thresholding based on intensities could result in a valid initial segmentation. Which threshold intensity to use is the next question. Since no training data was used,
an unsupervised method for classification was required.
Unsupervised classification is often termed as cluster analysis. In this analysis a search for similarity
in the data set is conducted in order to determine whether so-called clusters can be formed. Fuzzy cmeans (FCM) is a method of clustering which allows one piece of data to show a degree of membership
to two or more clusters.
18 of 50
4.2. DIFFERENT METHODS FOR INITIAL SEGMENTATION
Clustering Using Fuzzy c-means
To identify a global threshold, clustering using FCM was attempted. This was done to group colon
content in one cluster in order to conduct an initial segmentation. Clustering is a mathematical tool
for achieving natural groupings in a data set. The objects inside each cluster show a certain degree of
similarity. The algorithm for this method is as follows:
1. Choose a number of clusters and place initial cluster centers in the feature space
2. Measure distances from all samples to the centers resulting in an initial classification
3. Compute mean values of the clusters to find new cluster centers
The above algorithm is repeated until the algorithm converges meaning that the new cluster centers
are equal to or very close to the former cluster centers.
In Matlab the function fcm was used to conduct clustering using FCM. This which gave rise to the
following issues:
1. What is the optimal number of clusters?
2. Which are the optimal initial cluster centers?
3. Which cluster is used as colon content/true positives?
Ad 1: The optimal number of classes is a compromise on how many false positives to exclude, how
many true positives to include.
Ad 2: The FCM clustering function in Matlab suggests random values for initial cluster centers. Being
able to choose the initial cluster centers in advance might reduce the number of iterations.
Ad 3: To address the aforementioned issues, different numbers of clusters were used on the data set.
As shown in Figure 4.4 the colon content has high intensity between 0.3 and 0.6, which is among the
highest intensities in the image. This indicates that the clusters with the highest intensities should
be identified as colon content/true positives. The threshold is calculated as the average of the highest
cluster center and the second highest cluster center, as illustrated in figure 4.5.
19 of 50
CHAPTER 4. INITIAL SEGMENTATION
Fig. 4.5: Histogram of one complete series. The lines indicate the cluster centers, and the calculated threshold.
Table 4.1 contains the results from this review.
No of clusters
2
3
4
5
6
7
Threshold intensity
0.1335
0.3165
0.3601
0.4023
0.4805
0.5292
No of iterations
29
52
73
100*
100*
100*
Proportion**
85/85
76/88
51/84
37/67
39/68
26/54
Table 4.1: *) The FCM clustering function in Matlab has a (default) maximum number of iterations. **)
Proportion of false positives to the total number of connected components.
As previously mentioned in this section most of the colon content has intensities higher than 0.3. As
shown in Table 4.1 this excludes the threshold obtained using two clusters. From segmentations using
3 or 4 clusters, some parts of the liver tissue were included. These are considered as false positives. To
identify the number of clusters resulting in a satisfying compromise between true positives and false
positives, a histogram of the liver was made. In figure 4.6 histograms of both original image (coronal
slice from data set) and cropped image of liver are shown. When comparing the latter histogram
with the histogram of colon content in figure 4.4, it is noticed that colon content is slightly higher in
intensity.
20 of 50
4.2. DIFFERENT METHODS FOR INITIAL SEGMENTATION
Fig. 4.6: Above: Histogram of original image (coronal slice of data set). Below: Histogram of liver tissue.
Based on this a FCM using 5, 6, or 7 clusters could be used. It could be desirable to choose 7 clusters
since this one had the fewest false positives.
Fig. 4.7: Left: True positives from FCM using 7 clusters. Right: True positives from FCM using 5 clusters.
However, as shown in figure 4.7, choosing 7 clusters only finds true positives in some parts of the
present colon content. Use of 6 clusters gave similar results. Since a FCM using 5 clusters resulted in
true positives in more components of the colon content, this approach was chosen. However, because
the artifacts and the breast tissue demonstrate intensities as high, or even higher than colon content,
this approach still included a number of false positives. This fact is considered in the next section.
4.2.5
Manual Intervention
In order to narrow down the initial segmented areas to only contain true positives, i.e. connected
components representing colon content, regions of interests were drawn on several images of the series
with the initial segmentation. As shown in Figure 4.8 to the left, FCM clustering and morphological
filtering include both true positives and false positives. Depending on the quality of the manual
segmentation, the resulting areas represent only components with true colon content. Regardless of
21 of 50
CHAPTER 4. INITIAL SEGMENTATION
the quality of this intervention, this step composes the foundation for the region growing described in
chapter 5.
Fig. 4.8: Left: Candidate areas for initial seed point selection based on FCM clustering and morphological
filtering. Right: Areas of colon content inside the manually drawn region of interest. These areas are used for
final seed points for region growing.
22 of 50
5
Segmentation
5.1
Purpose
The purpose of segmentation is to divide an image into desired regions. These regions are then targets
for a classification.
5.2
Texture analysis
Texture can be described as an area, where the variety of intensity has specific features, that is
perceived as uniform. Textured region comprises of a set of pixels, satisfying a given gray-level
property, which is repeated in an image. They are frequent among others in medical images and well
recognised by human observers. Textures help in recognising borders, types of structures, shapes of
objects and their spatial placement [15]. Textures can be divided into categories according to:
• periodicity – strong textures with regular spacing, or rather stochastic textures without expressed
spatial regularity (weak textures)
• size – large or small primitives (groups of pixels), determine whether the texture is perceived as
fine or coarse
• orientation of primitives – isotropic (same in all directions) or directional (with one leading
direction)
5.2.1
Micro-features
Using this method it is assumed, that the texture contains periodic transitions of image brightness
(called micro-features). This means that sets of points have a specific intensity distribution in the
space domain. The extraction of a local feature is performed by a 2D convolution of the image with
a filter mask, that represents a model of the micro-feature pattern. These masks are empirically set
matrixes, usually of a 3x3 size. Nine masks were tested on our image data, here is an example of 2 of
these masks: [1 2 1; 2 4 2; 1 2 1]/36 and [1 0 -1; 2 0 -2; 1 0 -1]/12.
The segmentation method comprised of the following steps. The whole image was convolved with the
nine mentioned masks. The result of convolution of reference pixel (center of a single component after
FCM clustering) was saved in a separate variable. The convolved image was then scanned pixel by
pixel and using the euclidean distance formula:
v
uN
uX
D(a, b) = t (ai − bi )2
i=1
a difference of each pixel from reference pixel (both after convolution) was calculated for each mask.
The distance for all masks in each pixel was then averaged. Finally a simple threshold value (a) was
set. Every pixel with a distance < a was concerned as a member of the texture. The problem arose
with the final step – setting a threshold. The liver appeared to have very similar texture properties
and was very often included as undesired false positive result. Setting a small threshold resulted in
23
CHAPTER 5. SEGMENTATION
segmenting small areas around the center of each component, with missing areas while approaching
the boundary of components (drops in intensity values). This could partially be solved by setting a
multilayer perceptron artificial neural network as a classifier, as proposed in [29].
5.2.2
Co-occurrence matrix
A gray level co-occurrence matrix (GLCM) is a tabulation of frequency of different pixel brightness
value combinations occurring in an image. It is a n x n matrix P , where n represents the number
of gray levels within an image. In a GLCM texture, the relation between two pixels (reference and
neighbor) at a time is considered. P [i, j] accumulates the pixel pairs having intensity values i and j.
Pixel pairs in the matrix are defined by a displacement vector d = (dx, dy), where dx is the number
of pixels moved in horizontal and dy a number of pixels moved in vertical direction.
Fig. 5.1: Co-occurrence matrix, [14]
For quantification purposes various textural features can be calculated, including energy, variance,
correlation etc. Since the measures consider relations between groups of two pixels (in the original
image), the texture calculations are of second order. The main problem using this method, considering
the number of gray scale levels in a DICOM image, is computational time. The number of levels can of
course be reduced, resulting commonly in a 32x32 or 64x64 co-occurrence matrix. Further, for different
numbers of displacement vectors, this matrix grows in third dimension. In our case the following
displacement vectors, forming 4-connected pixels, were used: d1 = (1, 0), d1 = (0, 1), d1 = (−1, 0)
and d1 = (0, −1). This quantisation leads in final to loss of valuable details in image and many false
positives especially among the liver area and parts of the neighboring small intestine.
5.2.3
Difficulties Regarding Texture Analysis
For the co-occurrence matrix a limitation presents the computational time, depending on the predefined number of levels and displacement vectors. The same accounted for micro-features based on the
number of chosen matrixes to convolve with the original image.
The main difficulty in applying texture analysis in these series was with redefining and setting parameters for including voxel as a part of the texture. In other words, it would be necessary to implement
a neural network classifier, or similar complex sorting algorithm to correctly classify voxels in the
image. A basic approach by applying simple thresholds resulted in varying results of over- or undersegmentation. Additional user interaction would be necessary for inputting and fine tuning thresholds
for single image series, which is acceptable in technical practice, however not in clinical application.
Texture analysis could provide satisfying results when applied and adjusted on a series by series basis
which does not conform with the aim of the project.
5.3
Region Growing
Region growing falls into region-based segmentation methods. That is segmentation based on areas
of certain common property, using the concept of homogeneity. The principle of region growing is
24 of 50
5.4. REGION GROWING ALGORITHM
planting a seed in a potential region. The seed has properties typical for that region, usually intensity
or local mean. The surrounding pixels are aggregated to the region when fulfilling certain criteria of
homogeneity. These criteria are termed as growing criteria.
This iterative approach is then followed: For every pixel already recognized as a member of region
(initially the seed), check neighboring pixels (in 2D, 4- or 8- connected) and compare parameters of
these candidates to parameters of the seed or other set condition. If the conditions are met, add the
current pixel to region, if not, mark pixel as not accepted. Algorithm ends, when there are no pixels
to be added – for all pixels belonging to region, their neighbors are either members of the region or
marked as not accepted [15].
Fig. 5.2: 4- and 8-connected neighborhoods for region growing [15].
5.3.1
Common Approaches
Static Criterion
New pixel (pj ) is added to region after comparing with original seed pixel (ps ) in a predefined value
(tolerance) T according to formula: |ps − pj | ≤ T . The initial selected seed largely influences resulting
region. This method is not robust against noise. Another approach to initialize the region is to use
multiple seeds. This is either done by calculating local statistics in a mask around an initial seed, for
instance mean value or standard deviation, or by planting more initial seeds in an intended region to
better span the variance.
Dynamic Criterion
New pixel is added to region after comparing with previously added pixel (pi ), under condition that
|pi − pj | ≤ T . Another option is a comparison to current mean value of region |pj − pa | ≤ T . This
helps to grow on a slowly changing area. The order in which pixels are added to region affect the final
region [15].
5.4
Region Growing Algorithm
The main aim of the region algorithm in this project is to segment all desired areas of colon content
in the image series. This works ideally by letting initial seed points grow and extend their areas to
include every single pixel which is considered as colon content and not include any unwanted non-colon
content pixels. This should ideally produce a segmentation where every single pixel belongs to either
the desired colon content (true) or non-colon content (false). This way the region growing produces
a binary image where all pixels either are true or false e.g. colon content or not. To achieve this,
two conditions must be true: a seed point is placed in every colon content component of the image
and a ideal criterion that makes the region growing include all true colon content pixels and exclude
all non-colon content pixels. This then requires two things: correct placement of seeds and correct
growing criteria. As earlier described, the initial segmentation for selection of seed points relies on
FCM clustering and drawing of manual regions of interest. This ideally produces seed points in all
components of colon content, which then requires the selection of criteria for the region growing to
25 of 50
CHAPTER 5. SEGMENTATION
expand the areas to the desired segmentation.
A common criterion used for region growing is based on the intensities of the image and thereby the
contrast between wanted and unwanted areas. This feature is usable as a criterion for segmentation
of the colon content while the intensity of the colon content generally is brighter than that of colon
lumen and surrounding tissue as seen in Figure 5.3.
Fig. 5.3: Crop of the right flexure at the transition from the ascending to the transverse colon. The contrast
between bright colon content and the colon lumen and surrounding tissue is quite clear.
Two approaches of using the intensity as a criterion for growing were tried out. The initial segmentation
results in a number of connected components that all are possible colon content. In the first approach
the pixel with maximum intensity of each component was used as seed point. The threshold for
inclusion was calculated as the intensity mean of a 3x3x3 mask around the seed point. This would
label each new potential pixel I as either true (colon content) or false (non-colon content) by the
following:
(
I=
true
f alse
if Tlow < I < Thigh
otherwise
where Tlow and Thigh are the lower and upper threshold that the potential new pixel has to satisfy
before being added to the region and is calculated as:
Tlow = µs,3x3x3 − γ
(5.1)
Thigh = µs,3x3x3 + γ
(5.2)
where µs,3x3x3 is the mean of a 3x3x3 mask around the original seed pixel and γ is an empirical chosen
value e.g. 0.2.
This approach however did not take into account the already segmented pixels in the components
from the initial segmentation and the empirical chose value γ did not take the span of pixel intensities
of the colon content into account. Therefore, the second approach used all of the initial segmented
pixels as potential seed points, but checked whether surrounding pixels had already been marked as
colon content from the initial segmentation. Also, the threshold used in this approach were based on
statistics of the initial segmented components. This resulted in the following threshold Tlow and Thigh :
Tlow = µcc − α · σcc
(5.3)
Thigh = µcc + α · σcc
(5.4)
26 of 50
5.4. REGION GROWING ALGORITHM
where µcc is the mean intensity of the connected component used as seeds, α is a empirically chosen
growing parameter, σcc is the standard deviation of the connected component.
This resulted in an improved segmentation by tuning of the α parameter and takes into account the
variation in pixel intensities of the colon content initially segmented.
However, the region growing would still tend to leak in areas when the contrast between colon content
and surrounding tissue was low. Using a too low value of α would result in under-segmentation and
a too high value would result in leakage and over-segmentation. Therefore a second growing criterion
was added to the region growing to give better control of the growing.
To retrieve the second growing criterion, edge detection based on sobel filters in horizontal and vertical
direction was performed on the original image series. They were respectively used to compute the
two-dimensional convolution as estimates of first order derivatives on the x- and y-axis. These two first
order derivatives were further combined into a gradient magnitude. Colon content having a brighter
intensity than surrounding tissue tends to have a surrounding dark area formed by the colonic wall,
making a step in intensity that could be captured using the mentioned edge detection method. Forming
a gradient image with edges allowed performing of a morphological flood-fill operation that fills the
surrounded areas of colon content. The resulting gradient filled image can be seen in Figure 5.4.
Fig. 5.4: Crop of the resulting gradient filled image after applying a sober filter and morphological flood-fill
operation. The colon content of areas of high gradient peaks results in high contrast between content and
surrounding tissue.
This laid the basis for the second region growing criterion. For every new potential pixel of a region,
the mean of a 5x5x5 mask on the pixel position was calculated from the gradient filled image which
was compared to the mean of the all pixels in the connected component of the seed points. This
resulted in the following second growing criterion:
(
I=
true
f alse
if µgradient,5x5x5 > β · µgradient,cc
otherwise
where I is the new potential pixel under investigation, µgradient,5x5x5 is the mean of a 5x5x5 mask on
the gradient image at the pixel position currently being investigated, β is a empirically chosen growing
parameter, and µgradient,cc is the mean of the connected component from the gradient image.
The resulting segmentation using both intensity and gradient features of the image results in a better segmentation with less vulnerability to surrounding low contrast tissue and hence less leakage.
Figure 5.5 shows the result from applying the region growing algorithm.
27 of 50
CHAPTER 5. SEGMENTATION
Fig. 5.5: Segmentation after region growing has been applied. The pink color shows the initial seed points
segmented based on FCM clustering, and the blue color shows the expanded segmentation from region growing.
28 of 50
6
Post Processing
6.1
Purpose
The purpose of the post processing is to add clinical value to the segmentation. By providing the
physician with measures, he or she can relate to a degree of constipation, the level of subjectivity in
assessing constipation might decrease. The proposed algorithm presents an objective measurement of
the colon content.
To add clinical value to the segmentation, some measures should be extracted from the segmentation.
These measures should be objective and allow for a comparison of scans in time and performing of
follow-ups of treatments.
6.2
Estimation of Cross-sectional Areas
The degree of constipation is related to the cross-sectional area of the colon content, given that is a
measure of how loaded the colon is.
Fig. 6.1: Cross-sectional area of a component (yellow). The centroid is marked with a dot, and the direction
of the component is marked with an arrow.
6.2.1
Approach
To obtain an estimate of the cross-sectional area it was assumed that colon content was tubular shaped
or like a pearl necklace consisting of many small components. With this assumption, the largest crosssectional area for each component would be located in the centroid of each component. The centroid
was provided by a Matlab function regionprops and its method Centroid.
To be able to calculate the correct cross-sectional area, the direction of each component of the colon
content was found. This was done by calculating a convex hull of each component was calculated
by means of a Delaunay Triangulation (DT). In 3D the DT is a triangulation of every 3-simplexes
created from points within the component which circumsphere does not contain any other points in
the component. From the DT a principal component analysis (PCA) was conducted. The PCA is an
orthogonal transformation converting a set of observations into linearly uncorrelated variables. This
transformation is based on calculated pixel intensities. PCA uses linear correlation and identifies a
connection between samples and a linear trend line. The variable having the largest variance is assigned as the first principal component. The second principal component has the highest variance
29
CHAPTER 6. POST PROCESSING
possible under the constraint to be orthogonal to the preceding component, etc. Hence, this analysis
transforms data into a new coordinate system wherein the first coordinate is held by the component
having the largest variance.
Hence, the principal components are obtained by computing a covariance matrix for the coordinates
that compose the DT of each component, and then finding the eigenvectors and eigenvalues from this
matrix. The eigenvector having the largest eigenvalue has the direction of the component, assuming
that the largest variation is the largest distance.
To retrieve the location of the content, the eigenvector was evaluated. Depending on the eigenvector
having the largest absolute eigenvalue, each component was classified into belonging to either the
descending/ascending or the transverse colon. When the eigenvalue was larger in the x-direction,
the component was classified into the transverse segment, and when the eigenvalue was larger in the
y-direction, the component was classified into either the ascending or descending segment. In case
having the larger eigenvalue in the z-direction, the second largest eigenvalue was used to indicate of
the direction of the component. Figure 6.2 summarises this.
Fig. 6.2: The direction of each component depends on the components of the eigenvector having the largest
absolute eigenvalue: When the absolute value of the component in the x-direction was largest, the component
was classified into the transverse segment (red). Conversely, when the absolute value of the component in the
y-direction was largest, the component was classified into either the ascending or descending segments (purple).
The cross-sectional area was calculated as the plane through the centroid orthogonal to the direction,
i.e. the eigenvector.


a


N =E= b 
c
The coordinates for the centroid are cf. item 1:


x0


center =  y0 
z0
30 of 50
6.2. ESTIMATION OF CROSS-SECTIONAL AREAS
These properties are utilised in the following function for a plane:
plane(x, y, z) = b · (x − x0 ) + a · (y − y0 ) + c · (z − z0 )
The coordinates for each component were evaluated in the equation for the plane. The sets of coordinates resulting in the equation being approximately equal to zero are coordinates on the disk.
Fig. 6.3: Segmented colon content. The red lines indicate the plane through each component.
The area of this plane is one measure of how constipated a patient is. To compute the area the slice
thickness, spacing between slices and pixel spacing are read from the dicomHeader.
In general the area of the plane is calculated as the area in the slices added to the area between slices:
oneSliceArea = areaInSlice + areaBetweenSlices
The area in slice is computed from the slice thickness and the length of the plane in the current slice.
The area between slices is estimated by the approach shown in Figure 6.8. The length of the disks
in the surrounding slices, slice(n) and slice(n+1) are multiplied by half the spacing between slices,
respectively, and then summed. Using this approach it is assumed that the colon does not change in
shape inbetween the slices. This is a fair assumption since the spacing between slices is rather small
compared to the slice thickness.
31 of 50
CHAPTER 6. POST PROCESSING
Fig. 6.4: Axial view of a disk (blue) in a component (not shown). The interslice area is calculated assuming
that the colon does not change in shape in between the slices.
The area is then calculated as the length of the plane in each slice (planeLength) multiplied by the
pixel spacing, this is then multiplied with the slice thickness (ST) multiplied by the spacing between
slices (SBS):
totalArea =(planeLength(1) · pixelSpacing) · (ST + SBS · 0.5)
+ planeLength(n) · pixelSpacing) · (ST + SBS · 0.5)
+
n−1
X
planeLength(i) · pixelSpacing) · (ST + SBS)
i=2
with n being the number of slices the area appears in.
6.2.2
Difficulties Regarding Cross-sectional Areas
Since the degree of constipation depends on the volume of the colon content, mainly, initial attempts
on adding clinical value to the segmentation were based on geometrical parameters. However, the
segmentation resulted in components having more than one direction, e.g. belonging to both the
ascending and transverse part. From this incorrect direction vectors would be calculated which gives
rise to incorrect cross-sectional areas.
6.3
Diameter of Inner Joined Spheres
The diameter of the colon content could be a measure of the degree of constipation. The shape of
the colon content can range from simple and circular to very complex and not ideal in any way. That
complicates the task of estimating the diameter. However by assuming that the content in some degree
is tubular it could be possible to estimate a diameter.
6.3.1
Approach
A distance map provides the distances from all pixels (ones) in one component to the nearest zero
pixel. The peak points of this distance map would then compose radii of inner joined spheres. The
amount of peak points in a component depends on the complexity of the object. The more irregular
the more peak points are expected. Figure 6.5 below shows a distance map of the segmentation from
one series.
32 of 50
6.3. DIAMETER OF INNER JOINED SPHERES
Fig. 6.5: Left: Distance map of the segmented colon content. Right: Enlarged segment of the distance map.
Peak points are brightest resulting in a rough skeleton of the shapes. The values of the distance map (not
shown) are radii of inner joined spheres.
6.3.2
Difficulties Regarding Diameter of Inner Joined Spheres
The quality of this estimate truly depends on the shape of the colon content. Colon content is tubular
to some degree. The smooth muscle layer in the colon wall, Tunica muscularis, has an inner circular
layer. The surface of this layer is the myentheric plexus, which contributes to peristalsis, as described
in Section 1.1. This feature of the colon wall could contribute to the colon being predominantly circular. However, the segmented volumes can be far from circular in that the concavities are dominant.
This is shown in Figure 7.5, where the algorithm nicely segments the bright colon content.
This method tends to produce small values as estimates of the diameter, especially when the shape is
complex. The larger the value, the more reliable the estimate is. The Figure 6.6 shows four examples
of shapes of colon content and associated inner spheres. It is seen that the more circular the shape of
the colon content is, the more reliable the estimate is.
Fig. 6.6: From left to right: Different estimates of the diameter of the colon content. 1) A reliable estimate.
The diameter of the inner sphere is close to that of the colon content. 2) Less reliable estimates. The irregular
shape limits the size of the diameter of the inner sphere. 3) Not reliable estimates. 4) Less reliable estimates.
If this is the situation in one of the flexures, the reliability might increase.
In terms of adding clinical value to the algorithm, however, the δ largest values from distance maps
of the entire segmentation could indicate a distribution of the colon content. In some cases every of
the δ points will be located in the flexures. This will not add any information since the flexures, after
segmentation, often will be composed by both the ascending/descending and transverse part, hence
this will give large distances in the flexures.
33 of 50
CHAPTER 6. POST PROCESSING
Fig. 6.7: Since the components of the flexures can be composed of both ascending/descending and transverse
part, the majority of the δ points will be located in the flexures. In these cases the distance map will lack
reliability as an estimation of the diameter of the colon content.
6.4
Model-based Approach
A very simple model of the colon was made. It simply divided the colon content into three parts
corresponding to ascending, transverse and descending segments.
6.4.1
Approach
The approach is shown in Figure 6.8. By clicking in the two flexures of the colon three lines were
drawn, estimating approximately shape of the colon. A line connecting two selected points represented
the transverse part and two lines drawn vertically in the inferior direction of the image represented
the ascending and descending part of the colon. Angles formed in both of the colon corners were split
by axial lines, these two lines (division lines) intersected roughly in the center of the abdomen creating
a central point from which another division line was drawn vertically in the inferior direction. The
division lines now created three areas in the image plane, which could be used to cluster segmented
pixels into three parts of the colon.
Fig. 6.8: Estimating a rough distribution of the colon using lines created by choosing flexure points.
34 of 50
6.4. MODEL-BASED APPROACH
The division of colon content volume in three anatomical colon segments - ascending, transverse,
descending, could provide valuable information. Further work could include showing the areas with
largest accumulation of colon content. Knowing, that largest masses of content tend to cumulate in
the right side of the colon, especially in the connection of terminal ileum and cecum, could help in
evaluating results from segmentation.
However, due to heterogeneous manifestations of the locations and shapes of the colon content, this
model-based approach showed some difficulties.
6.4.2
Difficulties Regarding the Model Based Approach
A drawback to this approach was its sensitivity. According to the locations of the clicks of the mouse,
the total volume was divided into three partial volumes. To easily repeat a test of sensitivity, the two
points were always placed in the inner side of the colon flexure. This way the impact of placing flexure
points on the distribution of volume among the three main parts of the colon (ascending, transverse,
descending) was calculated. The test was repeated 10 times, with calculated results of percent of total
volume (TV) in each part of the colon in Table 6.1.
# measurement
1
2
3
4
5
6
7
8
9
10
µ
σ
% of TV in ascending
26.22
32.40
29.95
30.66
33.83
29.94
31.13
34.54
32.33
30.31
31.13
2.35
% of TV in transverse
54.27
50.41
51.05
48.80
47.85
49.13
49.27
46.78
49.35
50.77
49.77
2.05
% of TV in descending
19.51
17.19
19.00
20.54
18.31
20.94
19.60
18.68
18.32
18.92
19.10
1.10
Table 6.1: Percent of total volume in each part of colon. µ is the mean, and σ is the standard deviation.
Figure 6.9 shows a slice with colon content divided into three parts, differentiated by color. Mouseclicks in corner points are marked for each image. Since it is a 2D projection, it may be hard to
distinguish exactly the position of the flexure, while colon content tends to form larger masses in the
connection of the terminal ileum and cecum and also the flexures.
Fig. 6.9: Two dividings of colon according to different placements of corner points.
35 of 50
CHAPTER 6. POST PROCESSING
The image on the left results in dividing the colon into percentage of the total volume in ascending
33.18%, transverse 48.60% and descending 18.22%. The right image gives a result of 32.73% in
ascending, 45.21% in transverse and 22.06% in descending part of colon. The human colon is very
individual with various lengths and location of the single colon parts in the abdomen. In the example
shown in Figure 6.10 the transverse part of colon and its location would completely disable an accurate
use of this method, due to the placement of flexures above this segment. In this case, most volume of
the transverse part would be estimated as either ascending or descending.
Fig. 6.10: Problematic position of transverse segment for use of this method.
6.5
Downgrading the Complexity of the Measurements
The measurements requested by the physician seemed simple. However, in this case these rather
simple measurements required complex assumptions due to the nature of colon content. First of all
anatomically all individuals are different. Secondly, different eating habits and levels of exercise lead
to diverse manifestations of colon content. This complicates the statement of valid assumptions. The
following assumptions could ease the extraction of parameters for the physician, but as will follow,
the assumptions are invalid.
Colon Content is Present in All Segments
In some patients this assumption could be valid. However, even very constipated patients might have
the majority of colon content in the right half and only little in the left half, as seen in the results
Section 7.2. However, if such an assumption were to be made, more solutions to get the requested
measurements arise. Fitting to a simple model as suggested in section 6.4 could provide a valid
direction of the components leading to a possible calculation of the true cross-sectional area. On the
other hand many variations of the shape and location of colon exist, as shown in Figure 1.2.
Colon Content is Tubular
Colon content is tubular to some degree. The smooth muscle layer in the colon wall, Tunica muscularis, has an inner circular layer. The surface of this layer is the myentheric plexus, which contributes
to peristalsis, as described in Section 1.1. This feature of the colon wall could contribute to the colon
being predominantly circular. However, the segmented volumes can be far from circular in that the
concavities are dominant. This is shown in Figure 7.5, where the algorithm nicely segments the bright
36 of 50
6.6. ESTIMATION OF VOLUMES
colon content. Assuming that colon content is tubular would allow for valid estimates of the diameter
of the components based on a distance map or skeletonisation. The distance map could provide a
distribution of the diameters along a component. Skeletonisation, on the other hand, could provide
both a measure of lengths and a distribution of cross-sectional areas. The approach in section 6.2 was
not successful in the flexures, but using the skeletonisation might overcome that issue.
However, due to the diverse appearance of colon content, even simpler measures were aimed at. The
proposed algorithm calculates the total and two partial volumes. This description of volumes can
indicate a rough distribution, which might add clinical and robust value to the algorithm.
6.6
Estimation of Volumes
Previous attempts to add clinical value to the segmentation demonstrated limitations.
6.6.1
Approach
To calculate the total volume (TV) and some partial volumes, the following is needed.
1. Total voxels of each component
2. Volume of each component
The total voxels of each component is provided by the Matlab function regionprops and its method
Area.
When provided with the total number of voxels of each component the total volume of the component
is computed by the slice thickness, spacing between slices and the pixel spacing, which are read from
the dicomHeader.
In general the volume is calculated as the volume in the slices added to the volume between slices:
oneSliceVolume = volumeInSlice + volumeBetweenSlices
The volume in slice is computed from the slice thickness and the volume of the component in the slice,
i.e. the total amount of pixels multiplied with the pixel spacing.
To calculate the volume between slices the same assumptions are made as when calculating the area
between slices, thus the volume of the whole component is calculated as:
totalV olume =(numberOf P ixels(1) · pixelSpacing 2 ) · (sliceT hickness + spacingBetweenSlices · 0.5)
+ numberOf P ixels(n) · picelSpacing 2 ) · (sliceT hickness + spacingBetweenSlices · 0.5)
+
n−1
X
numberOf P ixels(i) · pixelSpacing 2 ) · (sliceT hickness + spacingBetweenSlices)
i=2
To find a distribution of the colon content the segmented content is divided into a left and a right
part by manually outlining the middle of the images. Afterwards a right and left volume is calculated
the same way as the total volume.
37 of 50
7
Results and Validation
To assess the performance of the proposed algorithm for segmentation of colon content, the accuracy
and reliability were validated. The accuracy was evaluated both quantitatively and qualitatively, while
the reliability was evaluated mainly quantitatively.
7.1
Results from Segmentation
Accurate classification of segmented tissues is important due to correct readings of the segmentation.
To assess the accuracy of a segmentation, knowledge about a ground truth, a voxel-wise gold standard,
can be useful. A physician was asked to manually outline regions of colon content on two MRI series
using the programme ImageJ (ROI Manager) as seen in Figure 7.1.
Fig. 7.1: Manual segmentation in ImageJ indicated by yellow.
These segmented regions were then imported to Matlab and compared to results from our automatic
segmentation using the Dice Similarity Coefficient and a qualitatively review of the segmentations as
described in the following section [30].
7.1.1
Test of Accuracy: Comparison to ground truth using Dice Similarity
Coefficients and a Qualitatively Validation
Dice similarity coefficients s were calculated based on the results of the algorithm and results from
two manual segmentations. With ground truth available, the s value is a measure of the accuracy
of a system, since it is a simple measure of spatial overlap between two sets. The value of s ranges
39
CHAPTER 7. RESULTS AND VALIDATION
from 0-1, with 0 indicating no spatial overlap and 1 indicating complete overlap between two sets of
segmentation results (X and Y ).
s=
2|X ∩ Y|
|X| + |Y|
(7.1)
s values were calculated based on the results of the algorithm and results from two manual segmentations (ground truth). Dice coefficient for the first series was s = 0.40 and s = 0.63 for the second.
Total volume segmented by manual outlining was 1 l in first series and 1.4 l in second series. Volume
estimation using the proposed algorithm gave results 0.8799 l and 0.6772 l respectively. When comparing s with estimated volumes, it is evident that in the first series (small s value, similar volume)
the algorithm segments a lot of false positives. In the second case (larger s value, greater difference in
volume) by visualising results from both segmentations, it is seen that the automatic has fewer false
positives, but also does not extract the dark areas as in the manual segmentation. Figures 7.2 and
7.3 show how the changes in s values slice by slice in both segmented series. In series 4 a rapid drop
occurs when manual segmentation outlined very dark areas and borders of colon content, see fig. 7.6.
Results from series 5 are more consistent, s ranging from 0.6–0.8 in the central slices, with drops in
the few first and last slices.
1
0.9
Dice similarity coefficient
0.8
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0
10
20
30
Slice number
40
50
60
Fig. 7.2: s values in each slice, series 4
0.9
0.8
Dice similarity coefficient
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
5
10
15
20
25
30
Slice number
35
40
45
Fig. 7.3: s values in each slice, series 5
40 of 50
50
7.1. RESULTS FROM SEGMENTATION
In both series, ground truth contained also areas of dark intensities, where presence of colon content is
presumed by clinical experience of physician. Contours of the colon content were outlined, while it is
very hard and time demanding, requiring zooming in, to outline the gaps and holes of the content itself.
This is of course very challenging for any automatic algorithm. Colon content is presented as bright
intensities, with drops in the boundaries towards surrounding tissue. Including a wide span of pixel
intensities would at the same time cause a certain amount of leakage. In one series the region growing
algorithm segmented also areas of the terminal ileum, which had similar (and higher) intensities. In
a few slices the ileum (marked by yellow arrow) anatomically connects with the cecum, thus forming
a large connected component, as shown in Figure 7.4, recognised by the algorithm as colon content.
This creates a large false positive area, rapidly decreasing the s value.
Fig. 7.4: Bright intensity terminal ileum segmented as part of colon content
s-values can be calculated for each slice separately, allowing us to view specific parts of the image,
where results from automatic segmentation best agree with those from manual outlining. Figure 7.5
and 7.6 show the outlines of regions segmented manually (green) and using region growing algorithm
(red). Clearly can be seen, that red lines follow the bright edges of regions, while the green lines take
into account the whole bright region including darker intervening areas forming a rounder contour.
Fig. 7.5: Manual (green) and automatic (red) segmentation resulting in s = 0, 74
In the first Figure 7.5, the segmented region from automatic segmentation is similar to the manual,
in this case is s = 0, 74. The image nicely represents the difference between contours of colon content
and the visible parts represented by higher intensity voxels.
41 of 50
CHAPTER 7. RESULTS AND VALIDATION
Fig. 7.6: Manual (green) and automatic (red) segmentation resulting in s = 0, 39
In the second Figure 7.6 on the other hand a large difference between the algorithm result and result
from what the physician outlined as part of colon content from clinical experience.
Summed up, during this qualitative validation three major areas were found in which the segmented
areas differed from ground truth:
1. The proposed algorithm causes leakage in some series (false positives)
2. Ground truth includes a wider span of intensities.
3. Ground truth has a inconsistently tendency in some slices.
7.2
Results from Volume Estimation
Results from calculation of total volume of colon content, further divided into left and right according to
a manually selected center point of the patient are shown in Table 7.1. Volumes from both automatic
and manual segmentation are mentioned. As mentioned in previous section, manual segmentation
included whole contours of colon content with dark surrounding areas which the algorithm can not
segment. Therefore larger volumes in both manually segmented series (4* and 5*) were estimated
compared to results from automatic segmentation.
Data Set
#1
#2
#3
#4
# 4*
#5
# 5*
Total Volume [l]
0.8667
0.0775
0.6542
0.8799
1.0089
0.6772
1.4
Right Volume [l]
0.5091
0.0182
0.6223
0.5881
0.7935
0.4493
0.8835
Left Volume [l]
0.3575
0.0592
0.0319
0.2918
0.2154
0.2279
0.5156
Table 7.1: Estimation of volumes. *) Manual segmentation, ground truth.
Difference in ground truth to results from algorithm are also reflected in the calculated s values for
both series s = 0.40 and s = 0.63, respectively. Results also show, that the right side tends to contain
a larger volume of colon content. This could be due to lack of motility in the colon of constipated
patients and the fact, that content tends to accumulate in the cecum (where content enters the colon
42 of 50
7.2. RESULTS FROM VOLUME ESTIMATION
after leaving the terminal ileum) and flexures of the colon. An illustration of this situation is shown
in Figure 7.7. Long term constipation on the other hand involves the entire length of the colon.
Fig. 7.7: Total volume segmented from data set # 5
7.2.1
Test of Reliability: Comparing Consecutive Scans of one Subject
Reliability refers to the reproducibility of the results from a test [31], which has been reported as an
important measure of confidence in segmentations [13]. Better reliability implies better detection of
changes in measurements. The standard deviation is a simple measure of variation from scan to scan.
To obtain a trustworthy estimate of the reliability many scans of many subjects are required. However,
due to practicalities, only 3 scans of 2 subjects were available. Unfortunately only the scans from one
subject were useful, since the scans from the other subject barely had any visible colon content.
Due to a different magnetic field strength, the algorithm was adjusted concerning two points. Firstly,
the global thresholds for identifying candidate initial seed points were manually selected for each
series. The final initial seed points were chosen as described in Section 2.2. Secondly, the α threshold
parameter was identical with that from the 1.5 T scanner, while the β threshold parameter was tuned
to the 3 T scanner. Hence, the following parameters were used: α = 4 and β = 3
Results from the 3 volume estimations are shown in Table 7.2 with mean (µ) and standard deviation
(σ) for the total volume and left and right volumes.
Total Volume [l]
Right Volume [l]
Left Volume [l]
1
0.5625
0.5460
0.0164
2
0.5400
0.5282
0.0118
3
0.5157
0.4893
0.0264
µ
0.5394
0.5212
0.0132
σ
0.0234
0.0290
0.0075
Table 7.2: Test of reliability.
Visible consistency in the resulting volumes can conclude and indicate a certain level of reliability
while segmenting consequent scans from the same subject. The volume ratios are preserved for both
left and right volume. Figure 7.8 shows regions where region growing leaked into surrounding tissue,
forming a false positive result, which added up to the total volume and subsequently either left or
right volume.
43 of 50
CHAPTER 7. RESULTS AND VALIDATION
Fig. 7.8: A leak in the right side of the patient was seen in all 3 series used to test the reliability.
7.2.2
Computational Times
What is most important, accuracy or speed? In surgical planning, accuracy might come first. However,
regarding assessment of colon content or treatment follow-ups both speed and accuracy are important,
and the crucial factor could be the level of user intervention. The physician might want a accurate
result, but he or she does not have time for higher levels of intervention. Balancing accuracy with the
level of intervention is really the task. The proposed algorithm is semi-automatic, and all the physician
has to do is to review if only true positives are present. In the following computational times for the
algorithm are presented. A comparison with the time taken to do a manual segmentation is made.
The total computational time for each series and the average computational time are shown in Table
7.3.
Time [min:s]
1
35:24
2
24:17
3
46:05
4
36:03
5
30:23
µ
34:26
Table 7.3: Total computational times for each series.
The two manual segmentations took 41:20 and 54:27 min, respectively, with a average of µman = 47:54
min.
In the automatic segmentation attention from the physician is only needed while drawing the regions
of interest for selection of initial seed points.
44 of 50
8
Discussion
This paper proposes a novel method for semi-automatic segmentation and quantification of colon
content from MRI based on region growing. The quantification of colon content relies on a well
executed segmentation of the colon content. The algorithm has proved successful in segmenting bright
colon content allowing for a robust quantified representation of the colon content.
8.1
Segmentation
Semi-automatic quantification of colon content from MRI is proposed using a region growing algorithm. The challenge is in segmenting disconnected components with irregular shapes having no or
limited knowledge about their position in the abdomen. Series also differ in intensity representation of
colon content, making it necessary to adjust parameters of our segmentation algorithm. The proposed
algorithm for segmentation of colon content from MRI, is dependent on a minor user intervention.
The operator has to draw a region of interest for planting of seeds. The level of user intervention
compromises the accuracy of the segmentation. In some scans artifacts and unwanted tissues have
similar or even higher intensities than that of the colon content. When drawing regions of interest the
unwanted areas are excluded resulting in an increased number of false positives in the result.
The proposed algorithm utilises region growing to expand the initially segmented areas used as seed
points. The growing of these regions applies intensity differences and gradient differences between
colon content and surrounding tissue including air in the colon lumen. Hence, the segmentation
requires colon content to be distinguishable from neighboring tissue. This holds true for both the
placement of seed points which is based on a fuzzy c-means clustering and the first of two growing
criteria in the region growing. The results show that the algorithm is successful in segmenting the
desired colon content. However, in MRI series with a low contrast, the algorithm might leak. This
will result in an incorrect classification of tissues, and hence an over-segmentation. To hinder this a
second growing criterion is formed. This relies on the gradient magnitudes of the images. The colon
content tends to consist of areas of several high gradient peak areas. This property is used to control
the region growing by restricting the growing into these areas. Still, some over-segmentation can occur
due to closely located tissues that are similar to colon content in intensities.
The algorithm relies on universal segmentation criteria. This however results in some parts of the
segmentation leaking to surrounding tissue areas. The present results demonstrate that region growing
is capable of segmenting the bright colon content. Differences occur between automatic segmentation
and the ground truth obtained by manual segmentation performed by a physician. Differences are due
to the core of the region growing algorithm being able to segment bright colon content from darker
surrounding matter. Manual segmentation on the other hand takes into account prior anatomical
knowledge of the human colon, therefore outlining contours of the colon content, disregarding the
concavities at the edges of the bright matter. In general, manual outlining segmented a larger volume
of colon content. Automatic segmentation includes false positive bright tissue (little intestine) as colon
content and demonstrates areas of leakage towards the abdomen wall and other soft tissues.
45
CHAPTER 8. DISCUSSION
8.2
Automatic Versus Manual Segmentation
The developed algorithm for segmentation of colon content from MRI, is dependent on a minor user
intervention. The operator has to draw a region of interest, in which the algorithm plants seeds for
region growing. This was chosen in order to increase the amount of false positives in the final segmentation. Without this implementation parts of the liver and artifacts were included. Wyatt et al.
have developed a method for automatic segmentation of the colon for virtual colonoscopy [32]. They
took advantage of the high contrast in the gas/tissue interface in helical computed tomography (CT)
images. The gas-filled lumen appears dark compared to the neighboring tissue. Prior knowledge of
colon geometry was used to localise seed points. They achieved a complete but not highly accurate
representation of the colon.
Backmana et al. suggest that segmentation algorithm for use in medical domains should only have a
minor user intervention [33]. The operators which skills are specialised into the medical field should not
be burdened with modifying parameters, selecting thresholds, or other tasks related to prior knowledge
of the algorithm and application field. They also state that current supervised segmentation algorithm
are superior to the unsupervised algorithms in accuracy, which indicates that the degree of operator
intervention and accuracy of segmentation should be compromised. The algorithm in the work of
Backmana et al. requires one operator intervention, the selection of a single seed voxel. Mancas et al.
developed a semi-automatic region-growing segmentation method in which the operator also has to
select a seed inside the region of interest [34]. Their method is based on both a topology distance map
and a probability map, and when the structure in the image is homogeneous according to a criterion,
they are able to segment the spinal cord.
From this review, the minor operator intervention required in present project is considered as acceptable.
8.3
Sensitivity to Selection of Seed Point
As for now the algorithm is sensitive to the physician picking the right seed points for the region
growing. When drawing a region of interest around a component, the whole 3D component is selected
as seed points, that is choosing a component that are not only true positive will lead to an oversegmentation. A way to overcome this problem could be to draw region of interest around a component,
that both are colon content and not, when it appears on a slice as only true positive, letting this 2D
component be selected as seed points. This will make it possible to choose more seed points in colon
content. The second threshold criterion will hinder leakage into the tissue that was connected with
the outlined 2D component from FCM. However, utilising a manual intervention will always indicate
a certain degree of sensitivity and subjectivity.
8.4
Optimising the Algorithm for Clinical Use
Even though the test of computational times revealed quite good results compared to a manual segmentation. Not only do the times taken to do the segmentations differ, hence µaut = 34:26 min and
µman = 47:54 min., the level of attention from the user also differs widely. A few initiatives might
decrease the computational time even more. Matlab does not fully compile its for-loop, which results
in for-loops tending to be slow. By vectorising the code, the computational time might decrease. Another issue heavily affecting the computational time is the FCM clustering. To locate 5 cluster centers
demands more than 100 iterations. By integrating prior knowledge the number of needed iterations
might decrease resulting in a faster algorithm.
The perspective of the proposed algorithm is to provide an objective measurement for quantification of
colon content. This could support the clinical assessment of constipation. The method and its results
are very promising as new objective measurements for physicians. Future work targets an improved
segmentation, possibly by including additional image features, fine tuning of parameters, and reducing
46 of 50
8.4. OPTIMISING THE ALGORITHM FOR CLINICAL USE
the sensitivity of the manual intervention. Also, further investigations of limitations of the proposed
method and further comparison with current clinical practice should be conducted.
47 of 50
Bibliography
[1]
Committee for Medicinal Products for Human Use. Concept paper on the need of a guideline for
clinical investigation of medicinal products for the treatment of chronic constipation. European
Medicines Agency. 2012;.
[2]
World Gastroenterology Organisation. World Gastroenterology Organisation Practice Guidelines:
Constipation. 2007;.
[3]
Lunn S. The Colon; 2012. Available from: http://www.sharonlunn.co.uk/.
[4]
Cummings JH, Hill MJ, Jenkins DJA, Pearson JR, Wiggins HS. Changes in fecal compostion and
colonic function due to cereal fiber. The American Journal of Clinical Nutrition. 1976;29:1468–
1473.
[5]
Cummings JH, Macfarlane GT. The control and consequences of bacterial fermentation in the
human colon. Journal of Applied Bacteriology. 1991;70:443–459.
[6]
Geneser F. Histologi. 1st ed.; 2008.
[7]
Harris T. The X-Ray Machine. 2012;Available from: http://science.howstuffworks.com/
x-ray2.htm.
[8]
Alessandro DMD. What Causes Recurrent Abdominal Pain?; 2012. Available from: http://www.
pediatriceducation.org/2004/12/27/.
[9]
Stern DP. Educational Web Sites on Astronomy, Physics, Spaceflight and the Earth’s Magnetism;
2012. Available from: http://www.phy6.org.
[10] Rinck P. Magnetic Resonance in Medicine. The Basic Textbook of the European Magnetic
Resonance Forum.; 2012. Available from: http://www.magnetic-resonance.org/MagRes%
20Chapters/03_10.htm.
[11] MedicalExpo. The Virtual Medical Exhibition; 2012. Available from: http://www.medicalexpo.
com/product-manufacturer/toshiba-mri-coil-759-22.html.
[12] MRI Sequences;Available from: http://www.bioc.aecom.yu.edu/labs/girvlab/nmr/course/
COURSE_2012/chapter5.pdf.
[13] Clarke LP, Velthuizen RP, Camacho MA, Heine JJ, Vaidyanathan M. MRI Segmentation: Methods and Applications. Magnetic Resonance Imaging. 1995;13.
[14] Gray level co-occurrence matrix;Available from: http://www.mathworks.se/help/images/ref/
graycomatrix.html.
[15] Jan J. Medical Image Processing, Reconstruction and Restoration; 2005.
[16] Connell AM, Hilton C, Irvine G, Lennard-Jones JE, Misiewicz JJ. Variation of Bowel Habit in
Two Population Samples. British Journal of Medicine and Medical Research. 1965;.
49
BIBLIOGRAPHY
[17] Johanson JF.
Definitions and Epidemiology of Constipation. 2007;Available from:
http://www.researchgate.net/publication/226057621_Definitions_and_Epidemiology_
of_Constipation.
[18] Rostgaard J, Tranum-Jensen J, Qvortrup K, Holm-Nielsen P. Hovedets, halsens og de indre
organers anatomi. 10th ed.; 2006.
[19] Martini FH, Nath JL. Fundamentals of anatomy & physiology; 2009.
[20] Panchal SJ, Müller-Schwefe P, Wurzelmann JI. Opioid-induced bowel dysfunction: prevalence,
pathophysiology and burden. International Journal of Clinical Practice. 2007;61, 7:1181–1187.
[21] Guyton A, Hall J. Textbook of Medical Physiology. 11th ed.; 2006.
[22] Thompson AC, Attwood DT, Gullikson EM, Howells MR, Kortright JB, Robinson AL, et al.
X-Ray Data Booklet. 2009;Available from: http://xdb.lbl.gov/.
[23] Bushberg JT, Seibert JA, Edwin M Leidholdt j, Boone JM. The Essential Physics of Medical
Imaging. 2nd ed.; 2002.
[24] Carlton RR, Adler AM. Principles of Radiographic Imaging. 4th ed.; 2006.
[25] Reuchlin-Vroklage M, Bierma-Zeinstra S, Benninga MA, Berger MY. Diagnostic Value of Abdominal Radiography in Constipated Children. Archives of Pediatrics & Adolescent Medicine.
2005;159.
[26] Moylan S, Armstrong J, Diaz-Saldano D, Saker M, Yerkes EB, Lindgren BW. Are Abdominal
X-Rays a Reliable Way to Assess for Constipation? The Journal Of Urology. 2010;184.
[27] Bongers MEJ, Voskuijl WP, van Rijn RR, Benninga MA. The value of the abdominal radiograph
in children with functional gastrointestinal disorders. European Journal of Radiology. 2006;59.
[28] Health Physics Society. Radiation Exposure From Medical Diagnostic Imaging Procedures. Health
Physics Society Fact Sheet. 2010;.
[29] Andrzej Materka MS. Texture Analysis Methods - A Review. 1998;.
[30] Dice LR. Measures of the Amount of Ecologic Association Between Species. Ecology. 1945;26:297–
302.
[31] Hopkins WG. Measures of Reliability in Sports Medicine and Science. The American Journal of
Sports Medicine. 2000;30.
[32] Wyatt CL, Ge Y, Vining DJ. Automatic segmentation of the colon for virtual colonoscopy.
Computerized Medical Imaging and Graphics. 2000;24.
[33] Backmana NJ, Whitney BW, Furstc JD, Raicuc DS. A prioritized and adaptive approach to
volumetric seeded region growing using texture descriptors. Electronic Imaging. 2006;6065.
[34] Mancas M, Gosselin B, Macq B. Segmentation using a region-growing thresholding. In: Image
Processing: Algorithms and Systems; 2006. p. 388–398.
50 of 50