stud/IV/mj-msc.tex

261 lines
8.6 KiB
TeX
Raw Normal View History

2021-04-03 12:37:59 +03:00
\documentclass[a4paper]{report}
2021-03-30 17:37:13 +03:00
\usepackage[T1]{fontenc}
2021-04-03 07:36:24 +03:00
%\usepackage[bitstream-charter]{mathdesign}
2021-03-30 17:37:13 +03:00
\usepackage[english]{babel}
\usepackage[utf8]{inputenc}
\usepackage{a4wide}
\usepackage{csquotes}
\usepackage[maxbibnames=99,style=authoryear]{biblatex}
\usepackage[pdfusetitle]{hyperref}
\usepackage{enumitem}
\usepackage[toc,page,title]{appendix}
\addbibresource{bib.bib}
\usepackage{caption}
\usepackage{subcaption}
\usepackage{gensymb}
\usepackage{varwidth}
\usepackage{tabularx}
\usepackage{float}
\usepackage{tikz}
\usepackage{minted}
\usetikzlibrary{er,positioning}
\definecolor{mypurple}{RGB}{117,112,179}
\input{version}
\newcommand{\DP}{Douglas \& Peucker}
\newcommand{\VW}{Visvalingam--Whyatt}
\newcommand{\WM}{Wang--M{\"u}ller}
2021-04-01 15:46:34 +03:00
\newcommand{\MYTITLE}{Cartographic Generalization of Lines using free software (example of rivers)}
\newcommand{\MYAUTHOR}{Motiejus Jakštys}
2021-03-30 17:37:13 +03:00
2021-04-01 15:46:34 +03:00
\title{\MYTITLE}
\author{\MYAUTHOR}
\date{\VCDescribe}
2021-03-30 17:37:13 +03:00
\begin{document}
2021-04-01 15:46:34 +03:00
\begin{titlepage}
\begin{center}
\includegraphics[width=0.4\textwidth]{vu}
\huge
\textbf{\MYTITLE} \\[4ex]
\LARGE
\textbf{\MYAUTHOR} \\[8ex]
\vfill
A thesis presented for the degree of\\
Master in Cartography \\[3ex]
\large
\VCDescribe
\end{center}
\end{titlepage}
2021-03-30 17:37:13 +03:00
\begin{abstract}
\label{sec:abstract}
Current open-source line generalization solutions have their roots in
mathematics and geometry, and are not fit for natural objects like rivers
2021-04-01 09:42:08 +03:00
and coastlines. This paper discusses our implementation of {\WM} algorithm
2021-03-30 17:37:13 +03:00
under and open-source license, explains things that we would had
appreciated in the original paper and compares our results to different
generalization algorithms.
\end{abstract}
\newpage
\tableofcontents
\listoffigures
\newpage
2021-04-03 12:37:59 +03:00
\chapter{Introduction}
2021-03-30 17:37:13 +03:00
\label{sec:introduction}
2021-04-01 09:42:08 +03:00
When creating small-scale maps, often the detail of the data source is greater
than desired for the map. This becomes especially acute for natural features
that have many bends, like coastlines, rivers and forest boundaries.
To create a small-scale map from a large-scale data source, these features need
to be generalized: detail should be reduced. However, while doing so, it is
important to preserve the "defining" shape of the original feature, otherwise
the result will look unrealistic.
For example, if a river is nearly straight, it should be nearly straight after
generalization, otherwise a too straightened river will look like a canal.
Conversely, if the river is highly wiggly, the number of bends should be
reduced, but not removed.
Generalization problem for other objects can often be solved by other
non-geometric means:
\begin{itemize}
\item Towns and cities can be filtered and generalized by number of
inhabitants.
\item Roads can be eliminated by the road length, number of lanes, or
classification of the road (local, regional, international).
\end{itemize}
Natural line generalization problem can be viewed as having two competing
goals:
\begin{itemize}
\item Reduce detail by removing or simplifying "less important" features.
\item Retain enough detail, so the original is still recognize-able.
\end{itemize}
Given the discussed complexities, a fine line between under-generalization
(leaving object as-is) and over-generalization (making a straight line) must be
found. Therein lies the complexity of generalization algorithms: all have
different trade-offs.
2021-04-03 12:37:59 +03:00
\chapter{Literature review}
2021-04-01 09:42:08 +03:00
\label{sec:literature-review}
A number of cartographic line generalization algorithms have been researched.
The "classical" ones are {\DP} and {\VW}.
2021-04-03 12:37:59 +03:00
\section{{\DP} and {\VW}}
2021-04-01 09:42:08 +03:00
\cite{douglas1973algorithms} and \cite{visvalingam1993line} are "classical"
line generalization computer graphics algorithms. They are relatively simple to
implement, require few runtime resources. Both of them accept only a single
2021-04-01 15:32:16 +03:00
parameter, based on desired scale of the map, which makes them very simple to
adjust for different scales.
2021-04-01 09:42:08 +03:00
2021-04-01 15:32:16 +03:00
Both algorithms are part of PostGIS, a free-software GIS suite:
\begin{itemize}
\item \cite{douglas1973algorithms} via
\href{https://postgis.net/docs/ST_Simplify.html}{PostGIS Simplify}.
2021-04-01 09:42:08 +03:00
2021-04-01 15:32:16 +03:00
\item \cite{visvalingam1993line} via
\href{https://postgis.net/docs/ST_SimplifyVW.html}{PostGIS SimplifyVW}.
\end{itemize}
2021-04-01 09:42:08 +03:00
2021-04-01 15:32:16 +03:00
Since both algorithms produce jagged output lines, it is worthwhile to process
those through a widely available \cite{chaikin1974algorithm} smoothing
algorithm via \href{https://postgis.net/docs/ST_ChaikinSmoothing.html}{PostGIS
ChaikinSmoothing}.
2021-04-01 09:42:08 +03:00
2021-04-01 15:32:16 +03:00
Even though {\DP} and {\VW} are simple to understand and computationally
efficient, they have serious deficiencies for cartographic natural line
generalization.
<TODO: expand on deficiencies>
2021-04-03 12:37:59 +03:00
\section{Modern approaches}
2021-04-01 15:32:16 +03:00
Due to their simplicity and ubiquity, {\DP} and {\VW} have been established as
2021-04-01 19:30:20 +03:00
go-to algorithms for line generalization. During recent years, alternatives
have emerged. These modern replacements fall into roughly two categories:
2021-04-01 09:42:08 +03:00
2021-03-30 17:37:13 +03:00
\begin{itemize}
\item Cartographic knowledge was encoded to an algorithm (bottom-up
approach). One among these are \cite{wang1998line}.
2021-04-01 09:42:08 +03:00
\item Mathematical shape transformation which yields a more cartographic
result. E.g. \cite{jiang2003line}, \cite{dyken2009simultaneous},
\cite{mustafa2006dynamic}, \cite{nollenburg2008morphing}.
2021-03-30 17:37:13 +03:00
\end{itemize}
2021-04-01 15:32:16 +03:00
Authors of most of the aforementioned articles have implemented the
generalization algorithm, at least to generate the visuals in the articles.
However, I wasn't able to find code for any of those to evaluate with my
desired data set, or use as a basis for my own maps. \cite{wang1998line} is
available in a commercial product.
2021-04-01 16:13:29 +03:00
Lack of robust openly available generalization algorithm implementations poses
a problem for map creation with free software: there is not a similar
2021-04-01 15:32:16 +03:00
high-quality simplification algorithm to create down-scaled maps, so any
cartographic work, which uses line generalization as part of its processing,
will be of sub-par quality. We believe that availability of high-quality
open-source tools is an important foundation for future cartographic
experimentation and development, thus it it benefits the cartographic society
as a whole.
2021-03-30 17:37:13 +03:00
2021-04-03 12:37:59 +03:00
\chapter{Methodology}
2021-04-01 09:42:08 +03:00
\label{sec:methodology}
2021-03-30 17:37:13 +03:00
2021-04-01 16:24:47 +03:00
The original \cite{wang1998line} leaves something to be desired for a practical
implementation: it is not straightforward to implement the algorithm from the
paper alone.
In this paper we describe {\WM} in a detail that is more useful for algorithm:
each section will be expanded, with more elaborate and exact illustrations for
every step of the algorithm.
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Automated tests}
2021-04-01 16:13:29 +03:00
2021-04-01 16:24:47 +03:00
As part of the algorithm realization, an automated test suite has been
developed. Shapes to test each function have been hand-crafted and expected
results have been manually calculated. The test suite executes parts of the
algorithm against a predefined set of geometries, and asserts that the output
matches the resulting hand-calculated geometry.
2021-04-01 20:54:20 +03:00
The full set of test geometries is visualized in figure~\ref{fig:test-figures}
on page~\pageref{fig:test-figures}. The figure includes arrows depicting
line direction.
2021-04-01 19:30:20 +03:00
\begin{figure}[H]
\centering
\includegraphics[width=\linewidth]{test-figures}
2021-04-01 20:54:20 +03:00
\caption{line geometries for automated test cases}
2021-04-01 19:30:20 +03:00
\label{fig:test-figures}
\end{figure}
2021-04-01 16:24:47 +03:00
The full test suite can be executed with a single command, and completes in a
few seconds. Having an easily accessible test suite boosts confidence that no
2021-04-03 12:37:59 +03:00
unexpected bugs have snug in while modifying the algorithm.
2021-04-03 12:37:59 +03:00
\chapter{Description of the implementation}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Definition of a Bend}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Gentle Inflection at End of a Bend}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Self-line Crossing When Cutting a Bend}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Attributes of a Single Bend}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Shape of a Bend}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{The Context of a Bend: Isolated and Similar Bends}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Elimination Operator}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Combination Operator}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\section{Exaggeration Operator}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\chapter{Program Implementation}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\chapter{Results of Experiments}
2021-04-01 16:13:29 +03:00
2021-04-03 12:37:59 +03:00
\chapter{Conclusions}
2021-03-30 17:37:13 +03:00
\label{sec:conclusions}
2021-04-03 12:37:59 +03:00
\chapter{Related Work and future suggestions}
2021-03-30 17:37:13 +03:00
\label{sec:related_work}
\printbibliography
\begin{appendices}
2021-04-03 12:37:59 +03:00
\chapter{Code listings}
2021-03-30 17:37:13 +03:00
2021-04-01 16:13:29 +03:00
We strongly believe in the ability to reproduce the results is critical for any
scientific work. To make it possible for this paper, all source files and
2021-04-01 17:27:49 +03:00
accompanying scripts have been attached to the PDF. To re-generate this
document and its accompanying graphics, run this script (assuming name of
2021-04-02 06:22:10 +03:00
this document is {\tt mj-msc-full.pdf}):
2021-03-30 17:37:13 +03:00
2021-03-31 15:12:42 +03:00
\inputminted[fontsize=\small]{bash}{extract-and-generate}
2021-03-30 17:37:13 +03:00
2021-04-02 06:27:32 +03:00
This was tested on Linux Debian 11 with upstream packages only.
2021-03-30 17:37:13 +03:00
\end{appendices}
\end{document}