22e2353788161c7a68c38de0ecb46ad4688ecbab
[phd-thesis.git] / introduction / introduction.tex
1 \documentclass[../thesis.tex]{subfiles}
2
3 \begin{document}
4 \ifSubfilesClassLoaded{
5 \pagenumbering{arabic}
6 }{}
7 \mychapter{chp:introduction}{Introduction}
8 %\setlength{\epigraphwidth}{.5\textwidth}%
9 %\epigraphhead[30]{
10 % A \textbf{rhapsody} in music is a one-movement work that is episodic yet integrated, free-flowing in structure, featuring a range of highly contrasted moods, colour, and tonality. An air of spontaneous inspiration and a sense of improvisation make it freer in form than a set of variations.
11 %}{%
12 % Wikipedia~\cite{wikipedia_contributors_rhapsody_2022}
13 %}%
14 The sheer number of connected devices around us is increasing exponentially.
15 First and foremost, these devices are driven by software.
16 This thesis is about \ldots
17 \todo[inline]{introduction}
18
19 \section{Internet of Things}
20 \todo[inline]{add more citations and rewrite to make modern}
21 The \gls{IOT} is growing rapidly and it is changing the way people and machines interact with the world.
22 While the term \gls{IOT} briefly gained interest around 1999 to describe the communication of \gls{RFID} devices~\todo{cite}, it probably already popped up halfway the eigthies in a speech by Peter T. Lewis~\cite{peter_t_lewis_speech_1985}.
23
24 \begin{quote}
25 The \acrlong{IOT}, or \acrshort{IOT}, is the integration of people, processes and technology with connectable devices and sensors to enable remote monitoring, status, manipulation and evaluation of trends of such devices.
26 \end{quote}
27
28 CISCO states that the \gls{IOT} only started when there where as many connected devices as there were people on the globe, i.e.\ around 2008~\cite{evans_internet_2011}.
29 Today, the \gls{IOT} is the term for a system of devices that sense the environment, act upon it and communicate with each other and the world.
30 These connected devices are already in everyone's household in the form of smart electricity meters, smart fridges, smartphones, smart watches, home automation, \etc.
31
32 When describing \gls{IOT} systems, a layered, or tiered architecture is often used to compartmentalize the technology.
33 The number of tiers heavily depends on the complexity of the model but for the intents and purposes of the thesis, the four layer architecture shown in \cref{fig:iot-layers} is used.
34
35 \begin{figure}[ht]
36 \centering
37 \includestandalone{iot-layers}
38 \caption{The four layered \gls{IOT} architecture.}%
39 \label{fig:iot-layers}
40 \end{figure}
41
42 \begin{description}
43 \item[Presentation layer] This presentation layer provides the interface between the user and the \gls{IOT} application.
44 For example, in home automation, this is often implemented as a web interface or a mobile app.
45 \item[Application layer]
46 Servers often power the application layer, by providing the data storage,
47 One of its goals is to provide the \gls{API}, interfaces and data storage
48 \item[Network layer] connects all layers together.
49 In many applications this may be implemented using conventional networking techniques such as WiFi or wire networks.
50 However, recently, networks tailored to the needs of \gls{IOT} applications have been increasingly popular such as \gls{BLE}, LoRa, ZigBee or LTE-M.
51 \item[Perception layer], also called edge layer, collects the data, interacts with the environment and consists of (edge) devices equipped with various sensors and actuators.
52 As a special type of device, it may also contain a \gls{SN}.
53 A \gls{SN} is a collection of sensors connected by a mesh network or central hub.
54 \end{description}
55
56 While the number of devices seems to be growing exponentially fast, programming \gls{IOT} systems remains difficult as there is a lot of semantic friction~\cite{ireland_classification_2009}.
57
58 The devices are a large heterogeneous collection of different platforms, protocols and languages resulting in impedance problems or semantic friction between layers~\cite{ireland_classification_2009}.
59
60 Additionaly, the perception layer often is a heterogeneous collections of microcontrollers as well, each having their own peculiarities, language of choice and hardware interfaces.
61 The hardware needs to be cheap, small-scale and energy efficient.
62 As a result, the \glspl{MCU} used to power these devices do not have a lot of computational power, a soup\c{c}on of memory, and little communication bandwidth.
63 Typically the devices do not run a full fledged \gls{OS} but a compiled firmware.
64 This firmware is often written in an imperative language that needs to be flashed to the program memory.
65 It is possible to dynamically send the program to the program memory using \gls{OTA} programming~\cite{baccelli_scripting_2018,baccelli_reprogramming_2018}.
66 Program memory typically is flash based and only lasts a couple of thousand writes before it wears out.
67 While devices are getting a bit faster, smaller, and cheaper, they keep these properties to an extent.
68 The properties of the device greatly reduce the flexibility for dynamic systems where tasks are created on the fly, executed on demand and require parallel execution.
69 These problems can be mitigated by dynamically sending code to be interpreted to the \gls{MCU}.
70 With interpretation, a specialized interpreter is flashed in the program memory once that receives the program code to execute at runtime.
71
72 %weiser_computer_1991
73 \section{Domain-specific languages}
74 % General
75 Programming languages can be divided up into two categories: \glspl{DSL} and \glspl{GPL}~\cite{fowler_domain_2010}.
76 Where \glspl{GPL} are not made with a demarcated area in mind, \glspl{DSL} are tailor-made for a specific domain.
77 Writing idiomatic domain-specific code in an \gls{DSL} is easy but this may come at the cost of the \gls{DSL} being less expressive to an extent that it may not even be Turing complete.
78 \Glspl{DSL} come in two main flavours: standalone and embedded\footnote{Also called external and internal respectively.} of which \glspl{EDSL} can again be classified into heterogeneous and homogeneous languages (see \cref{fig:hyponymy_of_dsls} for this hyponymy).
79
80 \begin{figure}[ht]
81 \centering
82 \includestandalone{hyponymy_of_dsls}
83 \caption{Hyponymy of \glspl{DSL} (adapted from~\cite[pg.\ 2]{mernik_extensible_2013})}%
84 \label{fig:hyponymy_of_dsls}
85 \end{figure}
86
87 \glspl{DSL} where historically created as standalone languages which means all the machinery is developed solely for the language.
88 The advantage of this approach is that the language designer is free to define the syntax and type system of the language as they wish, not being restricted by any constraint.
89 Unfortunately it also means that they need to develop a compiler or interpreter for the language to be usable making standalone \glspl{DSL} costly to create.
90 Examples of standalone \glspl{DSL} are regular expressions, make, yacc, XML, SQL, \etc.
91
92 A radically different approach is embeddding the \gls{DSL} in a host language, i.e.\ \glspl{EDSL}~\cite{hudak_modular_1998}.
93 By defining the language as constructs in the host language, much of the machinery is inherited and the cost of creating embedded languages is very low.
94 There are however two sides to the this coin.
95 If the syntax of the host language is not very flexible, the syntax of the \gls{DSL} may become clumsy.
96 Furthermore, errors shown to the programmer may be larded with host language errors, making it difficult for a non-expert of the host language to work with the \gls{DSL}.
97
98 Tratt applied a notion from metaprogramming~\cite{sheard_accomplishments_2001} to \glspl{EDSL} to define homogeneity and heterogeneity of \glspl{EDSL} as follows~\cite{tratt_domain_2008}:
99 \begin{quote}
100 a homogeneous system is one where all the components are specifically designed to work with each other, whereas in heterogeneous systems at least one of the components is largely, or completely, ignorant of the existence of the other parts of the system.
101 \end{quote}
102 Homogeneous \glspl{EDSL} are therefore languages that are solely defined as an extension to their host language.
103 They often restrict features of the host language to provide a safer interface or capture an idiomatic pattern in the host language for reuse.
104 The difference between a library and a homogeneous \glspl{EDSL} is not always clear.
105 Examples of homogeneous \glspl{EDSL} are libraries such as ones for \glspl{GUI} creation, LISP's macro system, \etc.
106
107 On the other hand, heterogeneous \glspl{EDSL} are languages that are not executed in the host language.
108 For example, Elliott et al.\ describe the language Pan, for which the final representation in the host language is a compiler that will, when executed, generate code for a completely different target platform~\cite{elliott_compiling_2003}.
109 In fact, \gls{ITASK} and \gls{MTASK} are both heterogeneous \glspl{EDSL}.
110
111 \section{Task-oriented programming}
112 \Gls{TOP} is a declarative programming paradigm designed to model interactive systems~\cite{plasmeijer_task-oriented_2012}.
113 Instead of dividing problems into layers or tiers, as is done in \gls{IOT} architectures as well, it deals with separation of concerns in a novel way.
114 From the data types, utilising various \emph{type-parametrised} concepts, all other aspects are handled (see \cref{fig:tosd}).
115 This approach to software development is also called \gls{TOSD}~\cite{wang_maintaining_2018}.
116
117 \begin{figure}[ht]
118 \centering
119 \begin{subfigure}[t]{.5\textwidth}
120 \centering
121 \includestandalone{traditional}
122 \caption{Traditional layered approach.}
123 \end{subfigure}%
124 \begin{subfigure}[t]{.5\textwidth}
125 \centering
126 \includestandalone{tosd}
127 \caption{\Gls{TOSD} approach.}
128 \end{subfigure}
129 \caption{Separation of concerns in a traditional setting and in \gls{TOSD} (adapted from~\cite[pg.\ 20]{wang_maintaining_2018}).}%
130 \label{fig:tosd}
131 \end{figure}
132
133 \begin{description}
134 \item[Presentation layer: UI]
135 The \gls{UI} of the system is automatically generated from the representation of the type.
136 For instance, \gls{TOP} languages implemented in an \gls{FP} language often use generic programming or template metaprogramming to automatically achieve this.
137 \Gls{TOP} languages embedded in imperative programming languages may use introspection\todo{Do I want this sentence here?}.
138 Even though the \gls{UI} is generated from the structure of the datatypes, in many practical \gls{TOP} systems it can be tweaked afterwards to suit the specific needs of the application.
139 \item[Business layer: Tasks]
140 A task is an abstract representation of a piece of work that needs to be done.
141 It provides an intuitive abstraction over work in the real world.
142 Just as with real-life tasks and workflow, tasks can be combined in various ways such as in parallel or in sequence.
143 Furthermore, tasks are observable which means it is possible to observe a --- partial --- result during execution and act upon it by for example starting new tasks.
144 Examples of tasks are filling in a form, sending an email, reading a sensor or even doing a physical task.
145 \item[Resource access: \glspl{SDS}]
146 Tasks can communicate using task values but this imposes a problem in many collaboration patterns where tasks that are not necessarily related need to share data.
147 Tasks can also share data using \glspl{SDS}.
148 \Glspl{SDS} are an abstraction over any data.
149 An \gls{SDS} can represent typed data stored in a file, a chunk of memory, a database \etc.
150 \Glspl{SDS} can also represent external impure data such as the time, random numbers or sensory data.
151 Similar to tasks, transformation and combination of \glspl{SDS} is possible.
152 \item[\Gls{UOD}: PL]
153 The \gls{UOD} from the business layer is explicitly and separately modelled by the relations that exist in the functions of the host language.
154 \gls{TOP} languages are usually embedded in \gls{FP} languages but this not necessarily the case.
155 Some host languages also provide the implementations for the tasks and \glspl{SDS}.
156 \todo{dit moet beter}
157 \end{description}
158
159 \section{Outline}
160 \todo[inline]{uitbreiden}
161 %\epigraph{%
162 % \textbf{rhapsody} /\textipa{['r\ae{}ps@di]}/ \emph{noun} (pl.\ \textbf{-ies}) a piece of music that is full of feeling and is not regular in form: Liszt's Hungarian Rhapsodies.
163 %}{%
164 % Oxford Advanced Learners Dictionary~\cite{margaret_deuter_rhapsody_2015}.
165 %}
166 This thesis is structured as a purely functional rhapsody.
167 On Wikipedia, a rhapsody is defined as follows~\cite{wikipedia_contributors_rhapsody_2022}:
168 \begin{quote}
169 A \textbf{rhapsody} in music is a one-movement work that is episodic yet integrated, free-flowing in structure, featuring a range of highly contrasted moods, colour, and tonality. An air of spontaneous inspiration and a sense of improvisation make it freer in form than a set of variations.
170 \end{quote}
171
172 This thesis follows the tradition and consists of three movements that are episodic yet integrated.
173 \Cref{prt:dsl} is about \gls{EDSL} techniques, \cref{prt:top} elaborates on \gls{TOP} for the \gls{IOT} and \cref{prt:tvt} compares traditional tiered \gls{IOT} architectures to a tierless architectures such as \gls{TOP}.
174 The movements are readable independently if the reader is familiarised with the background material provided in \cref{chp:introduction}.
175 The thesis wraps up with \cref{chp:conclusion} that provides a conclusion and an outlook on future work.
176
177 \subsection{\Cref{prt:dsl}: Domain-specific languages}
178 This movement is a cumulative---paper-based---movement that focusses on techniques for embedding \glspl{DSL} in functional programming lanugages.
179 After reading the first chapter, subsequent chapters in this movement are readable as independently.
180
181 \subsubsection{\Cref{chp:dsl_embedding_techniques}}
182 This chapter shows the basic \gls{DSL} embedding techniques and compares the properties of several embedding methods.
183 This chapter is not based on a paper and written as a background for the subsequent chapters in the movement.
184
185 \subsubsection{\Cref{chp:classy_deep_embedding}}
186 This chapter is based on the paper: \emph{Deep Embedding with Class}~\todo{cite when published}.
187
188 During a Master's thesis supervision~\cite{amazonas_cabral_de_andrade_developing_2018}, focussing on an early version of \gls{MTASK}, a seed was planted for a novel deep embedding technique for \glspl{DSL} where the resulting language is extendible both in constructs and in interpretation using type classes and existential data types.
189 Slowly the ideas organically grew to form the technique shown in the paper.
190
191 The research from this paper and writing the paper was solely performed by me.
192
193 \subsubsection{\Cref{chp:first-class_datatypes}} shows how to inherit data types in \glspl{EDSL} using metaprogramming.
194 This chapter is based on the paper: \emph{First-Class Data Types in Shallow Embedded Domain-Specific Languages using Metaprogramming}~\todo{cite when accepted}.
195
196 The research in this paper and writing the paper was performed by me, though there were weekly meetings with Pieter Koopman and Rinus Plasmeijer in which we discussed and refined the ideas.
197 %
198 %\subsubsection{\Cref{chp:strongly-typed_multi-view_stack-based_computations}} shows how to use advanced \gls{DSL} techniques to embed type safe stack-based computations in a host language.
199 %This chapter is based on the paper: \emph{Strongly-Typed Multi-View Stack-Based Computations}~\todo{cite when accepted}.
200 %\todo{Zal ik dit paper wel opnemen? Aangezien Pieter het grotendeels gedaan heeft?}
201 %
202 %I supported Pieter Koopman in performing the research in this paper by writing some of the software.
203 %The paper was mostly written by Pieter Koopman\todo{probably will be}.
204
205 \subsection{\Cref{prt:top}: Task-oriented \texorpdfstring{\acrlong{IOT}}{internet of things} programming }
206 This part is a monograph focussing on \glspl{TOP} for the \gls{IOT}.
207 Therefore, the chapters depend on eachother and are best read in order.
208 The monograph is compiled from the following papers and revised lecture notes.
209
210 \begin{itemize}
211 \item \bibentry{koopman_task-based_2018}.
212
213 This was the initial \gls{TOP}/\gls{MTASK} paper.
214 Pieter Koopman wrote it but I helped with the software and research.
215 \item \bibentry{lubbers_task_2018}.
216
217 This paper was an extension of my Master's thesis~\cite{lubbers_task_2017}.
218 It shows how a simple imperative variant of \gls{MTASK} was integrated with \gls{ITASK}.
219 While the language was a lot different than later versions, the integration mechanism is still used in \gls{MTASK} today.
220
221 The research in this paper and writing the paper was performed by me, though there were weekly meetings with Pieter Koopman and Rinus Plasmeijer in which we discussed and refined the ideas.
222 \item \bibentry{lubbers_multitasking_2019}.
223
224 This paper was a short paper on the multitasking capabilities of \gls{MTASK} in contrast to traditional multitasking methods for \gls{ARDUINO}.
225 \item \emph{Simulation of a Task-Based Embedded Domain Specific Language for the Internet of Things}~\todo{cite when published}
226
227 These revised lecture notes are from a course on the \gls{MTASK} simulator was provided at the 2018 CEFP/3COWS winter school in Ko\v{s}ice, Slovakia.
228
229 Pieter Koopman wrote and taught it though I helped with the software and research.
230 \item \emph{Writing Internet of Things applications with Task Oriented Programming}~\todo{cite when published}
231
232 These revised lecture notes are from a course on programming in \gls{MTASK} provided at the 2019 CEFP/3COWS summer school in Budapest, Hungary.
233
234 Pieter prepared half of the lecture. I wrote the lecture notes, prepared the other half of the lecture, the assignments and practical session.
235 \item \bibentry{lubbers_interpreting_2019}.
236
237 This paper showed an implementation for \gls{MTASK} for microcontrollers in the form of a compilation scheme and informal semantics description.
238
239 The research in this paper and writing the paper was performed by me, though there were weekly meetings with Pieter Koopman and Rinus Plasmeijer in which we discussed and refined the ideas.
240 \item \emph{Reducing the Power Consumption of IoT with Task-Oriented Programming}~\todo{cite when published}
241
242 This paper shows how to create a scheduler so that devices running \gls{MTASK} tasks can go to sleep more automatically.
243 Furthermore, it shows how to integrate hardware interrupts into \gls{MTASK}.
244 The research was carried out by Sjoerd Crooijmans during his Master's thesis~\cite{crooijmans_reducing_2021}.
245 I did the daily supervision and helped with the research, Pieter Koopman was the formal supervisor and wrote most of the paper.
246 \item \emph{Asynchronous Shared Data Sources}~\todo{cite when accepted}
247
248 \todo[inline]{This paper could in theory be dropped in favour of being done sooner with the thesis}
249
250 Asynchronous \glspl{SDS} are used in the current \gls{ITASK} system and allow bigger \gls{IOT} devices such as raspberry pi's to be used in a tierless \gls{ITASK} system.
251 The example application shown in \cref{prt:tvt} heavily uses asynchronous \glspl{SDS}.
252 This paper shows how to lift \glspl{SDS} to operate asynchronously independent of \gls{ITASK} or another \gls{TOP} implementation.
253
254 The initial research was carried out by Haye B\"ohm during his Master's thesis~\cite{bohm_asynchronous_2019}.
255 I did the daily supervision and helped with the research, Pieter Koopman and Rinus Plasmeijer were the formal supervisors.
256 I extended and generalised the research and wrote the paper.
257 \item \emph{Green Computing for the Internet of Things}~\todo[inline]{cite when done}
258
259 These revised lecture notes are from a course on sustainable programming using \gls{MTASK} provided at the 2022 Sustrainable summer school in Rijeka, Croatia.
260
261 Pieter prepared half of the lecture and wrote the introduction.
262 I wrote the middle part of the lecture notes, prepared the other half of the lecture, the assignments and the practical session.
263 \end{itemize}
264
265 The movement is made up out of the following chapters
266 \todo[inline]{preliminary}
267
268 \begin{itemize}
269 \item The mTask language (includes informal semantics)
270 \begin{itemize}
271 \item Expressions
272 \item Functions
273 \item Tasks
274 \item Shares
275 \item Peripherals
276 \end{itemize}
277 \item Interpretations/views/backends\todo{decide on terminology}
278 \item Green computing
279 \item Integration with iTask
280 \item Implementation (based on IFL19 paper)
281 \item TOP for IoT beyond microprocessors
282 \end{itemize}
283
284 \subsection{\Cref{prt:tvt}: Tiered versus tierless programming}
285 These chapters focus on comparing traditional tiered architectures to tierless architectures and are based on a single journal paper that extended on a conference paper.
286 The conference paper was partly funded by the Radboud-Glasgow Collaboration Fund.
287 It does both a qualitative and a quantitative four-way comparison of a smart campus application.
288
289 The research in these papers and writing them was performed by all authors.
290 I created the server application, the \gls{CLEAN}/\gls{ITASK}/\gls{MTASK} implementation (\acrshort{CWS}) and the \gls{CLEAN}/\gls{ITASK} implementation (\acrshort{CRS})
291 Adrian Ramsingh created the micropython implementation (\acrshort{PWS}), the original python implementation (\acrshort{PRS}) and the server application were created by Jeremy Singer, Dejice Jacob and Kristian Hentschel~\cite{hentschel_supersensors:_2016}.
292
293 \begin{itemize}
294 \item \bibentry{lubbers_tiered_2020}.
295 \item \emph{Could Tierless Languages Reduce IoT Development Grief?}~\todo[inline]{cite when accepted}
296 \end{itemize}
297
298 \newcounter{secondauthorcnt}
299 \newcounter{underreviewcnt}
300 \newcounter{plannedcnt}
301 \newcounter{publicationscnt}
302 \newcommand{\secondauthor}{\textsuperscript{$\ast$}\stepcounter{secondauthorcnt}}
303 \newcommand{\underreview}{\textsuperscript{$\dagger$}\stepcounter{underreviewcnt}}
304 \newcommand{\planned}{\textsuperscript{$\ddagger$}\stepcounter{plannedcnt}}
305
306 \subsection{List of publications}
307 \begin{enumerate}
308 \item \secondauthor{}
309 A Task-Based DSL for Microcomputers
310
311 P. Koopman\footnote{\orcid{0000-0002-3688-0957}}, M. Lubbers\footnote{\orcid{0000-0002-4015-4878}}, and R. Plasmeijer (RWDSL 2018)~\cite{koopman_task-based_2018}.
312 \item Task Oriented Programming and the Internet of Things
313
314 M. Lubbers, P. Koopman, and R. Plasmeijer (IFL 2018)~\cite{lubbers_task_2017} (extension of Master's thesis~\cite{lubbers_task_2018}).
315 \item Multitasking on Microcontrollers using Task Oriented Programming
316
317 M. Lubbers, P. Koopman, and R. Plasmeijer (MIPRO/4COWS 2019)~\cite{lubbers_multitasking_2019}.
318 \item \secondauthor{}
319 Simulation of a Task-Based Embedded Domain Specific Language for the Internet of Things
320
321 P. Koopman, M. Lubbers, and R. Plasmeijer (CEFP/3COWS 2018).
322 \item Writing Internet of Things applications with Task Oriented Programming
323
324 M. Lubbers, P. Koopman, and R. Plasmeijer (CEFP/3COWS 2019).
325 \item Interpreting Task Oriented Programs on Tiny Computers
326
327 M. Lubbers, P. Koopman, and R. Plasmeijer (IFL 2019)~\cite{lubbers_interpreting_2019}.
328 \item Tiered versus Tierless IoT Stacks: Comparing Smart Campus Software Architectures
329
330 M. Lubbers, P. Koopman, A. Ramsingh\footnote{\orcid{0000-0003-3501-902X}}, J. Singer\footnote{\orcid{0000-0001-9462-6802}}, and P. Trinder\footnote{\orcid{0000-0003-0190-7010}} (IoT 2020)~\cite{lubbers_tiered_2020}.
331
332 % in press
333 \item Deep Embedding with Class
334
335 M. Lubbers (TFP 2022).
336 \item \secondauthor{}
337 Reducing the Power Consumption of IoT with Task-Oriented Programming
338
339 S. Crooijmans, M. Lubbers, and P. Koopman (TFP 2022).
340
341 %Under review
342 \item \underreview{} Could Tierless Languages Reduce IoT Development Grief?
343
344 M. Lubbers, P. Koopman, A. Ramsingh, J. Singer, and P. Trinder (ACM TIOT) (extension of~\cite{lubbers_tiered_2020}).
345 \item \secondauthor\underreview{}
346 Strongly-Typed Multi-View Stack-Based Computations
347
348 M. Lubbers and P. Koopman (IFL 2022 under review).
349 \item \underreview{} First-Class Data Types in Shallow Embedded Domain-Specific Languages using Metaprogramming
350
351 M. Lubbers, P. Koopman, and R. Plasmeijer (IFL 2022 under review).
352 \item \underreview{} Asynchronous Shared Data Sources
353
354 M. Lubbers, H. Böhm, P. Koopman, and R. Plasmeijer (IFL 2022 under review).
355
356 %Planned
357 \item \planned{} Green Computing for the Internet of Things
358
359 M. Lubbers, P. Koopman (Sustrainable 2022 planned).
360
361 \setcounter{publicationscnt}{\value{enumi}}
362 \end{enumerate}
363
364 There are \thepublicationscnt{} publications of which \thesecondauthorcnt{} I'm second author, \theunderreviewcnt{} are under review and \theplannedcnt{} are planned.
365
366 \input{subfilepostamble}
367 \end{document}