Event Related Potentials in the Electroencephalogram for Human
Transcription
Event Related Potentials in the Electroencephalogram for Human
Event Related Potentials in the Electroencephalogram for Human-Robot Interaction Svetlin Penkov s1329991 NI VER S E R G O F H Y TH IT E U D I U N B Master of Science by Research Doctoral Training Centre in Neuroinformatics and Computational Neuroscience School of Informatics University of Edinburgh 2014 Abstract There is a great gap between the current state of robotics and the general population expectations of what tasks an advanced robot should be able to perform. Even though robots are irreplaceable factory workers, they are not able to fully autonomously perform even simple tasks in an uncontrolled human environment. One way to relax the problem of autonomous behaviour is to rely on human-robot interaction, establishing a semi-supervised scheme, where the human occasionally provides supervisory information to the robot. Such approach, however, requires an intuitive, robust and efficient human-robot interface. The human brain constantly registers various events in the environment which causes certain patterns in the electromagnetic activity to emerge. This thesis explores the detection and classification of event related potentials (ERPs) in the electroencephalogram for human-robot interaction. Previous ERP research utilises medical class EEG equipment which severely limits the mobility of the person wearing it. We show that the Emotiv EPOC headset, which is a lightweight low-cost wireless EEG capturing device, can detect ERPs evoked when an error is committed or observed by the human. Furthermore, three types of classifiers (a discriminative, a generative and a temporal one) were trained to recognise correct and incorrect events. It was found out that the dimensionality of the EEG data could be significantly reduced and that the discriminative classifier, which uses logistic regression, achieved the best performance compared to the other two. The evaluation of the Emotiv EPOC headset and the reported classification results provide the basis for further research in the field of human-robot interaction which will be conducted during the subsequent PhD. iii Acknowledgements I would like to thank my supervisor Dr Subramanian Ramamoorthy who was shaping my ideas when I did not know how to proceed. I would also like to thank Nantas Nardelli and Alejandro Bordallo Micó for the numerous discussions. Last, but not least, I would like to express my gratitude to all volunteers who participated in the experiments. iv Declaration I declare that this thesis was composed by myself, that the work contained herein is my own except where explicitly stated otherwise in the text, and that this work has not been submitted for any other degree or professional qualification except as specified. (Svetlin Penkov s1329991) v Table of Contents 1 2 Introduction 1 1.1 Motivation and Aim . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1.2 Outline . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Background 5 2.1 Neuroergonomics . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.1.1 Definition . . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 2.1.2 Motivation and research . . . . . . . . . . . . . . . . . . . . 6 2.1.3 Measuring brain activity . . . . . . . . . . . . . . . . . . . . 7 Electroencephalography (EEG) . . . . . . . . . . . . . . . . . . . . . 7 2.2.1 Types . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8 2.2.2 Principles of operation . . . . . . . . . . . . . . . . . . . . . 8 2.2.3 Noise and artefacts . . . . . . . . . . . . . . . . . . . . . . . 9 2.2.4 Rhythmic activity . . . . . . . . . . . . . . . . . . . . . . . . 10 Event-related potentials (ERP) . . . . . . . . . . . . . . . . . . . . . 10 2.3.1 Readiness potential . . . . . . . . . . . . . . . . . . . . . . . 10 2.3.2 Contingent negative variation . . . . . . . . . . . . . . . . . 11 2.3.3 P300 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.4 N400 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.5 P600 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11 2.3.6 Error related negativity (ERN) . . . . . . . . . . . . . . . . . 12 Brain-computer interfaces . . . . . . . . . . . . . . . . . . . . . . . 13 2.4.1 Spontaneous activity BCIs . . . . . . . . . . . . . . . . . . . 14 2.4.2 Event-related activity BCIs . . . . . . . . . . . . . . . . . . . 14 2.4.3 Human-robot interaction . . . . . . . . . . . . . . . . . . . . 14 2.2 2.3 2.4 vii 3 Methods 17 3.1 Experimental setup . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 3.1.1 Emotiv EEG EPOC headset . . . . . . . . . . . . . . . . . . 17 3.1.2 Experiment 1 - flanker task (FT) . . . . . . . . . . . . . . . . 18 3.1.3 Experiment 2 - robotic arm task (RAT) . . . . . . . . . . . . 20 ErrP Classification . . . . . . . . . . . . . . . . . . . . . . . . . . . 22 3.2.1 Robust discriminant power (RDP) . . . . . . . . . . . . . . . 22 3.2.2 Dimensionality reduction - principal components analysis (PCA) 23 3.2.3 Discriminative classifier - logistic regression (LR) . . . . . . 24 3.2.4 Generative classifier - mixture of Gaussians (MOG) . . . . . . 24 3.2.5 Temporal classifier - hidden Markov model (HMM) . . . . . . 25 3.2 4 Observing error related negativity 27 4.1 EEG data preprocessing . . . . . . . . . . . . . . . . . . . . . . . . . 27 4.1.1 Extracting event centred EEG epochs . . . . . . . . . . . . . 27 4.1.2 Frequency filtering . . . . . . . . . . . . . . . . . . . . . . . 28 4.1.3 Analysis for eye movement artefacts . . . . . . . . . . . . . . 28 Mean ERPs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 4.2.1 Mean response ErrP . . . . . . . . . . . . . . . . . . . . . . 31 4.2.2 Mean observation ErrP . . . . . . . . . . . . . . . . . . . . . 31 4.3 Significance of the data . . . . . . . . . . . . . . . . . . . . . . . . . 34 4.4 Evaluation of the Emotiv EPOC headset . . . . . . . . . . . . . . . . 34 4.2 5 6 ErrPs classification results 39 5.1 Channel selection . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 5.2 Dimensionality reduction . . . . . . . . . . . . . . . . . . . . . . . . 42 5.3 Logistic regression classifier . . . . . . . . . . . . . . . . . . . . . . 43 5.4 Mixture of Gaussians classifier . . . . . . . . . . . . . . . . . . . . . 46 5.5 Hidden Markov model classifier . . . . . . . . . . . . . . . . . . . . 46 5.6 Classifiers comparison . . . . . . . . . . . . . . . . . . . . . . . . . 51 Conclusion 53 6.1 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 53 6.2 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54 Bibliography 55 viii Chapter 1 Introduction 1.1 Motivation and Aim Robots have long been imagined as workers that work alongside humans, extending the time an elderly person can live at home, providing physical assistance to a worker on an assembly line, or helping with household chores [1]. Despite the rapid development of technology, there is a significant mismatch between the current state of robotics and the futuristic general public expectations. Nowadays, robots are working relentlessly, performing tasks with unprecedented strength, speed and accuracy in various factories where the environment is carefully controlled. However, there are many issues related to the autonomous operation in human environments where human-robot interaction should occur. First of all, human environments vary a lot and tend to be complex. For example, imagine a typical kitchen with all the furniture, equipment and utensils in it. There are millions of kitchens, but most probably no two kitchens are absolutely the same. Secondly, other autonomous actors are present who usually introduce dynamic variations in the environment, often imposing real time constraints. Co-working with humans would require the abilities to manipulate various objects including non-rigid ones. The Fukushima Daiichi nuclear disaster clearly illustrated the need for advanced rescue robots capable of acting in uncontrolled human environments. The Darpa Robotics Challenge is an attempt to focus robotics research on those topics by challenging the robots to perform tasks such as climbing a ladder, opening a door and removing debris [2]. Most of the current state of the art methods in robotics rely on supervised machine learning techniques [1] which extract meaningful statistics from the incoming data 1 2 Chapter 1. Introduction during the training stage. A key component is the supervisory signal which has to be provided to the robot during the training. One possibility is to use labelled data, but the amount of data required to learn a problem increases exponentially with its complexity. Labelling large amounts of data is error prone, tedious and consumes time. Another possibility is to have a robot and a human working together, establishing a semi-supervised learning scheme, where the human actively interacts with the robot providing necessary guidance. Currently, human-robot collaboration occurs at the slave-master level, where the safety of the human has the highest priority and the robot simply follows commands. The robot could also react to the behaviour of the human by monitoring various behavioural characteristics including arm movements [3] and walking patterns [4]. However, more intuitive and efficient human-robot interaction methods are still required [1]. Brain computer interfaces (BCI) have been successfully used to control simple mobile robots moving in a plane [5, 6] and also quadcopters flying in 3D space [7]. BCIs rely on electroencephalographic (EEG) recordings of the electromagnetic brain activity which constitutes of two components: spontaneous activity and event related potentials (ERP) [8]. The ERPs are spatiotemporal patterns of brain activity which correspond to various events such as anticipation of an action or a stimulus, violation of the expected sensory input or perception of error [9]. Detection of such ERPs could allow the robot to infer the intention of the human and so enabling a more intuitive and efficient human-robot interaction. There is a large amount of research on ERPs as they are of great interest to neuroscientists and cognitive neuroscientists. However, most of the experiments are conducted in controlled lab environments. The field of neuroergonomics attempts to bring brain imaging methods to real-world environments in order to improve the interaction between a system and an operator and their overall performance. We believe that such neuroergonomic approach is essential for the development of the next generation intuitive and efficient human-robot interfaces. The work presented here is a pilot study for a subsequent PhD in human-robot interaction, which aims at exploring ERP detection and classification for the future development of a human-robot interface such as the one shown in figure 1.1. The human simply observes the robot and the working environment which causes various ERPs to be generated by the human brain. Those ERPs are first recorded using an EEG device and then detected and classified by performing suitable analysis of the 1.1. Motivation and Aim 3 Figure 1.1: Human-robot interface based on electromagnetic brain activity and the proposed cognitive control loop utilising event related potentials. EEG data. The robot utilises this information in order to learn and take decisions which hopefully lead to a more optimal behaviour. The altered behaviour evokes new ERPs and the whole sequence is repeated forming a cognitive control loop based on the electromagnetic brain activity of the operator which is modulated according to their plan, action and preferences. There are several important aspects of the work presented here. Firstly, a low-cost and commercially available EEG recording device is used, namely the Emotiv EPOC headset. It is a lightweight EEG device which allows wireless data acquisition from 14 channels making it suitable for real world environments and applications. One of the aims of the project is to evaluate the Emotiv EPOC headset and determine whether it really can be used for such scenarios. Secondly, we focus on the detection and classification of event related potentials which are evoked when an error is perceived, thus forming a reward signal that can easily be fed into a reinforcement learning algorithm later. Two types of error related event potentials are studied - when an erroneous action is executed and when an error is simply observed. Last but not least, three types of classifiers (a discriminative, a generative and a temporal one) are developed to recognise error related event potentials and their performance is analysed and compared. 4 1.2 Chapter 1. Introduction Outline Chapter 2 This chapter provides an introduction to the field of neuroergonomics describing its scope and research methods. Electroencephalography, which is on of the most often used brain imaging technique in the field, is described and issues related to it are discussed. A concise overview of the most well known event related potentials is also provided. The chapter finishes with a discussion on brain-computer interfaces and human-robot interaction. Chapter 3 This chapter describes the Emotiv EPOC headset used to collect EEG data. The two experiments which were conducted are also explained. A short overview of the mathematical tools utilised for the classification of the event related potentials is also provided. Chapter 4 The chapter shows the analysis of the recorded EEG data, acquired during the experiments. The steps for data preprocessing are described and the presence of ocular movement artefacts is investigated. The obtained event related potentials are reported and their statistical significance is examined. The chapter concludes with an evaluation of the Emotiv EPOC headset. Chapter 5 This chapter presents the channel selection procedure which was conducted and the dimensionality analysis applied to the data. The results of training three types of classifiers (a discriminative, a generative and a temporal one) to detect errors from event related potentials are reported and their performance is compared. Chapter 6 This chapter provides a a summary of the key points from this thesis. It concludes with a discussion on the presented work and points out potential future directions for development during a PhD. Chapter 2 Background This chapter provides a short introduction to the field of neuroergonomics - describing what it is and the current state of research within the field. Furthermore, the application of neuroergonomics to brain-machine interfaces (BMI) and human-robot interaction is discussed. One of the most widely used brain imaging methods in neuroergonomics, which is the electroencephalography (EEG), is described and various issues related to it are reviewed. The chapter also contains a non-exhaustive overview of the research focused on event related potentials (ERPs) in EEG signals. The presented work is based on ERPs related to errors (ErrPs), therefore the various types of ErrPs are also explained. Finally, applications of ERPs to brain-computer interfaces are overviewed and several key characteristics for intuitive, intelligent and efficient human-robot interfaces are outlined. 2.1 2.1.1 Neuroergonomics Definition Neuroergonomics is the study of the brain and behaviour in relation to performance in natural everyday settings [10]. It is an interdisciplinary field which combines ergonomics and human neuroscience. Ergonomics is a discipline concentrated not only on improving productivity in physical work environments, but it also studies the interaction between humans and various system components in order to ensure human well-being and greater overall system performance [11]. Human neuroscience studies the human nervous system and how it functions. Most of the studies rely on experiments conducted in controlled lab conditions which give very little information about 5 6 Chapter 2. Background the tasks performed by the brain in natural environments. Often those experiments rely on the fact that participants will be stationary, entirely focused on the task without any distractions. However, the theory of embodied cognition clearly states that the cognitive capabilities of an agent depend on its ability to actively move and explore its complex environment where many sources of distraction exist [12]. Neuroergonomics addresses that issue by utilising brain imaging techniques and applying them to natural environments which are extensively studied by ergonomists. 2.1.2 Motivation and research The benefit of correlating brain activity with behaviour and performance during the execution of real tasks is twofold. Firstly, more realistic models of the human brain and cognition could be developed and validated. Secondly, new technologies emerge which have critical impact and ensure the good overall system performance and safety. Neuroadaptive systems are an excellent example, where measures inferred from brain activity, such as fatigue in drivers [13] or the cognitive load of aircraft pilots [11], are used to determine how much control the operator should have and how much autonomy is granted to the system. If, for example, a pilot has to execute a cognitively demanding task, the aircraft should autonomously resolve non-critical issues. On the other hand, when the cognitive load of the pilot is low, those non-critical issues should be reported to the pilot. Limiting the autonomy of the aircraft, when it is possible, would keep the pilot focused and ensure optimal overall performance of the operator and the system. Research in neuroergonomics also aids the development of brain-machine interfaces (BMI) which currently lack robustness and are mainly limited to operate in experimental conditions. As discussed in the previous chapter, a neuroergonomic approach to BMI is required in order to create intuitive, robust and efficient human-robot interfaces. Having access to environment variables such as the mental state of the human would allow the robot to take more intelligent decisions and execute appropriate actions. There is a large amount of research in the field of neuroergonomics focused on decoding human intention [9]. The robot could use the intention of the human in order to predict what it should do, rather than just simply react to the current situation. That idea has resulted in an adaptive shared control paradigm for human-robot interaction based on intention prediction from neural activity [5, 6]. 2.2. Electroencephalography (EEG) 7 Figure 2.1: Brain imaging methods used in neuroergonomics compared in terms of spatial resolution (x-axis), temporal resolution (y-axis) and degree of immobility (z-axis). 2.1.3 Measuring brain activity Measuring brain activity is vital tool commonly utilised by neuroscientists. There is a large number of brain imaging techniques which offer different levels of spatial and temporal resolution. Methods such as the electroencephalography (EEG) have very high temporal resolution (in the range of 1ms), but poor spatial resolution. In the other end of the spectrum are methods such as functional magnetic resonance imaging (fMRI) which could provide spatial resolution of less than 1mm, but could record only slow events (in the range of 2s). Additionally, neuroergonomics imposes one more constraint to the brain imaging techniques. The subject should be able to move freely within their environment, unobstructed by the equipment which monitors the brain activity. Figure 2.1 illustrates how the available brain imaging techniques compare to each other based on 3 criteria - spatial resolution, temporal resolution, immobility [11]. The compactness and mobility of EEG equipment and its fine temporal resolution make EEG one of the most used brain imaging techniques in neuroergonomics. 2.2 Electroencephalography (EEG) EEG is one of the oldest brain imaging techniques with the first recordings from animals made in 1875 by the English physiologist Richard Caton. 8 Chapter 2. Background Figure 2.2: A diagram of showing the operational principles of EEG [14]. 2.2.1 Types There are three broad classes of EEG depending on the location of the electrode used to record the brain electromagnetic activity. During intracortical EEG a single electrode or an array of electrodes is inserted in the brain tissue providing high quality signal. Because of the intrusiveness of the method, it is applicable only to animals and in very rare cases to humans. The second type is cortical EEG where usually an array of electrodes is placed on the surface of the brain. It is mainly applied to treating patients with epilepsy. The last type is surface EEG where one or many electrodes are placed on the scalp. This type of EEG is non-intrusive which makes it suitable for recording EEG activity of human subjects while they perform different tasks. 2.2.2 Principles of operation Figure 2.2 shows a diagram which explains the principles of operation of surface EEG. The postsynaptic activity of the cerebral neurons, which form the grey matter, generates an electromagnetic field. An electrode which is placed on the scalp picks up that field and measures its potential with respect to another reference electrode placed elsewhere on the scalp. Modern EEG equipment can record simultaneously from as many as 256 electrodes that are placed according to the international 10-20 system which 2.2. Electroencephalography (EEG) 9 Figure 2.3: The EEG electrode locations according to the international 10-20 system. standardises electrode locations as shown in figure 2.3. The maximum amplitude of the detected voltages across the scalp does not exceed more than 200µV for healthy subjects. Subsequently, the analog signal is transformed into a digital one by first amplifying it and then sampling it. Once the EEG signal is digitised it could be filtered, processed and analysed by a regular computer using the appropriate software. 2.2.3 Noise and artefacts Surface EEG provides exceptionally fine temporal resolution, but there are several factors which cause the recorded signal to be noisy. Firstly, the electromagnetic field generated by the dendritic activity is very week. The signals which can be detected at the scalp are a mixture of the activity of thousands of pyramidal neurons in the cortex. Secondly, the electromagnetic field has to propagate through several layers of tissue as illustrated in figure 2.2 until it reaches the electrode which significantly attenuates it and severely reduces the signal to noise ratio. Furthermore, the signal is often contaminated with artefacts generated by muscle activity e.g. tongue movement, facial muscles contraction, ocular movement. In order to avoid such artefacts usually the participants in the experiment are instructed to move as little as possible. A less frequent approach is to simultaneously record the activity of skeletal muscles using electromyography (EMG) and reject data during movements above a certain threshold. Ocular movement is hard to control and it is 10 Chapter 2. Background often required in order to perform the experimental task. Therefore, various methods for ocular movement artefacts removal have been proposed [15]. 2.2.4 Rhythmic activity EEG signals have complex oscillatory waveforms which have been of a great interest to researchers for a long time. Several frequency bands have been identified, namely infra-slow (< 0.2Hz), δ (0.2 to 3.5Hz), θ (4 to 7.5Hz), α and µ (8 to 13Hz), β (14 to 30Hz), γ (30 to 90Hz) and high frequency oscillations (HFO) (> 90Hz). Various cognitive processes have been mapped to changes in the activity at different frequency bands. For example, the power of the θ band is increased during memory recognition task [16]; phase locked oscillations in the γ band are associated with conscious perception [17]; γ oscillations occur during voluntary isometric muscle contractions [18]. Furthermore, synchronisation between the oscillations of separate brain regions is commonly interpreted as either structural or functional connection between the regions [19]. 2.3 Event-related potentials (ERP) There is a large amount of research related not only to EEG oscillations, but also to EEG activity which is time locked to a particular event. The potentials, which the event evokes, are called event-related potentials (ERPs) and provide valuable information regarding the operation of the brain - namely how it processes various events such as sensory stimuli, movement preparation and execution, error detection. ERPs are usually observed by averaging EEG signals from multiple trials, however real time detection of ERPs requires single trial classifiers, which are the focus of this project. The following subsections list some of the most prominent ERPs in the literature and provide short descriptions. 2.3.1 Readiness potential Readiness potential is one of the first reported ERPs. It is expressed as a slow decrease in the voltage over the central scalp areas during movement initiation [9]. Lateralized readiness potential (LRP) could be measured by analysing the difference between the EEG signals at the C3 and C4 electrodes. If the subject is to move one hand then the electrode on the contralateral hemisphere would detect more negative voltage than the 2.3. Event-related potentials (ERP) 11 ipsilateral electrode [20]. Readiness potential provides a direct measure of movement intention and which arm is to be moved could also be predicted. 2.3.2 Contingent negative variation Contingent negative variation (CNV) is an ERP which occurs when the subject expects a particular stimulus associated with an already perceived clue [21]. The CNV is expressed as a gradual and continuous decrease of the EEG potential in the frontal lobe until the expected stimulus is presented. Hence, CNV could be used as a measure of anticipation of a particular stimulus. 2.3.3 P300 The P300 is characterised with a slow positive peak of the EEG potential in the centroparietal scalp region at approximately 300ms after the event onset [22]. It has large amplitude and could even be measured on single trials, which makes it probably the most studied and used ERP. It is evoked by the oddball paradigm. Two stimuli either conceptual or perceptual are selected and randomly presented to the subject with lower probability assigned to one of them. The amplitude of the evoked P300 ERP is larger when the less probable stimulus is perceived by the subject. The amplitude of the P300 is also thought to reflect the amount of attention which the subject is paying to the task. Furthermore, P300 is regarded as a reliable index of cognitive workload [10]. 2.3.4 N400 The N400 ERP is related to the difficulty of integrating new incoming information with prior beliefs. It is characterised as a widespread drop in the potential which occurs approximately 400ms after an unexpected event [9]. 2.3.5 P600 P600 is a language related ERP which is characterised by a positive peak in the potential of the posterior temporal lobe which occurs 600ms after a language error is perceived [9]. 12 Chapter 2. Background Figure 2.4: Average response ErrP [24]. 2.3.6 Error related negativity (ERN) Error related negativity is an ERP which is observed between 100ms and 150ms after an error is perceived. ERN is characterised with a negative peak, which is generated by the anterior cingulate cortex (ACC) in the frontal lobe. The peak is much smaller or absent when no error is perceived [23]. It is important to note that the amplitude depends on the perceived error meaning that even if the person commits an error, but does not perceive it as such then ERN will not be observed. Several sub-types of ERN have been reported and are termed as error related potentials (ErrP). 2.3.6.1 Response ErrP Response ErrP is invoked when a person executes an erroneous action and is characterised with a negative potential peak at approximately 100ms and a second positive peak at approximately 300ms after the error has been perceived [24, 25]. Figure 2.4 shows the response ErrP which was formed by averaging multiple trials. 2.3.6.2 Feedback ErrP Feedback ErrPs (or reinforcement ErrPs) are evoked when feedback indicating wrong action or decision is delivered. Feedback ErrPs have similar waveform as the one shown in figure 2.4, but they are usually slightly delayed as additional time is required to process the feedback stimulus [26]. 2.4. Brain-computer interfaces 13 Figure 2.5: Average observation ErrP [27]. Note that the y-axis is flipped. 2.3.6.3 Observation ErrP Observation ErrPs are evoked when the subject observes an error committed by another person or agent [27]. It is characterised with a negativity peak in the frontal lobe at approximately 250ms after an error is observed as shown in figure 2.5. The strong positive peak observed in a response ErrP is not present, or it has very small amplitude. 2.3.6.4 Interaction ErrP Interaction ErrPs are invoked when a command is delivered and the machine executes another one. Interaction ErrPs have similar shape to the response ErrP with one negative and one positive peak, however they are delayed with 100ms more [28]. Even though classified as a new type of ErrP, one could argue that interaction ErrP is simply a reaction ErrP, which is delayed due to the time required to realise the error. 2.4 Brain-computer interfaces There are two main classes of brain-computer interfaces: the first one exploits spontaneous EEG activity which subjects could learn to control; the second one relies on event related activity, which subjects have no conscious control of, and it is dependent on the interaction with the environment [8]. 14 2.4.1 Chapter 2. Background Spontaneous activity BCIs BCIs based on spontaneous activity have been applied to tasks such as control of a wheel chair [6, 5] or flying quadcopter [7] (for a detailed list of application see [29]). Spontaneous activity BCIs require a learning period during which the EEG activity of the operator is recorded while they think about the command. Certain features are then selected from those signals and passed through a classifier which outputs the detected command. This approach enables intuitive and quick interaction, but there are two main problems with it. Firstly, only a small number of commands could be learnt, thus severely limiting the capabilities of the interface. Secondly, EEG signals are chaotic and non-stationary [30], meaning that they will inevitably change over time resulting in performance decrease over time of the brain-computer interface. 2.4.2 Event-related activity BCIs Event-related activity BCIs are generally slower as they require constant interaction with the environment in order to register events. They have been applied to virtual spellers using the P300 ERP [31], virtual robotic arm control [32], BCI classifier adjustment [33]. An attempt has been made to remedy the issue with low interface speed and throughput by using the readiness potential which predicts movements [34]. BCIs based on event-related activity allow for higher throughput of information as the operator should not consciously think about the commands they are sending, because the brain automatically generates the ERPs. 2.4.3 Human-robot interaction Imagine a scenario where a human and a robot have to cooperate, for example assembling a computer. If the human had to constantly think about the actions which the robot should take, then the human will not be able to perform their own tasks. However, if an event-related BCI is used, then the human will subconsciously generate instructions for the robot. For example if the robot approaches the wrong detail, an error-related potential will be detected, informing the robot that it needs to adjust its behaviour. If a readiness potential is detected the robot will know that the human is about to move and it should assist the human if necessary. Intention prediction is a crucial feature for an intuitive human-robot interface. EEG activity is an excellent source of information for that [9]. Other equipment could be used as well (e.g. eye trackers, 2.4. Brain-computer interfaces 15 RGBD cameras, EMG) in order to successfully infer the intention of the operator. Error related potentials are particularly suitable for human-robot interfaces as their meaning could be directly translated to the reward signal in an reinforcement learning algorithm [32, 35]. ErrPs have been used to control a simulated 2-dimensional robotic arm [32], however, it is to the best of the author’s knowledge, that ErrPs have not been applied to an actual human-robot interface with a physical robot. This thesis is a pilot study in exploring single trial ERP detection and classification using affordable, compact and mobile EEG equipment with the purpose of developing actual HRI based on event related potentials. Chapter 3 Methods This chapter provides a thorough description of the experiments which were conducted during the project and the equipment which was used. A brief overview of the classifiers which are trained to detect single trial ERPs is given. Additionally, the dimensionality reduction method used to preprocess the EEG data is described as well as a feature selection method which is used for channel selection. 3.1 Experimental setup Two experiments were conducted during which the participants had to perform a behavioural task and their EEG activity was recorded. There were 10 participants in total. Each one of them participated in both experiments and signed a consent form prior to the experiments. Furthermore, ethical self-assessment was conducted and submitted to the School of Informatics as required by the University of Edinburgh regulations. The experiments were conducted in a sound insulated room in order to reduce the distraction of the participants. 3.1.1 Emotiv EEG EPOC headset The Emotiv EPOC headset was used to record the EEG activity of the participants during both of the experiments. One of the main aims of the current project is to evaluate the Emotiv EPOC headset which is a lightweight wireless EEG capturing device. Providing great mobility, 14 EEG channels and a built in 2-dimensional gyroscope at a comparatively low cost, the Emotiv EPOC headset is an extremely suitable device for the purposes of the development of an intuitive and efficient neural based human-robot 17 18 Chapter 3. Methods (a) Picture of the Emotiv EPOC headset. (b) The channels available on the Emotiv EPOC headset according to the 10-20 system are highlighted in orange. Figure 3.1: The Emotiv EPOC headset. interface. A picture of the Emotiv EPOC headset is shown in figure 3.1a and the location of the 14 electrodes according to the international 10-20 system is shown in figure 3.1b. The Emotiv EPOC headset uses dry electrodes which rely on pressure against the scalp to form good contact. Nevertheless, the electrodes need to be moistened with saline before use, but skin preparation gel is not necessary unlike to the conventional wet Ag/AgCl electrodes, which are typically used in research. Figure 3.2 lists some of the key characteristics of the Emotiv EPOC headset. 3.1.2 Experiment 1 - flanker task (FT) The aim of the first experiment is to confirm that response ErrPs can be recorded with the Emotiv EPOC headset as reported by [36]. The detection of P300 with the Emotiv EPOC headset has already been demonstrated [37]. During the experiment the participants had to perform a simple flanker task. They had to fixate a cross in the middle of the screen for 500ms. Once that time elapsed, a stimulus consisting of 5 arrows would be shown to the participant for 100ms and they had to indicate the direction of the middle arrow (left or right) as fast as possible using the keyboard arrow buttons. There were 4 possible stimuli - two congruent (<<<<< and >>>>>) and two incongruent (<<><< and >><>>) stimuli. In each trial, one of the stimuli was randomly chosen with equal probabilities for each stimulus. 3.1. Experimental setup Number of channels Channel names 19 14 (+2 reference channels) AF3, F7, F3, FC5, T7, P7, O1, O2, P8, T8, FC6, F4, F8, AF4 Sampling method Sampling rate Sequential sampling. Single ADC. 128 SPS (2048Hz internal) Resolution 14 bits, 1 LSB = 0.51 µV Bandwidth 0.2 - 45Hz, digital notch filters at 50Hz and 60Hz Filtering Built in digital 5th order Sinc filter Dynamic range 8400 µV (pp) Coupling mode AC coupled Connectivity Power Battery Life (average) Proprietary wireless, 2.4GHz band LiPoly 12 hours Impedance Measurement Real-time contact quality using patented system Figure 3.2: Characteristics of the Emotiv EPOC headset. Once the stimulus was presented a blinking dash would appear on the centre of the screen with blinking frequency of 2Hz until the subject reacts. This puts additional time pressure on the participant. Once the subject pressed a keyboard button indicating left or right, the screen would become blank for 500ms and then the next trial would begin with the fixation cross. During the experiment the screen background was black, while all stimuli were white. 6 sessions with 40 trials each were conducted per subject with 1 minute time for rest between the sessions. Thus the experiment duration was approximately 15 minutes per subject and 240 flanker task trials were recorded per subject. In order to keep the EEG data as noise free as possible, the participants were asked at the beginning of the experiment to sit comfortably and minimize their movements including tongue movement and blinking. Given the time pressure, the short exposure time to the stimulus and the incongruence of some of the stimuli, the subjects were prone to make errors, hence reaction ErrPs to be invoked. Even though subjects were instructed to be as fast as possible some of them were too careful not to make a mistake. Therefore, an improvement of the experiment protocol could involve the gamification of the task. It is possible to 20 Chapter 3. Methods give the participant a feedback score which puts larger weight on speed rather than accuracy. The synchronisation between the recorded EEG data and the behavioural response is essential for the results of the experiment. The Emotiv EPOC headset works only with a proprietary API which is implemented in C++. In order to avoid additional timestamping and minimise the margin for error in synchronisation the experimental task programs were implemented in C++ as well with the aid of the open computer vision library (OpenCV). 3.1.3 Experiment 2 - robotic arm task (RAT) The aim of the second experiment is to investigate whether observation ErrPs could be detected with the Emotiv EPOC head set. It is to the best of the author’s knowledge that observation ErrPs have been recorded only with medical grade EEG equipment [27, 32]. ErrPs have much smaller amplitude and so the signals provided by the Emotiv EPOC headset may be too noisy. The second experiment recreates the task used in [32] to record observation ErrPs. If observation ErrPs are not detected in our experiment then the conclusion that the Emotiv EPOC headset does not provide signals with sufficient quality could be safely drawn. During the second experiment the subjects are seated comfortably in front of a computer and they are asked to simply observe a planar robotic manipulator with 2 degrees of freedom (one rotational and one translational) which performs a reaching task. There are five baskets and the subject is told that the robot should always reach for the middle basket. Once a trail begins, the robot waits in idle position for 1 second as shown in figure 3.3a. In the experiment reported in [32] the basket towards which the robotic arm will reach is selected randomly from a uniform distribution over the baskets. Initial trails revealed that this enables the participants to quickly realise that the arm moves absolutely randomly and they lose concentration. Therefore, the selection of the basket for our experiment was altered. Firstly, the robot is not allowed to reach to a wrong basket twice in a row. Secondly, if the previous action was correct, the type of action is randomly selected - correct or wrong with 50% chance for each. If a wrong action is chosen, then any of the wrong baskets is randomly picked following a uniform distribution. When the basket is selected the robot instantaneously moves to it as shown in figure 3.3c and figure 3.3b. Moving instantaneously provides a distinctive event which the EEG data to be synchronised to. The manipulator remains in the 3.1. Experimental setup 21 reaching position for 1 second and goes back to the idle position, ready to execute a new movement. The experiment consisted of 6 session with 1 minute break between sessions. During each session the robotic arm performed 5 sequences of 20 movements with 5 seconds rest between each sequence, allowing the participant to re-adjust their position should they want. Hence, the total length of the experiment is approximately 25 minutes per subject. The total number of reaching trials during which the EEG activity was recorded is 6000. (a) The robotic arm in idle position. (b) The robotic arm reaching to a wrong basket. (c) The robotic arm reaching to the correct basket. Figure 3.3: The robotic arm experiment. 22 3.2 Chapter 3. Methods ErrP Classification Another aim of the presented thesis is to investigate to what extent single trial ERP detection is possible, how reliable it is and what type of classifiers cope best with the task of recognising ERPs provided with the data acquired with the Emotiv EPOC headset. It is not a trivial problem since EEG signals are known to be chaotic [30], noisy and last, but not least, ERPs have short duration of several hundred milliseconds. 3.2.1 Robust discriminant power (RDP) EEG equipment records the electromagnetic equipment of the brain with as many as 256 electrodes; even sparse sensors such as the Emotiv EPOC headset provide 14 channels. The EEG data is noisy and highly correlated, therefore it will be naive to feed all this data into a classifier and hope that it will make sense out of it. A common starting point when working with EEG data is to select one or several channels of interest. The process of channel selection requires a set of features which are measurable in order to enable objective data-based comparison of the separate channels. The concept of feature selection based on RDP is introduced in this section and the application to channel selection is described in section 5.1. 3.2.1.1 Feature selection The robust discriminant power (RDP) measure is introduced in [38]. It estimates the extent to which a given feature could be used to discriminate between several classes, assuming normal feature distribution. Let us consider a two class example for the purpose of explanation. If the distribution of a feature f for class 1 is pd f 1 ( f ) and for class 2 is pd f 2 ( f ) then feature f has high discriminant power if pd f 1 ( f ) is well separated from pd f 2 ( f ). If pd f 1 ( f ) and pd f 2 ( f ) are overlapping then feature f cannot be used to discriminate between the two classes. The discriminant power algorithm is computationally efficient because it is not necessary to estimate the actual distributions. It is sufficient to know the minimum and the maximum values of the feature f for which there is non-zero probability according to pd f 1 ( f ) and pd f 2 ( f ). Given those limits it is easy to estimate the overlap between the distributions. However, this method is sensitive to outliers. Therefore, [38] recommends calculating the means µ1 and µ2 as well as the standard deviations σ1 and σ2 of pd f 1 ( f ) and pd f 2 ( f ). Then, in order to estimate the overlap between the two distributions use the following limits for 3.2. ErrP Classification 23 each class k: lkmin = µk − |σk | lkmax = µk + |σk | (3.1) Thus outliers in the data will have very little effect on the estimate of the discriminant power. The robust discriminant power of a feature f for a two class problem is given by: RDP( f ) = ND1 + ND2 N1 + N2 (3.2) where N1 and N2 is the number of samples for each class. ND1 and ND2 is the number of discriminant samples which belong to each class. Discriminant samples are those which belong to a non-overlapping region of the class distribution and so: N1 ND1 = ∑ l(s1 (i) < l2min ) + l(s1 (i) > l2max ) (3.3) ND2 = ∑ l(s2 (i) < l1min ) + l(s2 (i) > l1max ) (3.4) i=1 N2 i=1 where sk (i) is the ith sample of class k and l(c) which is equal to 1 when the condition c is true and 0 otherwise. 3.2.2 Dimensionality reduction - principal components analysis (PCA) As discussed in section 3.2 EEG data has a large number of dimensions which are often correlated and dimensionality reduction is often desirable before any further analysis. The proposed method for ERP classification utilises the principal components analysis (PCA) and so it is briefly described here. If X is an n × d matrix, where n is the number of observed d dimensional points, then the covariance matrix of X is given by C = XT X If the observed points in X do not have zero mean, they should be recentred before the covariance matrix is calculated. C is a d × d matrix, the eigenvectors of which define d orthogonal directions along which the variance of the data in X is maximised. The magnitude of the corresponding eigenvalues provides information about the variance along each eigenvector. Usually, the first few eigenvectors are picked to form a lower dimensional space in which the data could be projected preserving as much variance 24 Chapter 3. Methods as possible. The preserved accumulative variance of the data projected onto an m dimensional subspace for (m < d) is given by σ2acc = ∑m i=1 λi ∑di=1 λi (3.5) where λi is the ith eigenvalue. 3.2.3 Discriminative classifier - logistic regression (LR) A logistic regression classifier is considered for the detection of ErrPs in EEG signals as it has already been reported to achieve satisfactory accuracy at classifying ERN potentials [36]. It is a discriminative classifier meaning that it directly models the class label probability distribution p(y|x, w) conditioned on the input data x and the model weights w. Logistic regression corresponds to the following binary classification model [39]: p(y|x, w) = Ber(y|sigm(wT x)) (3.6) where sigm(x) is the sigmoidal function defined as: sigm(x) = 1 1 + e−x and Ber(x|θ) is the Bernoulli distribution given by: θ if x = 1 Ber(x|θ) = 1 − θ if x = 0 The sigmoidal function introduces a nonlinearity in the negative log-lokelihood (NLL) of the model and so an analytical solution for the weights of the model does not exist. However, the Hessian matrix of the NLL is positive definite meaning that there is a single global optimum which could be found with an iterative optimisation algorithm such as gradient descent. The model fitting to the data was performed using MATLAB and the generalised linear model regression function (glmfit) part of the Statistical Toolbox. 3.2.4 Generative classifier - mixture of Gaussians (MOG) A generative classifier was also considered for the detection of ErrPs. A generative classifier models the distribution of features x given the class label c and the parameters 3.2. ErrP Classification 25 of the model θ i.e. p(x|y = c, θ). The probability of a new data point xnew to belong to class c could be found by using Bayes rule p(y = c|xnew , θ) ∝ p(xnew |y = c, θ)p(y = c) (3.7) The new point is assigned to the class with highest probability. It is common to assume that the class conditional distribution p(x|y = c, θ) could be represented as a mixture of Gaussians when classifying ErrPs. [8, 33, 38]. The mixture of Gaussians (MOG) model is given by K p(x|θ) = ∑ πk N(x|µk , Σk ) (3.8) k=1 where N(x|µ, Σ) is a normal distribution over x with mean µ and covariance Σ. πk are called mixing weights and they have to satisfy the 0 ≤ πk ≤ 1 and ∑K k=1 πk = 1. Fitting this model to the data is a non-convex problem and most often the expectation maximisation (EM) algorithm is used. It is a two-step iterative algorithm which firstly calculates the expected value of the NLL over the latent variables (the responsibility of each mixture component) and then maximises it [39]. A MOG model was fitted for each class using MATLAB and the (gmdistribution.fit) function part of Statistical Toolbox. 3.2.5 Temporal classifier - hidden Markov model (HMM) Markov models are widely applied to problems which involve time series such as EEG signals. A Markov chain is the joint probability distribution over all observed variables xt for a period T T p(x1:T ) = p(x1 ) ∏ p(xt |xt−1 ) t=2 where p(xt |xt−1 ) is the transition probability distribution. A hidden Markov model consists of a discrete-time, discrete-state Markov chain, with hidden states zt and observation model p(xt |zt ) often called emission probability distribution [39]. The HMM joint distribution is given by " T #" p(z1:T , x1:T ) = p(z1 ) ∏ p(zt |zt−1 ) t=2 T # ∏ p(xt|zt ) t=1 Since the observed EEG data, even though discretised, has a continuous nature, mixture of Gaussians (see equation 3.8) is often used as an observation model [8, 30, 40]. 26 Chapter 3. Methods An HMM is fitted to each class and new time series are assigned to the class which explains the new data with higher likelihood. Essentially, this is a generative classifier which uses a temporal model for the class conditional probability distributions. The HMMs were fitted to the data using MATLAB and the Hidden Markov Model (HMM) Toolbox 1 written by Kevin Murphy. 1 http://www.cs.ubc.ca/ murphyk/Software/HMM/hmm.html Chapter 4 Observing error related negativity The first step in the analysis of the results is to examine the recorded EEG data from the two conducted experiments and determine whether ERN components are present or not. The presence of ERN has already been reported for signals recorded by the Emotiv headset during the completion of the Flanker task [36]. However, the observation ErrP, evoked by the robotic arm task, is as much as 10 times weaker than the response ErrP, evoked by the Flanker task [27]. Observation ErrPs have been recorded only with research class EEG equipment using wet Ag/AgCl electrodes [27, 32]. Therefore, it is uncertain if observation ErrPs will be detected with dry electrodes and the Emotiv headset. This chapter describes how the recorded data is pre-processed and the ocular artefact analysis conducted. Furthermore, the mean recorded ERP signal is investigated and statistical analysis for the significance of the findings is performed. The chapter finishes with the evaluation of the Emotiv EPOC headset. 4.1 4.1.1 EEG data preprocessing Extracting event centred EEG epochs Since only ERPs are to be analysed, only the EEG activity during the occurrence of an event is of interest. Therefore, 1 second long epochs were extracted from the EEG data - starting half a second before and finishing half a second after an event. Table 4.1 lists the total count of extracted epochs for all participants in both experiments. Due to trimming artefacts caused by the extraction of the 1 second segments some epochs were dismissed. There are 2106 epochs in total for the Flanker task and 343 of them capture the EEG activity when an error was made by the subject. There are 5400 27 28 Chapter 4. Observing error related negativity Error event No error event Total epochs Flanker task 343 1763 2106 Robotic arm task 1759 3641 5400 Figure 4.1: Number of extracted epochs for all participants in both experiments during the occurrence of events with and without error. epochs for the robotic arm task and 1759 of them correspond to an observed error. 4.1.2 Frequency filtering One of the main disadvantages of electroencephalography is the low signal to noise ratio caused by the attenuation of the signals from the cortical neurons. Careful data pre-processing is essential for further analysis. Event related electromagnetic activity is relatively slow change of the field potentials and so the EEG data was bandpass filtered between 1Hz and 5Hz using Fast Fourier Transform. Figure 4.2 shows the raw and the filtered signal recorded from channel F7 both during correct and erroneous subject response. The amplitude of the raw signal is in µV measured with respect to the reference electrodes of the Emotive headset. Since we are interested in the shape of the EEG signal, not its absolute amplitude, the filtered signal is shifted such that it has zero mean value. As expected the raw EEG data is quite noisy, but selecting only slow frequencies improves the data quality significantly. At the bottom left plot in Fig. 4.2 a typical error related potential [24] could be observed - a drop in potential at approximately 100ms and then a positive peak at approximately 200 − 300ms after the error event. It must be noted that that the signal in Figure 4.2 was particularly picked such that it contains as little noise as possible for illustrative purpose only. Later sections in the section focus on more informative statistics such as the mean of the event related potentials, the variance of the ERPs and the statistical significance of the acquired data. 4.1.3 Analysis for eye movement artefacts Eye movements result in strong artefacts which deteriorate the quality of the EEG data. Various techniques have been suggested to either remove those artefacts or reject data which contains them [15]. Ultimately, one would use an eye tracking device and 4.1. EEG data preprocessing 29 F7 - Raw with error F7 - Raw without error F7 - Filtered with error F7 - Filtered without error Amplitude, µV 2300 2200 2100 2000 1900 Amplitude, µV 10 5 0 −5 −10 −400−200 0 200 400 Time, ms −400−200 0 200 400 Time, ms Figure 4.2: Top left is the raw signal at channel F7 recorded during the Flanker task associated with a committed error. Bottom left is the filtered version of the raw signal including only frequencies between 1 and 5 Hz. Top right is the raw signal recorded at channel F7 during the Flanker task when the subject correctly responded to the stimulus. Bottom right is the filtered out version of the raw signal including only frequencies between 1 and 5 Hz. 30 Chapter 4. Observing error related negativity HEOG - Flanker task HEOG - Roboarm task Amplitude, µV 300 200 100 −400−200 0 200 400 Time, ms −400−200 0 200 400 Time, ms Figure 4.3: The estimated mean HEOG for both experiments. The blue line is the mean HEOG during the extracted ERP epochs. The dashed red lines are ± 1 standard deviation away from the mean, forming a 68.2% confidence interval, assuming normal distribution. The horizontal red line is at 330µV which is the reported data rejection threshold by [41]. develop a model which predicts the EEG artefacts based on the detected movement. However, this requires additional equipment for the experiment and is not a trivial task. A more common approach is to infer eye movement from EEG data (electroocolarography (EOG)) and just reject short segments which occur during well-pronounced movements [35, 6, 13, 41]. The horizontal EOG (HEOG) can be extracted by calculating the absolute difference between the F7 and F8 channels. Figure 4.3 shows the mean HEOG estimated from the extracted and filtered ERP epochs. The amplitude of the observed HEOG is much smaller than the threshold of 330µV reported by [41] and so the ocular movement artefacts should not affect any further analysis. There are two potential reasons for the low HEOG amplitude. Firstly, the subjects were instructed to move their eyes as little as possible which decreases the probability of a movement to occur during an event. Secondly, eye movements produce narrow spikes in the EEG signal which should be removed by the applied bandpass filter. Given the reported threshold by [41] it is concluded that no further ocular artefact removal is necessary. 4.2 Mean ERPs Usually, ERPs are observed by calculating the mean signal from many trials. Assuming that the noise present in the EEG signal is white, then the mean ERP should reflect the 4.2. Mean ERPs 31 processes that take place in the brain during the processing of the event. 4.2.1 Mean response ErrP The mean response ErrP at channel F7 obtained during a correct and incorrect action while performing the FT is shown in figure 4.4a. When an error is present there is a strong negative peak at approximately 100ms after the event and a positive peak at about 250ms after the event. This waveform is identical to the one reported in [24] (see figure 2.4). The peak-to-peak amplitude is approximately 10µV . When no error is present there is also a very small negative deflection which is expected [24]. The mean ERP recorded at channel T8 is shown in 4.4b where response ErrP is not expected due to the frontal distribution over the scalp of ErrPs. It can be seen in 4.4b that there are some peaks present, but the peak-to-peak amplitude of the signal is approximately 1.5µV which is small enough to be safely ignored. Hence, the analysis of the mean response ErrP shows that response ErrPs can be recorded with the Emotiv EPOC headset from channels on the frontal lobe and are barely distinguishable in other regions of the scalp such as the temporal lobe. 4.2.2 Mean observation ErrP The mean observation ErrP at channel F7 obtained during a correct and incorrect action while performing the RAT is shown in figure 4.5a. When an error is present there is a strong negative peak at approximately 400ms after the event. This waveform is slightly different from the one reported in [27] (see figure 2.5). First of all, the peak is delayed with approximately 100ms and it is much steeper. This suggests that the detected ERP is not exactly an observation ErrP, but probably the N400 potential (see section 2.3.4). The experiment, however, was very repetitive and the subjects quickly learnt all possible states meaning that it is highly unlikely to be the N400. Moreover, the same signal shape (even more delayed) is reported in [32], where the RAT experiment is described. The delay could be potentially attributed to the time required to visually process the event of error. The peak amplitude of the recorded observation ErrP is approximately 4µV which is less than half of the response ErrP amplitude. Similar to the response ErrP, when no error is present there is a very small negative deflection. The mean ERP recorded at channel T8 is shown in 4.5b where observation ErrP is not expected. It can be seen in 4.5b that there are some peaks present, but their amplitude is negligible. Hence, the analysis of the mean observation ErrP shows that observation 32 Chapter 4. Observing error related negativity Mean response ErrP at F7 (FT) Amplitude, µV 10 5 0 −5 No error Error −400 −200 0 200 Time, ms 400 (a) Mean response ErrP recorded from channel F7. Mean response ErrP at T8 (FT) Amplitude, µV 0.5 0 −0.5 −1 No error Error −400 −200 0 200 Time, ms 400 (b) Mean observation ErrP recorded from channel T8. Figure 4.4: The mean response ErrP recorded at channel F7 (top) and T8 (bottom) during error and no error conditions . 4.2. Mean ERPs 33 Mean observation ERP at F7 (RAT) 1 Amplitude, µV 0 −1 −2 −3 −4 No error Error −400 −200 0 200 Time, ms 400 (a) Mean observation ErrP recorded from channel F7. Mean observation ERP at T8 (RAT) Amplitude, µV 0.2 0 −0.2 −0.4 −0.6 No error Error −400 −200 0 200 Time, ms 400 (b) Mean observation ErrP recorded from channel T8. Figure 4.5: The mean observation ErrP recorded at channel F7 (top) and T8 (bottom) during error and no error conditions. 34 Chapter 4. Observing error related negativity ErrPs can be recorded with the Emotiv EPOC headset from channels on the frontal lobe and are barely distinguishable in other regions of the scalp such as above the temporal lobe. 4.3 Significance of the data One way ANOVA analysis was carried out in order to determine how significant the shape of the ErrPs recorded during correct and incorrect events is. The null hypothesis is that ErrPs recorded during correct and incorrect events have no difference in the mean. The p-value was calculated for each time sample and the results for channel F7 are shown in figure 4.6. The probability of the recorded data to be explained by the zero hypothesis at the intervals of interest (between 0ms and 200ms after the event for the response ErrP and between 400ms and 500ms after the event for the observation ErrP) is virtually zero. Therefore, the results obtained from both experiments are statistically significant. Figure 4.7 shows the same analysis, but for the channel T8, where the difference between error and no error ERPs is statistically insignificant. 4.4 Evaluation of the Emotiv EPOC headset Even though some of the mainly used electrodes in the study of ErrPs, which are Cz and Fz, are not available on the Emotiv EPOC headset it was demonstrated that both response ErrPs and observation ErrPs can be recorded. The reported results confirm the fact the Emotiv EPOC headset actually records EEG activity rather than muscle activity only. However, there were some issues with the device. Given the design of the headset it was difficult to ensure correct placement of the electrodes. Not only that, but after 45 minutes of use subjects complained of discomfort caused by the mechanical pressure applied to the head by the headset. The dry electrodes manage to pick up EEG signals, but the quality of the contact with skin deteriorates relatively quickly. Figure 4.8 shows the quality of all contacts at the beginning of an experiment and at the end. This information is provided by the Emotiv Control Panel software. The quality of the contact is encoded with 5 colours - green, orange, yellow, red, black. Green means that the contact is excellent, while black indicates lack of contact. It can be seen that for the 45 minutes of use the state most of the contacts has worsened. Therefore, for any prolonged use the electrodes of the Emotiv EPOC headset should be periodically moistened with saline and repositioned. Additionally, there were some 4.4. Evaluation of the Emotiv EPOC headset 35 Significance of the mean response ErrP at F7 (FT) 1 10 5 0.6 0.4 p-value Amplitude, µV 0.8 0 0.2 −5 0 −400 −200 No error 0 200 Time, ms Error 400 0.05 p-value (a) The statistical significance of the shape of the mean response ErrP (FT) recorded at channel F7 for both error and no error conditions. 1 1 0 0.8 −1 0.6 −2 0.4 p-value Amplitude, µV Significance of the mean observation ErrP at F7 (RAT) 0.2 −3 0 −4 −400 −200 No error 0 200 Time, ms Error 400 p-value 0.05 (b) The statistical significance of the shape of the mean observation ErrP (RAT) recorded at channel F7 for both error and no error conditions. Figure 4.6: ANOVA analysis for the signals recorded at F7 - Error vs. No error conditions 36 Chapter 4. Observing error related negativity Significance of the mean response ErrP at T8 (FT) 10 1 5 0.6 0.4 0 p-value Amplitude, µV 0.8 0.2 −5 0 −400 −200 No error 0 200 Time, ms Error 400 0.05 p-value (a) The statistical significance of the shape of the mean response ErrP (FT) recorded at channel T8 for both error and no error conditions. 1 1 0 0.8 −1 0.6 −2 0.4 −3 0.2 −4 0 −400 −200 No error 0 200 Time, ms Error p-value Amplitude, µV Significance of the mean observation ErrP at T8 (RAT) 400 p-value 0.05 (b) The statistical significance of the shape of the mean observation ErrP (RAT) recorded at channel T8 for both error and no error conditions. Figure 4.7: ANOVA analysis for the signals recorded at T8 - Error vs. No error conditions 4.4. Evaluation of the Emotiv EPOC headset 37 (a) Contact quality at the beginning of (b) Contact quality at the end of the the experiment. experiment. Figure 4.8: Quality of contact before and after the experiment. The information is provided by the Emotiv Control Panel software and the quality of the contact is encoded with 5 colours - green, orange, yellow, red, black. Green means excellent contact, while black indicates lack of contact at all. connectivity issues occasionally causing the device to spontaneously disconnect. Since the system is closed source there was now way to track where the problem comes from. Nevertheless, the Emotiv EPOC headset was found to be an amazing low-cost research tool providing great opportunities for various experiments. Chapter 5 ErrPs classification results The previous chapter demonstrated that error related potentials can be detected in the EEG signals recorded by the Emotiv EPOC headset. This chapter investigates the classification of single trial ErrPs. It presents the results on the performance of 3 types of classifiers trained to discriminate between error or no error ErrPs. Various statistics are reported for each classifier type and advantages and disadvantages discussed. Additionally, a channel selection method based on the robust discriminative power of a feature is also presented. 5.1 Channel selection The Emotiv EPOC has 14 channels which provide data related to the electromagnetic activity of the subject’s brain. However, it is highly unlikely that the signals in all of the 14 channels are uncorrelated. Furthermore, it is known that the source of ErrPs is the anterior cingulate cortex in the frontal lobe and so it is expected that channels located on the frontal scalp regions would provide better signal than channels in the occipital lobe for example. This section describes a measure based on the robust discriminative power (RDP) which evaluates the extent to which the signal from a channel could be used to discriminate between error and no error events. The RDP of a feature was introduced in section 3.2.1, but how it is applied to channel selection was not discussed. In [38], where RDP was described, it is applied to a small set of features extracted from the power spectrum density of the EEG signal. However, the analysis of ErrPs, which focuses on transient waveforms, is in the time domain. Therefore, the following procedure to calculate ErrP score for each channel is proposed. 39 40 Chapter 5. ErrPs classification results The event-locked segments of 1 second EEG activity recorded by channel k could be represent as k serr (t) and k snoerr (t) where t ∈ [1; 128] is the time sample index and the subscript encodes the type of the event. There are two classes and the value of the signal at each sample point could be considered as a separate feature resulting in 128 features in total. Hence, the discriminant power of channel k, DPch (k), could be the sum of the RDPs of each sample point: 128 DPch (k) = ∑ RDP(k s(t)) (5.1) t=1 However, nothing prevents the measure in equation 5.1 from rating two channels with the same score even though the amplitude of the observed ERPs is much smaller in one of the channels. Therefore, the following measure is proposed where the RDP of each sample point is weighed with feature range which is estimated using the limits in equation 3.1 DPch (k) = 128 n ∑ o max min [lerr (t) − lerr (t)] RDP(k s(t)) (5.2) t=1 Thus, if the peak-to-peak amplitude of the detected signal is larger the channel will be assigned a higher ErrP score. The ErrP score of each channel during both of the conducted experiments was calculated according to equation 5.2 and the results are shown in figure 5.1. First of all, it could be immediately noticed that electrodes in the frontal lobe have significantly higher ErrP scores. This is expected as the ErrPs are reported to mainly occur in the frontal lobe (see section 2.3.6). A threshold value of 250 (shown as a horizontal red line in figure 5.1) was determined by inspection and so only channels with ErrP score greater than 250 are considered to provide signal which could be used to reliably discriminate between the presence and absence of error. This means that there are 6 channels which could be used for both tasks. Those channels for the FT are AF3, F7, F3, F4, F8 and AF4, while for the RAT they are the same with only F4 replaced by FC6. However, F4 in the FT has much higher score than FC6 in the RAT therefore F4 is considered for both tasks. Finally, a closer inspection of figure 5.1 reveals that the ErrP score of the selected channels is higher for the FT which indicates that observation ErrPs, invoked by the RAT, are more difficult to detect. This is expected, since the observation ErrP has smaller amplitude and is more likely to be masked by noise. Further data analysis, classifiers training and performance evaluation will be performed on the signals recorded from channels AF3, F7, F3, F4, F8 and AF4. 5.1. Channel selection 41 Flanker task ErrP Score 350 300 250 200 AF3 F7 F3 FC5 T7 P7 O1 O2 P8 T8 FC6 F4 F8 AF4 Channel Robotic arm task ErrP Score 350 300 250 200 150 AF3 F7 F3 FC5 T7 P7 O1 O2 P8 T8 FC6 F4 F8 AF4 Channel Figure 5.1: The ErrP score estimated for each channel during both the task according to equation 5.2. The horizontal red line at 250 represents a threshold which was chosen by inspection. 42 Chapter 5. ErrPs classification results Explained data variance - FT Explained data variance - RAT 1 Preserved accumulative variance Preserved accumulative variance 1 0.9 0.8 0.7 0.6 0.5 0.4 0.9 0.8 0.7 0.6 0.5 0.4 0 2 4 6 Number of dimensions AF3 F4 F7 F8 8 F3 AF4 0 2 4 6 Number of dimensions AF3 F4 F7 F8 8 F3 AF4 Figure 5.2: The preserved accumulative variance of the data when it is projected in a lower dimensional space for both the FT (left) and the RAT (right). 5.2 Dimensionality reduction Initial attempts to train a logistic classifier used the 1 second filtered segments as 128 dimensional data points. However, computational issues were encountered due to the fact that there was high correlation between the separate dimensions. This was not surprising as each dimension is a consecutive sample. Therefore, PCA was used to find a subspace with lower dimensionality for each of the channels in which the 128 dimensional data to be projected and preserve as much information as possible. Figure 5.2 shows the obtained results for each of the six channels of interest. Interestingly, if the data dimensionality is reduced from 128 down to 6 it is still possible to reconstruct it almost perfectly as indicated by the preserved accumulative variance value which is extremely close to 1. Therefore, for the training of the LR and the MOG classifiers the data was projected in a 6 dimensional space, which was separately determined for each of the two tasks using PCA. 5.3. Logistic regression classifier 5.3 43 Logistic regression classifier A logistic regression classifier was trained on the preprocessed singe trial ERPs data with reduced dimensionality for each channel. The data was split into two sets - training (80%) and testing set (20%). The reported accuracy is estimated as the number of correctly classified ERPs divided by the total number of ERPs in the testing set. The achieved accuracy levels for each channel for both tasks are shown at the top row of figure 5.3. The accuracy for the FT is similar to the reported results in [36], while the accuracy for the RAT task is comparable to the results in [35]. It is important to note that observation ErrPs are classified with worse accuracy than reaction ErrPs which is expected because they have smaller amplitude as already discussed and are more susceptible to noise. The number of positive samples (i.e. events with error) in the recorded dataset is less than a third from all samples (see figure 5.9). Therefore, a more informative measure of the classifier performance is the F1 score which is defined as F1 = 2Nt p 2Nt p + N f n + N f p (5.3) where Nt p is the number of true positive samples, N f n - number of false negative samples and N f p - number of false positive samples. The F1 scores of the logistic regression classifier for each channel of interest are shown at the bottom row of figure 5.3. The LR classifiers are much better at recognising reaction ErrPs corresponding to the FT rather than observation ErrPs. For the observation ErrPs each of the trained classifier performs slightly better than chance. Since one of the main disadvantage of surface EEG is noise, it is interesting to examine how the LR classifier copes with noise. An extreme case of noisy data, but quite likely, is to attempt to classify an observation ErrP with a classifier trained on response ErrP and vice versa. Therefore the classifiers for each channel trained on the FT were tested on the RAT and vice versa. Both the accuracy and the F1 score results for such cross-classification are shown in figure 5.4. The performance for either of the tasks does not change significantly meaning that the LR results in robust classifiers. Taking into account the fact that the F1 score is relatively low and the resistance to noise it could be concluded that LR model has most probably underfitted the training data. 44 Chapter 5. ErrPs classification results Logistic Regression Classifier ErrP classification - RAT 90 85 85 80 80 Accuracy, % Accuracy, % ErrP classification - FT 90 75 70 65 70 65 60 60 55 55 50 50 AF3 F7 F3 F4 F8 AF4 0.7 0.7 0.65 0.65 0.6 0.6 F1 Score F1 Score 75 0.55 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.55 0.5 0.5 0.45 0.45 0.4 AF3 F7 0.4 AF3 F7 F3 F4 Channel F8 AF4 Channel Figure 5.3: Top row: The classification accuracy achieved on the test set for each task FT (left) and RAT (right) using logistic regression. Bottom row: The F1 score achieved by the logistic regression classifiers trained for each channel and each task. 5.3. Logistic regression classifier 45 Logistic Regression Classifier ErrP cross-classification - RAT 85 85 80 80 Accuracy, % Accuracy, % ErrP cross-classification - FT 75 70 65 70 65 60 60 AF3 F7 F3 F4 F8 AF4 0.65 0.65 0.6 0.6 F1 Score F1 Score 75 0.55 0.5 0.45 0.45 F3 F4 Channel F8 AF4 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.55 0.5 AF3 F7 AF3 F7 Channel Figure 5.4: Top row: The classification accuracy achieved on the test set for the FT using a LR classifier trained on the RAT (left) and for the RAT using a classifier trained on the FT (right). Bottom row: The F1 score achieved during cross-classification. 46 5.4 Chapter 5. ErrPs classification results Mixture of Gaussians classifier The second classifier which was trained on the data was a generative mixture of Gaussians classifier. It was trained on the filtered single trial ERPs data with reduced dimensionality. The number of mixtures Nk was kept the same for both error and no error classes and it was determined using a model selection procedure. Values between 1 and 6 were tried, as suggested by [38], and it was found out that Nk = 4 gives the highest likelihood results. Since, fitting a MOG distribution to the data is not a convex problem, the EM algorithm was run 100 times and the best solution was selected. The results obtained for both tasks by training individual classifiers for each channel are shown in figure 5.5. It could be seen that the MOG classifiers perform slightly better than random guess only for some of the channels. As noticed before, the RAT is more difficult to learn than the FT. The obtained results are worse than the ones reported in [38], which could be attributed to the lower quality signal provided by the Emotiv EPOC headset. The results for cross-classification as described in the previous section are shown in figure 5.6. Unlike to the LR, MOG classifiers seem to be more sensitive to noise, which is expected given the fact that they model class properties instead of boundary characteristics. None of the MOG classifiers trained on the RAT could generate even 1 true positive prediction for the FT as seen on the bottom left plot in figure 5.6. 5.5 Hidden Markov model classifier The final classifier type considered is a generative classifier which models the class conditional distribution using a continuous output hidden Markov model. Each HMM was trained on the extracted 1 second long ERP segments, meaning that the sequences to be learnt are 128 samples long. First, model selection was performed in order to determine the number of hidden states and the number of Gaussian mixtures in the emission probability distributions. Both the number of states Q and the number of mixtures M were varied between 2 and 20, thus generating 400 different models. Based on how well each model explained the data it was found out that the most optimal choice is Q = 4 and M = 8. An HMM was fitted for each class and channel and for every fit the EM algorithm was run 20 times and the best solution was considered as the final one. The accuracy and the F1 score achieved using the HMM classifier are shown in figure 5.7. It can be seen that the performance is quite poor - none of the 5.5. Hidden Markov model classifier 47 Mixture of Gaussians Classifier ErrP classification - RAT 90 85 85 80 80 Accuracy, % Accuracy, % ErrP classification - FT 90 75 70 65 70 65 60 60 55 55 50 50 AF3 F7 F3 F4 F8 AF4 0.7 0.7 0.65 0.65 0.6 0.6 F1 Score F1 Score 75 0.55 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.55 0.5 0.5 0.45 0.45 0.4 AF3 F7 0.4 AF3 F7 F3 F4 Channel F8 AF4 Channel Figure 5.5: Top row: The classification accuracy achieved on the test set for each task FT (left) and RAT (right) using mixture of Gaussians classifier. Bottom row: The F1 score achieved by the MOG classifiers trained for each channel and each task. 48 Chapter 5. ErrPs classification results Mixture of Gaussians Classifier ErrP cross-classification - RAT 50 48 48 46 46 44 44 Accuracy, % Accuracy, % ErrP cross-classification - FT 50 42 40 38 40 38 36 36 34 34 32 32 30 30 AF3 F7 F3 F4 F8 AF4 0.6 0.6 0.5 0.5 0.4 0.4 F1 Score F1 Score 42 0.3 0.2 0.1 0.1 0 0 F3 F4 Channel F8 AF4 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.3 0.2 AF3 F7 AF3 F7 Channel Figure 5.6: Top row: The classification accuracy achieved on the test set for the FT using a MOG classifier trained on the RAT (left) and for the RAT using a classifier trained on the FT (right). Bottom row: The F1 score achieved during cross-classification. 5.5. Hidden Markov model classifier 49 Hidden Markov model classifier ErrP classification - RAT 80 75 75 70 70 Accuracy, % Accuracy, % ErrP classification - FT 80 65 60 55 60 55 50 50 45 45 40 40 AF3 F7 F3 F4 F8 AF4 0.8 0.8 0.7 0.7 0.6 0.6 0.5 0.5 F1 Score F1 Score 65 0.4 0.3 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.4 0.3 0.2 0.2 0.1 0.1 0 AF3 F7 0 AF3 F7 F3 F4 Channel F8 AF4 Channel Figure 5.7: Top row: The classification accuracy achieved on the test set for each task FT (left) and RAT (right) using a hidden Markov model classifier. Bottom row: The F1 score achieved by the HMM classifiers trained for each channel and each task. 50 Chapter 5. ErrPs classification results Hidden Markov model classifier ErrP cross-classification - RAT 80 70 70 Accuracy, % Accuracy, % ErrP cross-classification - FT 80 60 50 40 50 40 30 30 AF3 F7 F3 F4 F8 AF4 0.6 0.6 0.5 0.5 0.4 0.4 F1 Score F1 Score 60 0.3 0.2 0.1 0.1 0 0 F3 F4 Channel F8 AF4 F3 F4 F8 AF4 AF3 F7 F3 F4 F8 AF4 0.3 0.2 AF3 F7 AF3 F7 Channel Figure 5.8: Top row: The classification accuracy achieved on the test set for the FT using a HMM classifier trained on the RAT (left) and for the RAT using a classifier trained on the FT (right). Bottom row: The F1 score achieved during cross-classification. 5.6. Classifiers comparison Classifier 51 Reaction ErrP Observation ErrP Channel Accuracy F1 score Channel Accuracy F1 score LR F8 86.97 0.6061 AF4 69.66 0.5172 MOG F8 60.00 0.5942 AF4 57.44 0.5297 HMM F8 73.35 0.2710 F3 51.20 0.3973 Figure 5.9: Comparison between the best performing classifiers of each type trained classifiers reaches a better performance than random guess according to the F1 score. Potential reasons for that are discussed in the next section. The results for cross-classification are shown in 5.8. 5.6 Classifiers comparison The results achieved by the best performing classifiers for each of the 3 types are summarised in figure 5.9. Logistic regression seems to be the most suitable approach for the detection of both reaction ErrPs and observation ErrPs. It should be noted that logistic regression is the only discriminative classifier amongst the 3 considered. Since the EEG data is noisy, it is difficult to capture the key characteristics of each class which a generative classifier attempts to do. A discriminative classifier models the difference between the classes which seems to be beneficial in the given problem of ErrP classification. However, as mentioned before, it is very likely that the logistic regression model underfits the data. In subsequent analysis a multilayer perceptron will be also trained on the problem. There are several factors which could cause the poor performance of the HMM classifier. First of all, the length of the analysed 1 second segments (128 samples) is really small, compared to other reported applications of HMMs to EEG signals which use sequences of approximately 10 seconds [30]. Secondly, EEG signals are know to be chaotic and non-stationary, therefore a more complex model could be needed to capture the dynamics of ERP signals. The overall classification accuracy is not satisfactory and future work should address that issue by using other models for classification. However, it is naive to expect close to perfect accuracy and so in order to close the HRI control loop learning from noisy reward signals would be required. Chapter 6 Conclusion 6.1 Summary This thesis reported results on the usage of the Emotiv EPOC headset to detect error related event potentials and explored several simple methods for classification. It was shown that both observation ErrPs and response ErrPs can be detected with the Emotiv EPOC headset. A channel selection method was proposed which measures the extent to which the signals generated from each channel could be used for a later classification. As expected from previous research, channels located on the frontal lobe (AF3, F7, F3, F4, F8, AF4) detect the strongest ErrPs. Three types of classifiers were trained to classify single trial ErrP as being evoked from an event with or without error. The first classifier was a discriminative logistic regression classifier and it achieved the best classification performance (accuracy of 86.97% and F1 score of 0.6). A generative classifier modelling the class conditional densities with a mixture of Gaussians was also trained on the data, but its performance was not satisfactory. As the last classifier, a continuous hidden Markov model was trained on the ERPs for each condition (error and no error), trying to exploit the dynamics of the signal. However, the HMM classifier performed even worse than a random guess. Therefore, from those three relatively simple classifiers the logistic regression is the most suitable method for the given problem. The Emotiv EPOC headset was used throughout all of the experiments. Despite the minor problems which were encountered as discussed in section 4.4, the headset is a remarkable device which provides numerous opportunities for the research and development of neural based human-robot interfaces. 53 54 Chapter 6. Conclusion 6.2 Future work Since this is only a pilot study for a subsequent PhD in the field of human-robot interaction there are many directions in which the presented work could be developed and improved. First of all, other types of classifiers should be investigated in order to improve both the sensitivity and the precision rate of classification. Suitable candidates are a support vector machine and a multilayer perceptron. The poor performance of the discrete state HMMs was surprising, and so a discriminative classifier using HMMs such as the IOHMM should be tested. Furthermore, only principal components analysis was used for dimensionality reduction, but maybe a method such as independent components analysis will be more suitable. Another idea is to attempt feature extraction and detection in the frequency domain instead of the time domain. Second, despite the fact that the importance of the neuroergonomic approach was highlighted, the conducted experiments were in artificially controlled environment. This was imposed mainly by the time constraints of the for the project. The next step would be to perform an experiment similar to the robotic arm task, with an actual robot in a more realistic environment. There are many issues which need to be resolved in order to that. The most difficult one is that the actual robot will not move instantaneously as the simulated arm and there will be no distinguishable event which to synchronise the EEG data to. Therefore, a method for asynchronous ERP detection will have to be developed. Last, but not least, other types of ERPs should be extracted from the EEG signals which would enable a more robust intention inference. Other sensing modalities could be used as well - for example eye-tracking, surface EMG, RGBD data from the scene. It is not realistic to expect perfect classification rate for either of those modalities, but fusing them together should significantly improve the robustness of the human-robot interface. Bibliography [1] C. C. Kemp, A. Edsinger, and E. Torres-Jara, “Challenges for Robot Manipulation in Human,” no. March, pp. 20–29, 2007. [2] E. Guizzo, “Rescue-robot show-down,” Spectrum, IEEE, vol. 51, pp. 52–55, January 2014. [3] C. Lenz, G. Panin, T. Roder, and M. Wojtczyk, “Hardware-assisted Multiple Object Tracking for Human-Robot-Interaction,” Human-Robot Interaction (HRI), 2009 4th ACM/IEEE International Conference on, pp. 283–284, 2009. [4] J. A. Rivera-Bautista, A. C. Ramirez-Hernandez, V. a. Garcia-Vega, and A. Marin-Hernandez, “Modular control for human motion analysis and classification in Human-Robot interaction,” 2010 5th ACM/IEEE International Conference on Human-Robot Interaction (HRI), pp. 169–170, Mar. 2010. [5] X. Perrin, R. Chavarriaga, F. Colas, R. Siegwart, and J. D. R. Millán, “Braincoupled interaction for semi-autonomous navigation of an assistive robot,” Robotics and Autonomous Systems, vol. 58, pp. 1246–1255, Dec. 2010. [6] F. Galán, M. Nuttin, E. Lew, P. W. Ferrez, G. Vanacker, J. Philips, and J. D. R. Millán, “A brain-actuated wheelchair: asynchronous and non-invasive Braincomputer interfaces for continuous control of robots.,” Clinical neurophysiology : official journal of the International Federation of Clinical Neurophysiology, vol. 119, pp. 2159–69, Sept. 2008. [7] K. LaFleur, K. Cassady, A. Doud, K. Shades, E. Rogin, and B. He, “Quadcopter control in three-dimensional space using a noninvasive motor imagerybased brain–computer interface,” Journal of neural engineering, vol. 10, no. 4, p. 046003, 2013. 55 56 Bibliography [8] S. Chiappa, Analysis and Classification of EEG Signals using Probabilistic Models for Brain Computer Interfaces. PhD thesis, Ecole Polytechnique Federale de Lausanne, 2006. [9] S. T. Grafton and C. M. Tipper, “Decoding intention: a neuroergonomic perspective.,” NeuroImage, vol. 59, pp. 14–24, Jan. 2012. [10] R. Parasuraman and M. Rizzo, Neuroergonomics: The brain at work. 2006. [11] R. K. Mehta and R. Parasuraman, “Neuroergonomics: a review of applications to physical and cognitive work.,” Frontiers in human neuroscience, vol. 7, p. 889, 2013. [12] M. Anderson, “Embodied cognition: A field guide,” Artificial Intelligence, vol. 149, pp. 91–130, 2003. [13] J. Touryan, G. Apker, B. J. Lance, S. E. Kerick, A. J. Ries, and K. McDowell, “Estimating endogenous changes in task performance from EEG,” Frontiers in Neuroscience, vol. 8, June 2014. [14] M. F. Bear, B. W. Connors, and M. A. Paradiso, Neuroscience: Exploring the brain (3rd ed.). 2007. [15] R. Croft and R. Barry, “Removal of ocular artifact from the EEG: a review,” Neurophysiologie Clinique/Clinical Neurophysiology, 2000. [16] A. Burgess and J. Gruzelier, “Short duration synchronization of human theta rhythm during recognition memory,” Neuroreport, 1997. [17] L. T. Trujillo, M. A. Peterson, A. W. Kaszniak, and J. J. Allen, “Eeg phase synchrony differences across visual perception conditions may depend on recording and analysis methods,” Clinical Neurophysiology, vol. 116, no. 1, pp. 172 – 189, 2005. [18] P. Brown, S. Salenius, J. C. Rothwell, and R. Hari, “Cortical correlate of the piper rhythm in humans,” Journal of Neurophysiology, vol. 80, no. 6, pp. 2911–2917, 1998. [19] M. Rubinov and O. Sporns, “Complex network measures of brain connectivity: Uses and interpretations,” NeuroImage, vol. 52, no. 3, pp. 1059–1069, 2010. Bibliography 57 [20] H. G. Vaughan, L. D. Costa, and W. Ritter, “Topography of the human motor potential.,” Electroencephalography and clinical neurophysiology, vol. 25, pp. 1– 10, 1968. [21] W. G. Walter, R. Cooper, V. J. Aldridge, W. C. Mccallum, and a. L. Winter, “Contingent Negative Variation: an Electric Sign of Sensorimotor Association and Expectancy in the Human Brain.,” Nature, vol. 203, pp. 380–384, 1964. [22] J. Polich and A. Kok, “Cognitive and biological determinants of P300: An integrative review,” 1995. [23] W. J. Gehring, B. Goss, and M. G. H. Coles, “A neural system for error detection and compensation,” 1993. [24] M. Falkenstein, J. Hoormann, S. Christ, and J. Hohnsbein, “Erp components on reaction errors and their functional significance: a tutorial,” Biological Psychology, vol. 51, no. 23, pp. 87 – 107, 2000. [25] S. Hoffmann and M. Falkenstein, “Predictive information processing in the brain: errors and response monitoring.,” International journal of psychophysiology : official journal of the International Organization of Psychophysiology, vol. 83, pp. 208–12, Feb. 2012. [26] S. Nieuwenhuis, C. B. Holroyd, N. Mol, and M. G. Coles, “Reinforcementrelated brain potentials from medial frontal cortex: origins and functional significance,” Neuroscience & Biobehavioral Reviews, vol. 28, no. 4, pp. 441 – 448, 2004. [27] H. T. van Schie, R. B. Mars, M. G. H. Coles, and H. Bekkering, “Modulation of activity in medial frontal and motor cortices during error observation.,” Nature neuroscience, vol. 7, pp. 549–54, May 2004. [28] P. Ferrez and J. d. R. Millan, “Error-related EEG potentials generated during simulated braincomputer interaction,” Biomedical Engineering, IEEE . . . , vol. 55, pp. 923–9, Mar. 2008. [29] L. F. Nicolas-Alonso and J. Gomez-Gil, “Brain Computer Interfaces, a Review,” Sensors, vol. 12, pp. 1211–1279, 2012. 58 Bibliography [30] S. Solhjoo, “Classification of chaotic signals using HMM classifiers: EEG-based mental task classification,” Proc. European Signal . . . , pp. 18–21, 2005. [31] V. Bostanov, “BCI competition 2003 - Data sets Ib and IIb: Feature extraction from event-related brain potentials with the continuous wavelet transform and the t-value scalogram,” IEEE Transactions on Biomedical Engineering, vol. 51, pp. 1057–1061, 2004. [32] I. Iturrate, L. Montesano, and J. Minguez, “Robot reinforcement learning using EEG-based reward signals,” in 2010 IEEE International Conference on Robotics and Automation, pp. 4822–4829, IEEE, May 2010. [33] P. Ferrez and J. Millán, “You are wrong!Automatic detection of interaction errors from brain waves,” 2005. [34] M. Krauledat, G. Dornhege, B. Blankertz, F. Losch, G. Curio, and K.-R. Müller, “Improving speed and accuracy of brain-computer interfaces using readiness potential features.,” Conference proceedings : ... Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Conference, vol. 6, pp. 4511–4515, 2004. [35] R. Chavarriaga and J. del Millán, “Learning from EEG error-related potentials in noninvasive brain-computer interfaces,” Neural Systems and Rehabilitation Engineering, vol. 18, no. 4, pp. 381–388, 2010. [36] C. Vi and S. Subramanian, “Detecting Error-Related Negativity for Interaction Design,” in Proceedings of the 2012 ACM annual conference on Human Factors in Computing Systems - CHI ’12, (New York, New York, USA), p. 493, ACM Press, May 2012. [37] M. Duvinage, T. Castermans, T. Dutoit, M. Petieau, T. Hoellinger, C. D. Saedeleer, K. Seetharaman, and G. Cheron, “A P300-based Quantitative Comparison between the Emotiv Epoc Headset and a Medical EEG Device,” Biomedical Engineering / 765: Telehealth / 766: Assistive Technologies, 2012. [38] P. Ferrez, Error-Related EEG Potentials in Brain-Computer Interfaces. PhD thesis, 2007. [39] K. Murphy, Machine learning: a probabilistic perspective. 2012. Bibliography 59 [40] H. L. H. Lee and S. C. S. Choi, “PCA+HMM+SVM for EEG pattern classification,” Seventh International Symposium on Signal Processing and Its Applications, 2003. Proceedings., vol. 1, no. 2, pp. 1–4, 2003. [41] M. E. Maier, G. Di Pellegrino, and M. Steinhauser, “Enhanced error-related negativity on flanker errors: Error expectancy or error significance?,” Psychophysiology, vol. 49, pp. 899–908, 2012.