Architetture di sistemi per la videosorveglianza
Transcription
Architetture di sistemi per la videosorveglianza
University of Genova Department of Biophysical and Electronic Engineering Multimedia Surveillance Systems Prof. C.Regazzoni Data Fusion and Bayesian Interaction Modeling for Cognitive Ambient Intelligence Multimedia Surveillance Systems (MSS) Multimedia Surveillance Systems (MSS) are designed for assisting the human operator(s) in the monitoring of various kinds of environments, e.g.: Transport environments (railway stations, level crossings, car parks, urban roads, highways, etc.); Commercial and financial environments (supermarkets, banks, jewel shops etc.); Tourist environments (naturalistic parks, tourist villages, historical town centres, etc.). vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Characterization A Multimedia Surveillance System is characterized by: Processing of heterogeneous data from different types of sensors; Transmission over heterogeneous communication channels; Multiple signal representation and event detection abstraction levels for presentation to an improved human interface. A MSS is a system able to process, transmit and present in different ways heterogeneous data transmitted over various communication channels. vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Examples of functionalities (1) Detection of thefts and vandalism; Detection of unauthorized accesses (both of people and vehicles) to restricted areas; Prevention of dangerous situation related to the presence of hidden bombs; Prevention of dangerous situations related to the presence of unauthorized persons in areas only allowed to vehicles (e.g. railway tracks, highway tollgates, airport strips etc.); vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Examples of functionalities (2) People counting and crowding estimations; Detection of cars stopping within level crossing areas; Traffic monitoring and control. vIdeo and SIgnal Processing for Telecommunications – ISIP40 Overall video surveillance market Estimated marked size based on manufacturer total revenue (not end user) vIdeo and SIgnal Processing for Telecommunications – ISIP40 Overall surveillance market 2009 Global video surveillance market: $8.26 Billion Asia: $3.28 Billion Americas: $2.87 Billion EMEA: $2.05 Billion Projected 2014 market size and implicit Compound annual growth rate (CAGR): Americas: $4.48 Billion with CAGR of 8.9% Asia: $6.6 Billion with CAGR of 15.1% EMEA: $3.3 Billion with CAGR of 10% vIdeo and SIgnal Processing for Telecommunications – ISIP40 IP Video adoption by segments vIdeo and SIgnal Processing for Telecommunications – ISIP40 IP Video adoption by segments The graph shows the percentage adoption of network vs analog video by segment. Residential: ~15% Healthcare and Education: ~45% Retail and Banking: ~12% Transport and Government: ~35% High correlation between network video adoption and surveillance over larger physical area vIdeo and SIgnal Processing for Telecommunications – ISIP40 Grow projections by segment The table forecasts growth rates by segment vIdeo and SIgnal Processing for Telecommunications – ISIP40 Grow projections by segment Key claims to note: Transportation market size projected to more than double to $3.6 Billion (or 25% of the global market) Government market size projected to double to $2.3 Billion Banking/finance and retail projected for lowest growth rates vIdeo and SIgnal Processing for Telecommunications – ISIP40 IP Video Surveillance growth IP video surveillance product sales will increase by 200% total between 2010 and 2012 overtaking analog CCTV sales Mainstream 720p/1.3MP cameras cut into the cost advantage of analog cameras Mainstream hybrid DVRs remove the barriers of legacy deployments to migrate to IP Emerging managed/hosted video undermines analog's advantages in small deployments Emerging panoramic cameras further cut the cost advantage of analog cameras Maturing IP camera standards will cut costs and reduce complexity vIdeo and SIgnal Processing for Telecommunications – ISIP40 Video analytics Only "moderate growth" projected this year and next 30% annual growth rate by 2012-2013 2008 sales volume: 38,000 channels server based, 40,000 channels edge based (cameras, encoders) Government and Transportation about 50% of total video analytics market IMS states, "The market is still awaiting that “killer application,” and no one seems certain yet what that will be" vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Scientific projects US Projects (Defense Advanced Research Projects Agency-DARPA) [Active] Mind’s Eye (ME - develop in machines a capability that exists only in animals: visual intelligence) [Active] Combat Zones That See (CTS - track everything that moves in a city by linking up a massive network of surveillance cameras to a centralized computer system) [Active] Heterogeneous Aerial Reconnaissance Team (HART – Aerial surveillance) [1997-2000] VSAM Video Surveillance and Monitoring European Projects [2010-2013] SUPPORT Project - Autonomous Monitoring for Securing European Ports [2010-2012] VANAHEIM Project - Autonomous Monitoring Of Underground Transportation Environment [2008-2011] COFRIEND Project - Cognitive Vision System able to adapt itself to unexpected situations [2005-2008] SERKET Project - Video Surveillance Architectures [2004-2006] AVITRACK Project - Airport Apron Monitoring [2002-2004] INMOVE Intelligent Mobile Video Environments [2002-2004] REOST Railway Electro Optical System for Safe Transportation [2000-2003] ADVISOR - Subway Monitoring [1994-1997] PASSWORDS, AVS-PV - Video Surveillance vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Scientific projects Italian national interest projects [1999-2000] Sistemi intelligenti per l’elaborazione e la trasmissione di segnali multidimensionali per applicazioni di video-sorveglianza in tempo reale (Ministero dell’Università e della Ricerca Scientifica); [2002-2006] VICOM-FIRB Virtual Immersive Communications (Fondo Italiano per la Ricerca di Base - Ministero dell’Istruzione, dell’Università e della Ricerca); [2002-2004] Industrial grant Elsag PSA (sistema di videosorveglianza con l’uso di sensori video omnidirezionali); [2002-2005] Architetture distribuite ed eterogenee per sistemi di sorveglianza multi-sensoriali (Prot. N. 7280/297 MIUR); [2008-2010] SINTESIS (Integrated Security System with Distributed Intelligence) vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems Reference model for a generic MSS Network interface Peripheral modem intelligence Sensor layer Local processing layer Network access HUB Network layer PC cluster Operator layer vIdeo and SIgnal Processing for Telecommunications – ISIP40 Multimedia video-surveillance systems First generation (1960-1980) monitor cluster, control panel and analog transceiver Analog transceiver Sensor layer Switching devices Local processing layer Network layer (coax 75 ohm cable network) vIdeo and SIgnal Processing for Telecommunications – ISIP40 Storage and retrieval Multimedia video-surveillance systems Second generation (1980-2000) Network interface IEEE 802.11 Cable modem VDSL modem Sensor layer Peripheral intelligence: source coding Local processing layer (channel coding and TX) Network access HUB WAN and/or LAN Network layer vIdeo and SIgnal Processing for Telecommunications – ISIP40 PC cluster (attention driving) Operator layer Multimedia video-surveillance systems Third generation (2000-?) Network interface UMTS Cable modem ADSL ISDN Peripheral intelligence (embedded image processing and understanding) modem Local processing layer Network access HUB Decision planning Network layer Sensor layer vIdeo and SIgnal Processing for Telecommunications – ISIP40 Operator layer Historical and technological evolution of video-surveillance systems Sensor Information type processing Analog None First Generation (1960-1980) Second Analog/ Generation Digital (1980-2000) Third Digital Generation (2000 - ?) Centralised at operator level Centralised/ distributed (configurable) Information transmission Analog/Digital CCTV Information presentation Cyclic multiplexing of analog/digital video signals Automatic signalling of “interesting situations” (e.g. alert signalling) Relevant processing tools Analog TV cameras, box for digital video encoding Analog CCTV, Analog/digital Analog over coax cameras, frame networks, wireless grabbers, PCs, digital, wired digital digital wireless standards (e.g, 802.11), digital wired standards (cable modems, xDSL, etc.) Wireless digital, Configurable signalling Digital intelligent wired digital at semantic level sensors, (configurable) communtication interface language (e.g. JAVA), software radio technologies for radio link reconfigurability vIdeo and SIgnal Processing for Telecommunications – ISIP40 Beyond 3rd Generation: cognitive surveillance Improved perception and reasoning over monitored events will allow efficient dynamic context representation within complex environments Extended communications new research trends go in the direction of extended interaction capabilities: User will be able to interact with occurring events • Directly • Through virtual “agents” Towards a new paradigm…. Cognitive Surveillance vIdeo and SIgnal Processing for Telecommunications – ISIP40 Cognitive surveillance Analysis & Representation Sensing AVS systems Data Fusion -------Learning Physical/Virtual World Beyond 3rd generation Action & Comms Decision Extended comms vIdeo and SIgnal Processing for Telecommunications – ISIP40 Beyond 3rd Generation Research lines: Extended Perception/reasoning Fusion coordination problems (physical synchronization and action timing) Learning stage needed (increasing difficulty with increasing complex interactions) Extended communications management vIdeo and SIgnal Processing for Telecommunications – ISIP40 Extended perception & reasoning The described scene understanding basic methods cooperate to achieve extended perception Mainly based on low level scene understanding techniques extended reasoning Increasing abstraction level information processing vIdeo and SIgnal Processing for Telecommunications – ISIP40 What can we do now Intelligent sensors Video Analytics • Limited set of robust 24/7/12 functionalities – E.g. Intrusion detection, People counting, Traffic analysis Networking Architecture • Thousands of connected heterogeneous (e.g. analog, IP, etc.) cameras • High speed wireless connections Coverage type • On line metadata and video streaming • First examples of networked embedded video analytics vIdeo and SIgnal Processing for Telecommunications – ISIP40 What can we do now Control Center Architecture • Coexistence of multiple vendors Video Management Systems • Audio-Video centered multimedia event presentation Functionalities • A posteriori Event/Situation interactive retrieval • Off-line Data fusion and Decision support vIdeo and SIgnal Processing for Telecommunications – ISIP40 What would/could we have now/tomorrow Cognitive sensors Video analytics • 24/7 extended set of robust functionalities (e.g. crowd analysis, action recognition, etc.) Cognitive surveillance/Operator in the loop • Integrated on line anomaly detection and decision support Networking Architecture improvements • Heterogeneous networks interoperability - Ad hoc fault resistent networks - Scalable multilevel data transmission Extended coverage with software/cognitive nodes networks • Mature networked embedded nodes - On line understanding and prediction of complex multicamera events vIdeo and SIgnal Processing for Telecommunications – ISIP40 What would/could we have now/tomorrow Control center Architecture • Interoperable standards for VMS • Heterogeneous data source fusion and presentation • Tools for operator reactions monitoring Functionalities • Improved performance retrieval techniques (time and space) • Cooperative distributed on line analysis and prediction for safety mainteinance • Cooperative and distributed automated learning from experience vIdeo and SIgnal Processing for Telecommunications – ISIP40 Information per time and space unit Technological trends in Cognitive sensors DECISION PREDICTIONS SITUATIONS ? OBJECTS ? ? CHASING FEATURES SIGNALS 1995 2000 2005 2010 2015 2020 vIdeo and SIgnal Processing for Telecommunications – ISIP40 ? Cognitive surveillance system architecture Control Centre Mobile Terminal Network Cognitive Node Intelligent Sensing Node(s) vIdeo and SIgnal Processing for Telecommunications – ISIP40 Networking: architecture and extended coverage vIdeo and SIgnal Processing for Telecommunications – ISIP40 Technological trends in Control centers ? ? Information per time and space unit ? ? DECISION SUPPORT PREDICTION RETRIEVAL SINGLE SENSOR 2000 2005 2010 2015 2020 vIdeo and SIgnal Processing for Telecommunications – ISIP40 Examples of existing video-surveillance systems (Communications) Wireless networks: CCTV(R) products from Wavelength (UK) (first generation commercial system which transmits JPEG compress images with IEEE 802.11 standard). About 11.000 USD; Wireless networks: system for recognition of stolen car number plate with intelligent sensors processing in-situ the information acquired among highways traffic (prototypic system operating in USA) (S. S. Shyne, “Distributed surveillance network utilises neural networks for stolen vehicle detection”, Proceedings of the SPIE, Vol. 2938, 1997, pp. 186-190). Wired Network: System for video surveillance over Closed Circuit Twisted Pair (CCTP) for the digital age; a security system for airports, offices, schools, healthcare facilities, and government & military. Wired network: system for turistic villages monitoring, developed in the CEE project AVS-RIO, with information transmission over a coaxial cable CATV network (second generation prototypic system operating in Riomaggiore municipality). vIdeo and SIgnal Processing for Telecommunications – ISIP40 Examples of existing video-surveillance systems (Video management systems) Genetec OnSSI Milestone Avigilon Mirasys March Networks Axis vIdeo and SIgnal Processing for Telecommunications – ISIP40 Examples of existing video-surveillance systems (Computer Vision) Scene understanding (Multiprocessor and Multisensor): AxonX (http://www.axonx.com/): SigniFire (fire and smoke detection) Acuity (http://acuitysystem.com/): AcuVI (waterside protection) AgentVI (http://agentvi.com/): VI-System (Real-Time Detection & Alert Software) VideoIQ (http://www.videoiq.com/): iCVR object tracking embedded/server based VCATechnology (http://www.vcatechnology.com/): embedded and pc based traffic analysis Bosch (http://www.boschsecurity.it): Intelligent Video Analysis (IVA) IoImage (http://www.ioimage.com): embedded and server based object tracking ObjectVideo (http://www.objectvideo.com/): embedded and server based intrusion detection, traffic analysis TechnoAware (http://www.technoaware.com): server based or embedded object tracking, traffic management, people counting, intrusion detection vIdeo and SIgnal Processing for Telecommunications – ISIP40 MSS main physical features System architecture; Environmental coverage: # and type of sensors; System complexity: # and processing power of processors; Bandwidth and communication channels features. Network characteristics: open or dedicated network Processing capabilities: # frames/sec; vIdeo and SIgnal Processing for Telecommunications – ISIP40 MSS main logical features Multiple access characteristics: #of end users simultaneously supported System functionalities # and type of supported functionalities; System performances: misdetection and false alarms probabilities; vIdeo and SIgnal Processing for Telecommunications – ISIP40 MSS Methodologies Research field Purpose Examples of methodologies Artificial intelligence -Understanding -Intelligence distribution -Agent based systems -Cooperative systems Data fusion Multi-sensor integration Computer Vision –Object and behavior characterization –Event classification - Data alignment - Attribute estimation - Object identity estimation - Shape, motion or texture analysis - Tracking -Sensor calibration Pattern recognition -Model learning -Feature extraction and selection for event classification - Neural networks -PCA Analysis Signal processing - Source Coding and transmission - Object recognition - Compression algorithms - Change detection -Information detection and hiding Communications Signal transmission -Channel coding -Multiple access techniques - Reconfigurable devices and networks vIdeo and SIgnal Processing for Telecommunications – ISIP40 Different methodologies in MSS Computer Vision Tracking example 1st detected blob 2nd detected blob vIdeo and SIgnal Processing for Telecommunications – ISIP40 Different methodologies in MSS Data Fusion Shape extraction without using multisensor information Shape extraction using multisensorial information vIdeo and SIgnal Processing for Telecommunications – ISIP40 Different methodologies in MSS Pattern Recognition Retrieval and classification with SHOSLIF trees Probed samples Voronoi space tessellation Quasi-Voronoi space tessellation vIdeo and SIgnal Processing for Telecommunications – ISIP40 Retrieved samples Different methodologies in MSS Artificial Intelligence An agent framework No matching Matching vIdeo and SIgnal Processing for Telecommunications – ISIP40 Different methodologies in MSS Communication ADSL phone line HFC net Wireless network vIdeo and SIgnal Processing for Telecommunications – ISIP40 Signal and Video Processing for MSS Signal Processing for MSS must focus the system attention on the most important information (depending on the functionality) for: Recognition Communication There are several constraints regarding: Processing time / real-time requirements (for recognition); Transmission time / available transmission band (for communication). vIdeo and SIgnal Processing for Telecommunications – ISIP40 Signal Processing for MSS Signal Processing techniques vary on the basis of the processed data nature (audio, video, tactile, etc.) Video sensors will be considered (passive sensors) Sensors layer Data processing for alarm generation vIdeo and SIgnal Processing for Telecommunications – ISIP40 Signal Processing for MSS Steps that follows Signal Processing techniques are mainly: Transmission of data over heterogeneous channels to a remote site. Data can be transmitted over open networks (such as Internet) or dedicated channels. Main problems in these cases are related with protection of information against noise and unauthorized access and data integrity; System intelligence: signal processing methods for MSS are able to automatically extract an high level symbolic representation of the processed data suitable for pattern recognition and computer vision steps in order to wake up the attention of the human operator. vIdeo and SIgnal Processing for Telecommunications – ISIP40 Intrusion detection vIdeo and SIgnal Processing for Telecommunications – ISIP40 People counter vIdeo and SIgnal Processing for Telecommunications – ISIP40 Behavior analysis vIdeo and SIgnal Processing for Telecommunications – ISIP40 Abandoned objects vIdeo and SIgnal Processing for Telecommunications – ISIP40 Traffic management vIdeo and SIgnal Processing for Telecommunications – ISIP40 PTZ Tracking vIdeo and SIgnal Processing for Telecommunications – ISIP40 PTZ tracking vIdeo and SIgnal Processing for Telecommunications – ISIP40 Other functions vIdeo and SIgnal Processing for Telecommunications – ISIP40