libatk 1.0 0.dll

Transcription

libatk 1.0 0.dll
Multi-View Annotation Tool
(Manual)
Ákos Utasi and Csaba Benedek
April 19, 2012
1
Compiling the sources
Linux
Uncompress the mvatool.tar.gz archive to get the sources (e.g. tar -zxvf
mvatool.tar.gz), and simply build the tool by make. Note that GTK+ 2.0,
OpenCV, and libxml2 libraries are required to build the application. It has
been tested on Ubuntu Linux 11.10 using OpenCV 2.1 libraries.
Windows
The mvatool vs10 win32.zip archive contains the Visual Studio 2010 win32
project of the software. Unzip the archive and simply double click the .sln
to open the project. You will need the GTK+ 2.0, OpenCV, libiconv, and
libxml2 libraries to build the application. You either build them yourself
or download from our website http://web.eee.sztaki.hu/~ucu/mvatool
the pre-built GTK+ (gtk+.zip), libiconv (libiconv-1.9.2-1.zip), and
libxml2 (libxml2-2.7.6.win32.zip) win32 libraries we tested the application with. Simply unpack the three zip archives on drive C:\. The application has been tested on Windows 7 using OpenCV 2.3.0. Note that the
following changes to the project might be necessary to build the application:
• Additional Include Directories: change the paths if you are using
different library versions, or different folders to store your libraries;
• Additional Library Directories: change the paths if you are using
different library versions, or different folders to store your libraries;
1
• Additional Dependencies: change the library files if you are using
different versions.
You will also need the following dlls to be accessible from the PATH (assuming Release configuration):
freetype6.dll
iconv.dll
intl.dll
libatk-1.0-0.dll
libcairo-2.dll
libexpat-1.dll
libfontconfig-1.dll
libgdk-win32-2.0-0.dll
libgdk pixbuf-2.0-0.dll
libgio-2.0-0.dll
libglib-2.0-0.dll
libgmodule-2.0-0.dll
libgobject-2.0-0.dll
libgthread-2.0-0.dll
libgtk-win32-2.0-0.dll
libpango-1.0-0.dll
libpangocairo-1.0-0.dll
libpangoft2-1.0-0.dll
libpangowin32-1.0-0.dll
libpng14-14.dll
libxml2.dll
opencv core230.dll
opencv highgui230.dll
opencv imgproc230.dll
zlib1.dll
We collected these dlls (excluding the OpenCV libs) into a single zip
archive, which can be downloaded from our website (dlls win32.zip). Simply copy the dlls from the archive into the folder of the MVATool.exe executable. Optionally you will also need the dlls of TBB in your PATH, if your
OpenCV is built with TBB support.
If you still have problems compiling or using the tool, please do not hesitate to contact us at [email protected] (Linux related questions) or at
[email protected] (Windows/VS2010 related questions).
2
Figure 1: Graphical user interface of the application.
2
Using the tool
The application should be executed from the command line, usage:
./MVATool database
where database is the absolute path of the file containing all the information
necessary for annotation, and is described in Sec. 3. Once the tool successfully
loaded the database you will receive a similar message in the standard output:
Using database: /home/utasi/test/db.txt
Number of camera views: 3
Ground file loaded: -13962.00,-13335.00;-1774.00,1578.00
Number of images: 3
Camera models loaded.
Finally the GUI will appear on your screen, see Fig. 1. In the GUI the upper
buttons control the actual position in the sequences. The images in the next
row display an overview of the whole scene. You can add a new ground truth
3
annotation by simply clicking on one of these camera images. In this case
the attributes of the annotation are initialized by default values, and the
annotation is projected to the other views. Fine tuning of the parameters
can be done by the spin buttons:
• Person properties: X, Y are the positions, and Z is the height of the
center line;
• Person ground occupancy: W, H are the sizes of the ground rectangle,
R is the rotation, and X, Y are the positions.
Use the buttons of the Person navigation to navigate between the ground
truth annotations, or to Delete an annotation. The actual annotation is also
display in the zoomed images to help the fine tuning.
3
Database format
The database file required by the tool is a simple text file and contains the following lines. The 1st line denotes the number of camera views k. The absolute
path of the file containing the real world area of interest (AOI) coordinates
in the 2nd line, see Sec. 3.1 for details. The next k lines contain the absolute
paths of the list files of camera images, see Sec. 3.2. Finally, the last k lines
contain the absolute paths of the camera calibration XML documents (further information can be found at http://www.cvg.rdg.ac.uk/PETS2009/).
Example, k = 3:
3
/home/utasi/test/aoi.txt
/home/utasi/test/files 001.txt
/home/utasi/test/files 002.txt
/home/utasi/test/files 003.txt
/home/utasi/test/View 001.xml
/home/utasi/test/View 002.xml
/home/utasi/test/View 003.xml
Sample databases for Linux (mva test.tar.gz) and Windows (mva test.zip)
can be downloaded from our website. Both archives contain calibration data
and images from the PETS City Center sequence.
4
3.1
Area of interest
The AOI rectangle is defined in a simple text file, by its real world coordinates
(x1 ; y1 ; x2 ; y2 ). Example:
-13962.0;-13335.0;-1774.0;1578.0
3.2
Camera image lists
The list file is a simple text document, and each line contains the absolute
path of a camera image. Example:
/home/utasi/test/View 001/frame 0792.jpg
/home/utasi/test/View 001/frame 0793.jpg
/home/utasi/test/View 001/frame 0794.jpg
4
Helper scripts for evaluation
We provide the trim.sh and merge.sh scripts to create a compatible data
format for our multi-view evaluation tool http://web.eee.sztaki.hu/~ucu/
3dmpp/. The real world annotations created by MVATool are stored in the
base directory of the database. Here you should create a list.txt list file
containing all the annotation file names in separate lines. Example:
frame 0792.txt
frame 0793.txt
frame 0794.txt
Copy the trim.sh shell script into this folder and execute it. The resulting
files are compatible with the evaluation tool using the GPE metric. Use the
merge.sh script to create data for the PPE metric (assuming 3 cameras).
Usage:
merge.sh out view1 view2 view3
where
• out: output folder;
• view1, view2, view3: folders of camera images, where the annotation
tool stores the projected annotations.
Example:
merge.sh gt annotation/ View 001/ View 002/ View 003/
5
5
Citations
Please reference our papers when using the annotation software or our annotations for evaluation.
@article{utasiVIGTA12,
author = {Utasi, {\’A}. and Benedek, {Cs}.},
title = {A {3-D} Marked Point Process Model for Multi-View People
Detection},
booktitle = {Proceedings of the First International Workshop on Visual
Interfaces for Ground Truth Collection in Computer Vision Applications},
month = {May},
year = {2012},
address = {Capri, Italy}
}
@article{utasiTCSVT12,
author = {Utasi, {\’A}. and Benedek, {Cs}.},
title = {A Bayesian Approach on People Localization in Multi-Camera
Systems},
journal = {IEEE Transactions on Circuits and Systems for Video Technology},
year = {2012}
}
_________________________________________
/ Louis, I think this is the beginning
\
\
of a beautiful FRIDAY!
/
----------------------------------------\
,__,
\ (oo)____
(__)
)\
||--|| *
6