Javed Aslam Professor and Associate Dean of Faculty College of
Transcription
Javed Aslam Professor and Associate Dean of Faculty College of
Javed Aslam Professor and Associate Dean of Faculty College of Computer and Information Science Northeastern University Awards and honors and year received (list--no more than *five* items): • ASONAM Best Student Paper Award, 2014 • ECIR Best Poster Paper Award, 2013 • Northeastern University Excellence in Teaching Award, Finalist, 2013 • National Science Foundation Career Award, 2001 • Dartmouth College Class of 1962 Faculty Fellow (university-wide teaching award), 2001 Have you previously been involved in any CRA activities? If so, describe. I have not previously been involved with the CRA. List any other relevant experience and year(s) it occurred (list--no more than *five* items). • Northeastern University, College of Computer and Information Science, Associate Dean of Faculty, Fall 2013–present • Northeastern University, Academic Plan Steering Committee, Faculty of the Future, Chair 2015-16 • SIGIR 2016, Program co-Chair • Northeastern University, College of Computer and Information Science, Associate Dean and Director of the Graduate School, 2013 • SIGIR 2009, General co-Chair. Research interests: (list only) • Information retrieval • Machine learning • Applications of information theory • Design and analysis of algorithms Personal Statement Within my home discipline of Information Retrieval, I have served as both the Program Chair and General Chair for the main international conference in the area, SIGIR. Within my university, I currently serve as the Associate Dean of Faculty within my college, and I serve on the universitywide Academic Plan Steering Committee, chairing the working group on the Faculty of the Future. I am particularly interested in the issue of interdisciplinarity, both in research and in faculty development. Fully 1/3 of the faculty in my College have interdisciplinary appointments, including many that I recruited as Hiring Chair and Associate Dean. Brief Biography or CV (Attached) Javed Alexander Aslam Office Address College of Computer and Information Science Northeastern University 360 Huntington Ave, #202WVH Boston, MA 02115 (617) 373-8169 [email protected] Home Address 15 Columbine Rd Weston, MA 02493 (781) 431-4529 Current Position Professor and Associate Dean of Faculty, College of Computer and Information Science, Northeastern University. Research Interests Information retrieval, machine learning, applications of information theory, and the design and analysis of algorithms. Education Massachusetts Institute of Technology Ph.D. in Electrical Engineering and Computer Science, February 1995. Thesis: “Noise Tolerant Algorithms for Learning and Searching.” Advisor: Prof. Ronald L. Rivest. Minor: Queuing Theory. Massachusetts Institute of Technology M.S. in Electrical Engineering and Computer Science, January 1992. Thesis: “Inferring Graphs from Walks.” Advisor: Prof. Ronald L. Rivest. University of Notre Dame B.S. in Electrical Engineering, summa cum laude, June 1987. Majors in Mathematics and Electrical Engineering. Teaching Experience Northeastern University Boston, MA Average overall student evaluation of teaching e↵ectiveness (28 courses, 545 responses): 4.6/5.0. Discrete Structures (Fall 2009, Fall 2007, Fall 2006, Fall 2005). An undergraduate course on discrete mathematics. Information Retrieval (Spring 2012, Fall 2006). A graduate course on the design and evaluation of information retrieval systems. Information Retrieval (Summer 2010, Summer 2009, Summer 2008, Summer 2006, Summer 2005). An undergraduate course on the design and evaluation of information retrieval systems. Advanced Algorithms (Fall 2012, Fall 2007, Fall 2006, Fall 2004). A graduate course on the design and analysis of algorithms. Applications of Information Theory to Computer Science (Fall 2015, Fall 2011, Fall 2008, Fall 2005). A graduate course on information theory and its applications. Machine Learning (Fall 2014, Fall 2013, Fall 2012, Fall 2011). A graduate course on machine learning theory, implementation, and applications. 1 Theory of Computation (Spring 2004). A graduate course on formal language theory and formal models of computation. Theory of Computation (Fall 2009, Fall 2008, Fall 2004, Fall 2003). An undergraduate course on formal language theory and formal models of computation. Dartmouth College Hanover, NH Average overall student teaching evaluation during tenure-track position (12 courses, 397 responses): 4.86/5.0. Concepts in Computing (Winter 2000, Summer 1999, Summer 1996). An undergraduate course o↵ering an introduction to computer science for non-majors. Algorithms (Fall 2002, Fall 2001, Summer 1999, Fall 1998, Fall 1997, Fall 1996, Fall 1995). An undergraduate course on the design and analysis of algorithms. Implementation of Programming Languages (Spring 1997). An undergraduate course on the principles and techniques of compiler construction. Theory of Computation (Winter 2001, Winter 1997). An undergraduate course on formal language theory and formal models of computation. Principles of Programming Languages (Winter 1999, Winter 1998, Winter 1996). An undergraduate course comparing and contrasting imperative, functional, logical and object-oriented programming languages. Programming Languages (Spring 1999, Spring 1998, Spring 1997). A graduate course covering the design and implementation of imperative, functional, logical and objectoriented programming languages. Machine Learning and Information Retrieval (Winter 2001). A graduate course covering computational learning theory and its applications to information retrieval. Information Theory and Its Applications (Spring 2003). A graduate course covering information theory and its applications to various other disciplines. Research Experience Northeastern University Boston, MA Professor and Associate Professor, College of Computer and Information Science (Fall 2003 – Present). Continuing research in information retrieval, machine learning, applications of information theory, and the design and analysis of algorithms. Dartmouth College Hanover, NH Assistant Professor (Summer 1998 – Summer 2003) and Visiting Assistant Professor (Fall 1995 – Spring 1998) of Computer Science. Conducted research in computational learning theory (specifically learning in the presence of noise and the connections between learning theory, information theory and complexity theory) and information retrieval (specifically metasearch and data fusion, o↵-line organization of static data, on-line organization of dynamic data, and constructing a digital library for functional magnetic resonance imaging studies). Harvard University Cambridge, MA Post-doctoral position in the Division of Engineering and Applied Sciences (Summer 1994 – Summer 1995). Conducted research on learning in the presence of noise and other aspects of computational learning theory. Advisor: Leslie Valiant. 2 Massachusetts Institute of Technology Cambridge, MA Research Assistant (Fall 1987 – Spring 1994). Conducted research in computational learning theory (learning in the presence of noise), graph algorithms (graph inference and graph colorability), and fault-tolerant computing (searching in the presence of errors). Advisor: Ronald Rivest. Professional Activities Conference Organization1 SIGIR 2016, Program co-Chair. SIGIR 2009, General co-Chair. Conference Track Organization2 TREC 2015 Temporal Summarization Track, co-organizer. TREC 2014 Temporal Summarization Track, co-organizer. TREC 2013 Temporal Summarization Track, co-organizer. TREC 2008 Million Query Track, co-organizer. Program Committees WSDM 2016, SIGIR 2015, WSDM 2015, ECIR 2015, ASONAM 2015, NAACLHLT 2015, SIGIR 2014, ECIR 2014, CIKM 2014, BigData 2014, SIGIR 2013, OAIR 2013, ICTIR 2013, AIRS 2013, MUBE 2013, SIGIR 2012, ECIR 2012, CIKM 2012, NAACL-HLT 2012, SIGIR 2011, WWW 2011, CIKM 2011, SIGIR 2010, WWW 2010, EVIA 2010, SIGIR 2008, ECIR 2008, AIRS 2008, CIKM 2008, EVIA 2008, SIGIR 2007, CIKM 2007, EVIA 2007, SIGIR 2006, CIKM 2006, IISWC 2006, SIGIR 2005, SIGIR 2004, SIGIR 2003, SIGIR 2002, COLT 2002, ALT 2001, SIGIR 2001. Funding Panels National Science Foundation, Intelligent Information Systems, 2014. National Science Foundation, Information Technology Research, 2002. Medical Research Council of the United Kingdom, E-Science, 2002. Other Committees and Panels Strategic Workshop on Information Retrieval in Lorne, SWIRL 2012. Olin College Curriculum Assessment Panel, 2003. IR/LM Information Retrieval and Language Modeling committee, 2002. Neuroinformatics Task Force of the Organization for Human Brain Mapping, 2001. Referee/Reviewer AAAI National Conference on Artificial Intelligence, ACM Symposium on Theory of Computing, ACM Transactions on Information Systems, Foundations and Trends in Information Retrieval, IEEE Computing in Science & Engineering. IEEE Symposium on Foundations of Computer Science, IEEE Transactions on Computers, IEEE Transactions on Mobile Computing, Information and Computation, Information Processing Letters, Information Retrieval, International Conference on Computing and Information, International Joint Conference on Artificial Intelligence, Journal of the Association for Computing Machinery, Journal of Computer and System Sciences, Journal of Machine Learning Research, Machine Learning, SIAM Symposium on Discrete Algorithms, Theoretical Computer Science, Transactions on Information Systems, Transactions on Knowledge and Data Engineering, and World Wide Web Journal 1 2 SIGIR is the annual ACM Conference on Research and Development in Information Retrieval. TREC is the annual Text REtrieval Conference sponsored by the National Institute of Standards and Technology. 3 University Activities College of Computer and Information Science, Northeastern University Associate Dean of Faculty, Fall 2013–present University Academic Plan Steering Committee, Faculty of the Future, Chair 2015–16 Faculty Hiring Committee, Chair 2015–16 Provost Search Committee, Fall 2014–Spring 2015 Cluster Hire Search Committee (Data Science), Chair 2013–14 Dean Search Committee, Chair 2012–14 Associate Dean and Director, Graduate School, Spring–Summer 2013 Faculty Senate Agenda Committee, 2013 Faculty Senate, 2011–13 Graduate Committee, Member 2012–13, 2004–2005 Faculty Senate Ad hoc Committee to Assess the Role of Full-time, Non-tenure-track Faculty, Member 2012–13 Sabbatical Committee, Chair 2012–13, Member 2011–12 Undergraduate Committee, Member 2011–12, 2003–2004 Committee on “The Future of CS 3500”, Chair 2012, 2010 Ad hoc Undergraduate Curriculum Review Committee, Chair 2011 Faculty Hiring Committee, Chair 2005–2010 Colloquium Chair, 2009–2010, 2003–2004 Tenure Committee, Member 2006–present Committee to Evaluate the Dean of the College of Health Science, Member 2008 Institute for Information Assurance, Acting Co-director 2006 Committee to Evaluate the Chair of the Mathematics Department, Member 2006 Natural Computing Group, Faculty Advisor 2003–2005 Ad hoc Committee on Workload, Chair 2004 Department of Computer Science, Dartmouth College Equipment Committee, Chair 2002–2003 Kemeny Prize Committee, Chair and/or Member 1996–2003 Dartmouth Undergraduate Journal of Science, Faculty Advisor 2000–2003 The Basement Project, Faculty Advisor 1999–2003 Swing Kids, Faculty Advisor 1998–2003 Colloquium Chair, 1999–2001 Faculty Recruiting Committee, Member 2000-2001 Freshman Advisor, 2000–2001 Ad hoc Committee on Distance Learning, Member 2001 Ph.D. Admissions Committee, Member 1998–2000 Curriculum Committee, Member 1998–1999 Sta↵ Recruiting Committee, Member 1998–1999 Journal Articles On the Network You Keep: Analyzing Persons of Interest Using Cliqster Saber Shokat Fadaee, Mehrdad Farajtabar, Ravi Sundaram, Javed A. Aslam, and Nikos I. Passas. In Social Network Analysis and Mining, 5(63):1–14, December 2015. Harnessing the Power of GPUs to Speed Up Feature Selection for Outlier Detection Fatemeh Azmandian, Ayse Yilmazer, Jennifer Dy, Javed A. Aslam, and David R. Kaeli. In Journal of Computer Science and Technology, 29(3):408–422, May 2014. 4 Increasing Evaluation Sensitivity to Diversity Peter Golbus, Javed A. Aslam, and Charles L. A. Clarke. Invited to appear in Information Retrieval, 16(4):530–555, August 2013. Virtual Machine Monitor-based Lightweight Intrusion Detection Fatemeh Azmandian, Micha Moffie, Malak Alshawabkeh, Jennifer G. Dy, Javed A. Aslam, and David R. Kaeli. In Operating Systems Review, 45(2):38–53, 2011. Variational Bayes for Modeling Score Distributions With Keshi Dai, Evangelos Kanoulas, and Virgil Pavlu. Invited to appear in Information Retrieval, 14(1):47–67, 2011. Implementing and Evaluating Phrasal Query Suggestions for Proximity Search With Alan Feuer and Stefan Savev. Invited to appear in Information Systems, 34(8):711–723, 2009. Estimating Average Precision When Judgments are Incomplete With Emine Yilmaz. Invited to appear in Knowledge and Information Systems, 16(2):173-211, 2008. Derivation of the Tumor Position from External Respiratory Surrogates with Periodical Updating of the Internal/External Correlation With E. Kanoulas, G. C. Sharp, R. I. Berbeco, S. Nishioka, H. Shirato, and S. B. Jiang. In Physics in Medicine and Biology, 52:5443–5456, 2007. Persistent Queries over Dynamic Text Streams With Katya Pelekhov and Daniela Rus. In International Journal of Electronic Business, 3(3/4):288–299, 2005. The Kerf Toolkit for Intrusion Analysis With Sergey Bratus, David Kotz, Ron Peterson, Daniela Rus, and Brett Tofel. In IEEE Security & Privacy, 2(6):42–52, 2004. The Star Clustering Algorithm for Static and Dynamic Information Organization With Katya Pelekhov and Daniela Rus. In Journal of Graph Algorithms and Applications, 8(1):95–129, 2004. Three Power-aware Routing Algorithms for Sensor Networks With Qun Li and Daniela Rus. In Wireless Communications and Mobile Computing, 3(2):187–208, 2003. The Functional Magnetic Resonance Imaging Data Center (fMRIDC): The Challenges and Rewards of Large-scale Databasing of Neuroimaging Studies With John D. Van Horn, Je↵rey S. Grethe, Peter Kostelec, Je↵rey B. Woodward, Daniela Rus, Daniel Rockmore, and Michael S. Gazzaniga. In Philosophical Transactions of the Royal Society of London B, 356:1323–1339, 2001. Specification and Simulation of Statistical Query Algorithms for Efficiency and Noise Tolerance With Scott E. Decatur. Invited to appear in Journal of Computer and System Sciences, 56(2):191–208, 1998. 5 General Bounds on Statistical Query Learning and PAC Learning with Noise via Hypothesis Boosting With Scott E. Decatur. In Information and Computation, 141(2):85–118, 1998. On the Sample Complexity of Noise-Tolerant Learning With Scott E. Decatur. In Information Processing Letters, 57:189–195, 1996. On-line Algorithms for 2-Coloring Hypergraphs via Chip Games With Aditi Dhagat. In Theoretical Computer Science, 112(2):355–369, 1993. Journal Abstracts Derivation of the Tumor Position From External Respiratory Surrogates with Periodical Updating of External/Internal Correlation With E. Kanoulas, B. Sharp, R. Berbeco, S. Nishioka, H. Shirato, and S. Jiang. In Medical Physics, 33(6):2232–2233, June 2006. Presented at the 48th Annual Meeting of the American Association of Physicists in Medicine (AAPM 2006). The fMRI Data Center: A Progress Report With John Van Horn, Je↵rey Woodward, Daniel Rockmore, Joseph Edelman, Bennet Vance, Sarene Schumacher, and Michael Gazzaniga. In NeuroImage, 19(2), June 2003. Presented at the 9th International Conference on Functional Mapping of the Human Brain, June 19–22, 2003, New York, NY. Statistical Time Course Feature Vectors for Use in Rapid Assessment and Clustering With John Darrell Van Horn, Je↵rey Woodward, Je↵rey Grethe, and Michael Gazzaniga. In NeuroImage, 16(2), June 2002. Presented at the 8th International Conference on Functional Mapping of the Human Brain, June 2–6, 2002, Sendai, Japan. The fMRI Data Center: An Introduction With Je↵rey S. Grethe, John D. Van Horn, Je↵rey B. Woodward, Souheil Inati, Peter J. Kostelec, Daniel Rockmore, Daniela Rus, and Michael S. Gazzaniga. In NeuroImage, 13(6):S135, June 2001. Presented at the 7th International Conference on Functional Mapping of the Human Brain, June 10–14, 2001, Brighton, UK. A National Data Center for the Storage and Retrieval of Neuroimaging Data With Peter Kostelec, Je↵rey Grethe, Daniel Rockmore, Robert Fendrich, Scott Grafton, and Michael Gazzaniga. In Society for Neuroscience Abstracts, 26(2):2235, 2000. Book Chapters The Star Clustering Algorithm for Information Organization With Ekaterina Pelekhov and Daniela Rus. In Grouping Multidimensional Data: Recent Advances in Clustering, pages 1–23. Springer, January 2006. A Lifetime-Optimizing Approach to Routing Messages in Ad-hoc Networks With Qun Li and Daniela Rus. In Ad Hoc Wireless Networking, pages 1–43. Networking Theory and Applications, Vol. 14. Kluwer Academic Publishers, December 2003. The fMRI Data Center: Software Tools for Neuroimaging Data Management, Inspection, and Sharing With John Darrell Van Horn, Je↵rey B. Woodward, Geo↵rey Simonds, Bennet Vance, Je↵rey S. Grethe, Mark Montague, Daniela Rus, Daniel Rockmore, and Michael S. Gazzaniga. In Neuroscience Databases: A Practical Guide, pages 221–236. Kluwer Academic Publishers, September 2002. 6 Proceedings Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval Edited with James Allan, Mark Sanderson, ChengXiang Zhai, and Justin Zobel. ACM Press, July 2009. Refereed Conference Papers Aggregation of Crowdsourced Ordinal Assessments and Integration with Learning to Rank: A Latent Trait Model Pavel Metrikov, Virgil Pavlu, and Javed A. Aslam. In Proceedings of the 24th ACM International Conference on Information and Knowledge Management (CIKM), pages 1391–1400. ACM Press, October 2015. Anytime Planning of Optimal Schedules for a Mobile Sensing Robot Jingjin Yu, Javed A. Aslam, Sertac Karaman, and Daniela Rus. In Proceedings of the 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5279–5286. IEEE, September 2015. Securing Virtual Execution Environments Through Machine Learning-based Intrusion Detection Fatemeh Azmandian, David R. Kaeli, Jennifer G. Dy, and Javed A. Aslam. In Proceedings of the 25th IEEE International Workshop on Machine Learning for Signal Processing (MLSP), pages 1–6. IEEE, September 2015. The Network You Keep: Analyzing Persons of Interest Using Cliqster Saber Shokat Fadaee, Mehrdad Farajtabar, Ravi Sundaram, Javed A. Aslam, and Nikos I. Passas. In Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM), pages 122–129. IEEE, August 2014. On the Information Di↵erence Between Standard Retrieval Models Peter Golbus and Javed A. Aslam. In Proceedings of the 37th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 1135– 1138. ACM Press, July 2014. An Analysis of Crowd Workers Mistakes for Specific and Complex Relevance Assessment Task Jesse Anderton, Maryam Bashir, Virgil Pavlu, and Javed A. Aslam. In Proceedings of the 22nd ACM International Conference on Information and Knowledge Management (CIKM). ACM Press, October 2013. A Modification of LambdaMART to Handle Noisy Crowdsourced Assessments Pavel Metrikov, Jie Wu, Jesse Anderton, Virgil Pavlu, and Javed A. Aslam. In Proceedings of the 4th International Conference on the Theory of Information Retrieval (ICTIR). ACM Press, September 2013. A Mutual Information-based Framework for the Analysis of Information Retrieval Systems Peter Golbus and Javed A. Aslam. In Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, July 2013. 7 A Document Rating System for Preference Judgements Maryam Bashir, Jesse Anderton, Jie Wu, Peter Golbus, Virgil Pavlu, and Javed A. Aslam. In Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, July 2013. Live Nuggets Extractor: A Semi-automated System for Text Extraction and Test Collection Creation Matthew Ekstrand-Abueg, Virgil Pavlu, and Javed A. Aslam. In Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, July 2013. Optimizing nDCG Gains by Minimizing E↵ect of Label Inconsistency Pavel Metrikov, Virgil Pavlu, and Javed A. Aslam. In Advances in Information Retrieval: 35th European Conference on IR Research (ECIR). Lecture Notes in Computer Science. Springer-Verlag, April 2013. Best Poster Paper Award. GPU-Accelerated Feature Selection for Outlier Detection Using the Local Kernel Density Ratio Fatemeh Azmandian, Ayse Yilmazer, Jennifer G. Dy, Javed A. Aslam, and David R. Kaeli. In Proceedings of the 12th IEEE International Conference on Data Mining (ICDM). IEEE Computer Society, December 2012. Feature Weighting and Selection Using Hypothesis Margin of Boosting Malak Alshawabkeh, Javed A. Aslam, Jennifer G. Dy, and David R. Kaeli. In Proceedings of the 12th IEEE International Conference on Data Mining (ICDM). IEEE Computer Society, December 2012. Local Kernel Density Ratio-Based Feature Selection for Outlier Detection Fatemeh Azmandian, Jennifer Dy, Javed A. Aslam, and David R. Kaeli. In Journal of Machine Learning Research - Proceedings Track, November 2012. Proceedings of the 4th Asian Conference on Machine Learning (ACML). City-scale Traffic Estimation From a Roving Sensor Network Javed A. Aslam, Sejoon Lim, Xinghao Pan, and Daniela Rus. In Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems (SenSys). ACM Press, November 2012. Constructing Test Collections by Inferring Document Relevance via Extracted Relevant Information Shahzad Rajput, Matthew Ekstrand-Abueg, Virgiliu Pavlu, and Javed A. Aslam. In Proceedings of the 21st ACM Conference on Information and Knowledge Management (CIKM). ACM Press, October 2012. Enhanced Boosting-based Algorithm for Intrusion Detection in Virtual Machine Environments Malak Alshawabkeh, David R. Kaeli, Javed A. Aslam, Jennifer G. Dy, and Dana Schaa. In Proceedings of the First International Workshop on Secure and Resilient Architectures and Systems (SRAS). ACM Press, September 2012. Securing Cloud Storage Systems Through a Virtual Machine Monitor Fatemeh Azmandian, David R. Kaeli, Jennifer G. Dy, Javed A. Aslam, and Dana Schaa. In Proceedings of the First International Workshop on Secure and Resilient Architectures and Systems (SRAS). ACM Press, September 2012. 8 Congestion-aware Traffic Routing System using Sensor Data Javed A. Aslam, Sejoon Lim, and Daniela L. Rus. In Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems (ITSC). IEEE, September 2012. Markov-Based Redistribution Policy Model for Future Urban Mobility Networks Mikhail Volkov, Javed A. Aslam, and Daniela Rus. In Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems (ITSC). IEEE, September 2012. Impact of Assessor Disagreement on Ranking Performance Pavel Metrikov, Virgiliu Pavlu, and Javed A. Aslam. In Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. ACM Press, August 2012. Extended Expectation Maximization for Inferring Score Distributions Keshi Dai, Evangelos Kanoulas, Virgil Pavlu, and Javed A. Aslam. In Advances in Information Retrieval: 34th European Conference on IR Research (ECIR), April 2012. IR System Evaluation using Nugget-based Test Collections Virgil Pavlu, Shahzad Rajput, Peter B. Golbus, and Javed A. Aslam. In Proceedings of the Fifth ACM International Conference on Web Search and Data Mining (WSDM), February 2012. Feature Selection Metric Using AUC Margin for Small Samples and Imbalanced Data Classification Problems Malak Alshawabkeh, Javed A. Aslam, Jennifer Dy, and David Kaeli. In Proceedings of the Tenth International Conference on Machine Learning and Applications (ICMLA), December 2011. A Novel Feature Selection for Intrusion Detection in Virtual Machine Environments Malak Alshawabkeh, Javed A. Aslam, David Kaeli, and Jennifer Dy. In Proceedings of the 23rd IEEE International Conference on Tools with Artificial Intelligence (ICTAI), November 2011. A Nugget-based Test Collection Construction Paradigm Shahzad Rajput, Virgil Pavlu, Peter B. Golbus, and Javed A. Aslam. In Proceedings of the 20th ACM Conference on Information and Knowledge Management (CIKM), October 2011. Workload Characterization at the Virtualization Layer Fatemeh Azmandian, Micha Moffie, Jennifer G. Dy, Javed A. Aslam, and David R. Kaeli. In Proceedings of the 19th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunications Systems (MASCOTS), July 2011. A Large-scale Study of the E↵ect of Training Set Characteristics over Learning-to-rank Algorithms Evangelos Kanoulas, Stefan Savev, Pavel Metrikov, Virgiliu Pavlu, and Javed A. Aslam. In Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2011. 9 Constructing Collections for Learning to Rank Emine Yilmaz, Evangelos Kanoulas, Stephen Robertson, and Javed Aslam. In Proceedings of the 11th Dutch-Belgian Information Retrieval Workshop (DIR), February 2011. E↵ective Virtual Machine Monitor Intrusion Detection Using Feature Selection on Highly Imbalanced Data With Malak Alshawabkeh, Micha Moffie, Fatemeh Azmandian, Jennifer Dy, and David Kaeli. In Proceedings of the Ninth International Conference on Machine Learning and Applications(ICMLA), December 2010. Score Distribution Models: Assumptions, Intuition, and Robustness to Score Manipulation With Evangelos Kanoulas, Keshi Dai, and Virgiliu Pavlu. In Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2010. Empirical Justification of the Gain and Discount Function for nDCG With Evangelos Kanoulas. In Proceedings of the Eighteenth ACM Conference on Information and Knowledge Management (CIKM), November 2009. Modeling the Score Distributions of Relevant and Non-relevant Documents With Evangelos Kanoulas, Virgil Pavlu, and Keshi Dai. In Proceedings of the 3rd International Conference on Theory in Information Retrieval (ICTIR), September 2009. Document Selection Methodologies for Efficient and E↵ective Learning-torank With Evangelos Kanoulas, Virgil Pavlu, Stefan Savev, and Emine Yilmaz. In Proceedings of the 32nd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2009. If I Had a Million Queries With Ben Carterette, Virgil Pavlu, Evangelos Kanoulas, and James Allan. In Advances in Information Retrieval: 31st European Conference on IR Research (ECIR), April 2009. On Auditing Elections When Precincts Have Di↵erent Sizes With Raluca A. Popa and Ronald L. Rivest. In Proceedings of the 2008 USENIX/ACCURATE Electronic Voting Technology Workshop (EVT), July 2008. Evaluation Over Thousands of Queries With Ben Carterette, Virgil Pavlu, Evangelos Kanoulas, and James Allan. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2008. A New Rank Correlation Coefficient for Information Retrieval With Emine Yilmaz and Stephen Robertson. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2008. A Simple and Efficient Sampling Method for Estimating AP and NDCG With Emine Yilmaz and Evangelos Kanoulas. In Proceedings of the 31st Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2008. 10 Inferring Document Relevance from Incomplete Information With Emine Yilmaz. In Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management (CIKM), November 2007. Evaluation of Phrasal Query Suggestions With Alan Feuer and Stefan Savev. In Proceedings of the Sixteenth ACM Conference on Information and Knowledge Management (CIKM), November 2007. On Estimating the Size and Confidence of a Statistical Audit With Raluca A. Popa and Ronald L. Rivest. In Proceedings of the 2007 USENIX/ACCURATE Electronic Voting Technology Workshop (EVT), August 2007. Query Hardness Estimation Using Jensen-Shannon Divergence Among Multiple Scoring Functions With Virgil Pavlu. In Advances in Information Retrieval: 28th European Conference on IR Research (ECIR), 2007. Semi-supervised Data Organization for Interactive Anomaly Analysis With Virgil Pavlu and Sergey Bratus. In Proceedings of the Fifth International Conference on Machine Learning and Applications (ICMLA), December 2006. Estimating Average Precision with Incomplete and Imperfect Judgments With Emine Yilmaz. In Proceedings of the Fifteenth ACM International Conference on Information and Knowledge management (CIKM), August 2006. A Statistical Method for System Evaluation Using Incomplete Judgments With Virgil Pavlu and Emine Yilmaz. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2006. Inferring Document Relevance via Average Precision With Emine Yilmaz. In Proceedings of the 29th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2006. A Geometric Interpretation and Analysis of R-precision With Emine Yilmaz. In Proceedings of the Fourteenth ACM International Conference on Information and Knowledge Management (CIKM), October 2005. The Maximum Entropy Method for Analyzing Retrieval Measures With Emine Yilmaz and Virgil Pavlu. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2005. A Geometric Interpretation of R-precision and Its Correlation with Average Precision With Emine Yilmaz and Virgil Pavlu. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2005. Measure-based Metasearch With Virgil Pavlu and Emine Yilmaz. In Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2005. 11 A Sampling Technique for Efficiently Estimating Measures of Query Retrieval Performance Using Incomplete Judgments With Virgil Pavlu and Emine Yilmaz. In Proceedings of the Workshop on Learning with Partially Classified Training Data at the 22nd International Conference on Machine Learning (ICML), August 2005. Kerf: Machine Learning to Aid Intrusion Analysts In Proceedings of the 13th USENIX Security Symposium, August 2004. Work-in-progress report. Tracking a Moving Object with a Binary Sensor Network With Zack Butler, Florin Constantin, Valentino Crespi, George Cybenko, and Daniela Rus. In Proceedings of the First International Conference on Embedded Networked Sensor Systems (SenSys), November 2003. A Unified Model for Metasearch, Pooling, and System Evaluation With Virgiliu Pavlu and Robert Savell. In Proceedings of the Twelfth International Conference on Information and Knowledge Management (CIKM), November 2003. A Unified Model for Metasearch and the Efficient Evaluation of Retrieval Systems via the Hedge Algorithm With Virgiliu Pavlu and Robert Savell. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2003. On the E↵ectiveness of Evaluating Retrieval Systems in the Absence of Relevance Judgments With Robert Savell. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2003. An Information-theoretic Measure for Document Similarity With Meredith Frost. In Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2003. The Kerf Toolkit for Intrusion Analysis With Sergey Bratus, David Kotz, Ron Peterson, Daniela Rus, and Brett Tofel. In Proceedings of the 2003 IEEE Workshop on Information Assurance, June 2003. Distributed Energy-conserving Routing Protocols With Qun Li and Daniela Rus. In Proceedings of the Thirty-Sixth Annual Hawaii International Conference on System Sciences (HICSS), January 2003. Condorcet Fusion for Improved Retrieval With Mark Montague. In Proceedings of the Eleventh International Conference on Information and Knowledge Management (CIKM), November 2002. Relevance Score Normalization for Metasearch With Mark Montague. In Proceedings of the Tenth International Conference on Information and Knowledge Management (CIKM), November 2001. Models for Metasearch With Mark Montague. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, September 2001. 12 Metasearch Consistency With Mark Montague. In Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, September 2001. Online Power-aware Routing in Ad hoc Wireless Networks With Qun Li and Daniela Rus. In Proceedings of the Seventh Annual International Conference on Mobile Computing and Networking (MOBICOM), July 2001. Using Mobile Agents for Analyzing Intrusion in Computer Networks With Marco Cremonini, David Kotz, and Daniela Rus. In Proceedings of the Seventh ECOOP Workshop on Mobile Object Systems, June 2001. Hierarchical Power-aware Routing in Sensor Networks With Qun Li and Daniela Rus. In Proceedings of the DIMACS Workshop on Pervasive Networking, May 2001. Clustering Data without Prior Knowledge With Alain Leblanc and Cli↵ Stein. In Algorithm Engineering: 4th International Workshop (WAE 2000). Lecture Notes in Computer Science, Vol. 1982. Springer, 2001. Using Star Clusters for Filtering With Katya Pelekhov and Daniela Rus. In Proceedings of the Ninth International Conference on Information and Knowledge Management (CIKM), November 2000. Improving Algorithms for Boosting In Proceedings of the Thirteenth Annual Conference on Computational Learning Theory (COLT), July 2000. Automatic Information Organization With Katya Pelekhov and Daniela Rus. In Proceedings of the International Conference on Advances in Infrastructure for Electronic Business, Science, and Education on the Internet (SSGRR), July 2000. Bayes Optimal Metasearch: A Probabilistic Model for Combining the Results of Multiple Retrieval Systems With Mark Montague. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2000. Scalable Information Organization With Fred Reiss and Daniela Rus. In Proceedings of the 6th Conference on ContentBased Multimedia Information Access (RIAO), April 2000. A Practical Clustering Algorithm for Static and Dynamic Information Organization With Katya Pelekhov and Daniela Rus. In Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), January 1999. Improved Bicriteria Existence Theorems for Scheduling With April Rasala, Cli↵ Stein, and Neal Young. In Proceedings of the Tenth Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), January 1999. Static and Dynamic Information Organization with Star Clusters With Katya Pelekhov and Daniela Rus. In Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, November 1998. 13 Generating, Visualizing and Evaluating High-Quality Clusters for Information Organization With Daniela Rus and Katya Pelekhov. In Principles of Digital Document Processing: 4th International Workshop (PODDP). Lecture Notes in Computer Science, Vol. 1481. Springer, 1998. Specification and Simulation of Statistical Query Algorithms for Efficiency and Noise Tolerance With Scott E. Decatur. In Proceedings of the Eighth Annual ACM Conference on Computational Learning Theory (COLT), July 1995. General Bounds on Statistical Query Learning and PAC Learning with Noise via Hypothesis Boosting With Scott E. Decatur. In Proceedings of the Thirty-Fourth Annual Symposium on Foundations of Computer Science (FOCS), November 1993. Searching in the Presence of Linearly Bounded Errors With Aditi Dhagat. In Proceedings of the Twenty-Third Annual ACM Symposium on Theory of Computing (STOC), May 1991. Inferring Graphs from Walks With Ronald L. Rivest. In Proceedings of the Third Annual Workshop on Computational Learning Theory (COLT), August 1990. Unrefereed Papers Frontiers, Challenges, and Opportunities for Information Retrieval James Allan, et al. In ACM SIGIR Forum, 46(1):2–32, June 2012. Empirical Justification of the Discount Function for nDCG [abstract] With Evangelos Kanoulas. In Proceedings of the SIGIR 2008 Workshop: Beyond Binary Relevance: Preferences, Diversity and Set-Level Judgments, July 2008. Million Query Track 2007 Overview With James Allan, Ben Carterette, Virgil Pavlu, Blagovest Dachev, and Evangelos Kanoulas. In The Sixteenth Text REtrieval Conference Proceedings (TREC 2007). National Institute of Standards and Technology, November 2007. The Hedge Algorithm for Metasearch at TREC 2007 With Virgil Pavlu and Olena Zubaryeva. In The Sixteenth Text REtrieval Conference Proceedings (TREC 2007). National Institute of Standards and Technology, November 2007. The Hedge Algorithm for Metasearch at TREC 2006 With Virgil Pavlu and Carlos Rei. In The Fifteenth Text REtrieval Conference Proceedings (TREC 2006). National Institute of Standards and Technology, September 2007. The Kerf Toolkit for Intrusion Analysis With Sergey Bratus, David Kotz, Ron Peterson, and Daniela Rus. In IAnewsletter, 8(2):12–16, 2005. The Kerf Toolkit for Intrusion Analysis With Sergey Bratus, David Kotz, Ron Peterson, Daniela Rus, and Brett Tofel. Dartmouth College, Department of Computer Science technical report TR2004-493, March 2004. 14 Challenges in Information Retrieval and Language Modeling With James Allan, et al. In SIGIR Forum, 37(1):31–47, 2003. Simple Bounds on the Expected Height of a Randomly Built Binary Search Tree Dartmouth College, Department of Computer Science technical report TR2001-387, 2001. Bayes Optimal Metasearch: A Probabilistic Model for Combining the Results of Multiple Retrieval Systems With Mark Montague. Dartmouth College, Department of Computer Science technical report TR2000-382, 2000. Using High-Quality Clusters for Summarizing and Visualizing Large Document Collections With Daniela Rus and Katya Pelekhov. In Proceedings of the 1997 SIGIR Workshop on Information Reduction in Information Retrieval, Summarization and Visualization, July 1997. Computing Dense Clusters On-Line for Information Organization With Daniela Rus and Katya Pelekhov. Dartmouth College, Department of Computer Science technical report PCS-TR97-324, 1997. Generating, Visualizing and Evaluating High-Quality Clusters for Information Organization With Daniela Rus and Katya Pelekhov. Dartmouth College, Department of Computer Science technical report PCS-TR97-319, 1997. Noise Tolerant Algorithms for Learning and Searching MIT Laboratory for Computer Science technical report MIT/LCS/TR-657, 1995. Ph.D. thesis. Improved Noise-Tolerant Learning and Generalized Statistical Queries With Scott E. Decatur. Harvard University, Center for Research in Computing Technology technical report TR-17-94, 1994. Boosting Weak Hypotheses in the Statistical Query Model With Scott E. Decatur. In Proceedings of the 1993 MIT CBCL Learning Day at Endicott, April 1993. Inferring Graphs from Walks Masters thesis, MIT, January 1992. On-Line Algorithms for 2-Coloring Hypergraphs via Chip Games With Aditi Dhagat. MIT Laboratory for Computer Science technical report MIT/LCS/TM-439, 1990. Funding Awarded Code-set Prediction Project Massachusetts General Hospital. Principal investigator. $50,177 awarded for the period 07/01/15 through 04/30/16. Optimal Allocation of Crowdsourced Resources for IR Evaluation National Science Foundation. Principal investigator. $499,677 awarded for the period 08/15/14 through 07/31/17. 15 A Nugget-Based Information Retrieval Evaluation Paradigm National Science Foundation. Principal investigator. $150,000 awarded for the period 09/15/12 through 02/28/14, extended to 02/18/15. Research Partnership in Clinical Information Retrieval US Department of Veterans A↵airs. Principal investigator with Mirek Riedewald (coPI). $24,677.26 awarded for the period 1/27/2013 through 1/26/2014. Collection Construction Methodologies for Learning-to-Rank National Science Foundation. Principal investigator. $488,723 awarded for the period 09/01/10 through 08/31/13. Research Partnership in Clinical Information Retrieval US Department of Veterans A↵airs. Principal investigator with Mirek Riedewald (coPI). $24,000 awarded for the period 1/27/2012 through 1/26/2013. Analysis and Evaluation of Measures of Retrieval Performance National Science Foundation. Principal investigator. $300,000 awarded for the period 07/01/06 through 06/30/09, extended to 06/30/10. Mining the Figures and Text of the Biological Literature National Science Foundation. Co-principal investigator with Robert Futrelle (PI) and Peter Tarasewich (co-PI). $150,580 awarded for the period 09/01/06 through 08/31/09. Risk Assessment Liberty Mutual Insurance. Principal investigator. $50,000 awarded for FY 2006–2010 as part of a gift to the University and College. MRI: Enabling Research on Terabyte-scale Datasets National Science Foundation. Co-principal investigator with Gene Cooperman (PI) and Ravi Sundaram, Jennifer Dy, and David Kaeli (co-PIs). $199,000 awarded for the period 08/01/06 through 07/31/08. CAREER: An Information-Theoretic Approach to Computational Learning with Applications National Science Foundation. Principal investigator. $250,002 originally awarded through Dartmouth College for the period 07/01/01 through 06/30/06; $173,737 transferred and awarded through Northeastern University for the period 09/01/03 through 08/31/07. Kerf: A Toolkit for Intrusion Analysis Institute for Security Technology Studies. Principal investigator. $75,449 awarded for the period 09/01/05 through 08/31/06. The National fMRI Data Center National Science Foundation. Co-principal investigator with Michael Gazzaniga (PI) and Daniel Rockmore (Co-PI). $4,682,260 awarded for FY 1999–2004. Extended to Fall 2006. The Kerf Toolkit for Intrusion Analysis Department of Homeland Security, Institute for Security Technology Studies. Coprincipal investigator with David Kotz (PI) and Daniela Rus (co-PI). $1,013,507 awarded for FY 2003–2005. 16 Infrastructure for Distributed Collaboration in Detecting Network Attacks Department of Justice, Institute for Security Technology Studies. Co-principal investigator with David Kotz (PI) and Daniela Rus (co-PI). $491,876 awarded for AY 2001– 2002. Assessing and Mining of Data from Network Sensors Department of Justice, Institute for Security Technology Studies. Co-principal investigator with David Kotz (PI) and Daniela Rus (co-PI). $153,379 awarded for AY 2000– 2001. National fMRI Data Center at Dartmouth College William M. Keck Foundation. Co-principal investigator with Michael Gazzaniga (PI) and Daniel Rockmore (Co-PI). $1,000,000 awarded for FY 2000–2001. Honors and Awards ASONAM Best Student Paper Award 2014 Awarded for “The Network You Keep: Analyzing Persons of Interest using Cliqster,” Saber Shokat Fadaee, Mehrdad Farajtabar, Ravi Sundaram, Javed A. Aslam, and Nikos Passas. ECIR Best Poster Paper Award 2013 Awarded for “Optimizing nDCG Gains by Minimizing E↵ect of Label Inconsistency,” Pavel Metrikov, Virgiliu Pavlu, and Javed A. Aslam. Excellence in Teaching Award, Finalist 2013 One of 25 finalists among 200 nominations for the 2013 Northeastern University Excellence in Teaching Award. National Science Foundation Career Award 2001 – 2005 Awarded for career grant entitled “An Information-Theoretic Approach to Computational Learning with Applications.” Class of 1962 Faculty Fellow 2001 – 2002 Awarded by Dartmouth College for “demonstrated excellence in undergraduate teaching and promise as a scholar.” Dartmouth Review Top 20 Professor 1999 – 2003 Awarded by a student-run Dartmouth College newspaper for excellence in teaching. General Electric Foundation Fellowship 1987 – 1988 Awarded by the General Electric Corporation for graduate study at MIT. Steiner Prize 1987 Awarded by the University of Notre Dame for “academic excellence in engineering.” General Motors Scholar Tau Beta Pi Eta Kappa Nu Students 1985 – 1987 Elected 1986 Elected 1985 Postdoctoral Advisor Virgil Pavlu (current), Sergey Bratus, and Mark Montague. Ph.D. Graduates Pavel Metrikov, 2015. Thesis: “Relevance Assessment (Un-)Reliability in Information Retrieval: Minimizing Negative Impact” 17 Maryam Bashir, 2014. Thesis: “Optimally Selecting and Combining Assessment and Assessor Types for Information Retrieval Evaluation” Peter Golbus, 2014. Thesis: “Beyond Measuring Performance: Targeted Meta-Evaluations for Diversity and an Information-Theoretic Framework for the Analysis of Search Engines” Stefan Savev, 2012. Thesis: “Collection Construction Methodologies for Learning to Rank” Shahzad Rajput, 2012. Thesis: “A Nugget-based Test Collection Construction Paradigm” Keshi Dai, 2012. Thesis: “Modeling Score Distributions for Information Retrieval” Evangelos Kanoulas, 2009. Thesis: “Building Reliable Test and Training Collections in Information Retrieval” Virgil Pavlu, 2008. Thesis: “Large Scale IR Evaluation” Emine Yilmaz, 2007. Thesis: “Informative and Efficient Evaluation of Retrieval Systems” Alan Feuer, 2007. Thesis: “Increasing Conversation in Proximity Search Using Phrasal Query Suggestions” Robert Savell, 2005. Thesis: “On-line Metasearch, Pooling, and System Evaluation” Mark Montague, 2002. Thesis: “Metasearch: Data Fusion for Document Retrieval” Ph.D. Thesis Advisor (current) Jesse Anderton, Maryam Aziz, Matthew Ekstrand-Abueg, Cheng Li, and Bingyu Wang. Ph.D. Thesis Committee Member (past) Tim Smith, Bahar Qarabaqi, Karl Wiegand, Malak Alshawabkeh, Fatemeh Azmandian, Nick Blumm, Peter Dillinger, Cheng Wu, Mingyan Shao, Ben Carterette, Keary LeBeau, Micha Moffie, Jun Gong, Guolong Lin, Anna Shubina, Wenxu Tong, Viet Ha Nguyen, Huanmei Wu, Qun Li, David Wagner, Clint Hepner, Brian Premore, Scott McElfrish, and Katya Pelekhov. M.S. Graduates Jie Wu, 2013. Thesis: “Applying EM to Compute Document Relevance from Crowdsourced Pair Preferences” Joshua Hodosh, 2010. Thesis: “Learning Malicious Activity Using Virtual Machine Introspection” Meredith Frost, 2005. Thesis: “Evaluating Information-theoretic Similarity Measures for Documents” John Thomas, 2004. Thesis: “Bicriteria Existence Theorems for Scheduling via Weighted Average Loss” M.S. Research Advisor (current) Paul Grosu. M.S. Research Advisor (past) Sam Scarano, Patrick Redmond, Galen Wilkerson, Vidji Shah, Prasanna Pilla, and Olena Zubaryeva. 18 M.S. Thesis Committee Member (past) Rahul Verma, Fatemeh Azmandian and Tim Morgan. Honors Thesis Supervisor Paul Seligman, Lisa Torrey, Sebastien Lahaie, Zach Berke, David Latham, Fred Reiss, Dan Scholnick, Ann DeBord, Je↵ Isaacs, Jason Whaley, Matt Carter, Jack Pien, Michael Pryor, and Eric Hagen. Undergraduate Research Supervisor Daniel Matysiak, Chris Lambert, Carlos Rei, Kevin Roche, David Marmaros, Jay Cormier, Ken Yasuhara, and Morgan Soutter. 19