A Search System for Mathematical Expressions on Software Binaries
Developers often ask for libraries that implement specific mathematical expressions. A fundamental bottleneck in building information retrieval (IR) systems to answer such mathematical queries is the inability to detect a given expression in software binaries. While we have a few math IR solutions such as EgoMath2 and Tangent-3 that work over text documents, none exist to search over software binaries. Our vision is to build a search system for binaries to answer queries containing mathematical expressions. A wide variety of compilers and differences in the way they optimize the code, pose difficult challenges to solve this problem. In this work, we discuss our preliminary results in detecting mathematical expressions in software binaries. We use a knowledge base assisted approach to solve this problem. We are able to search mathematical expressions with a precision of 80% and a recall of 53%. This work opens up interesting research opportunities in areas such as software security and performance, to help analysts in identifying and analyzing binaries for implementations of mathematical expressions.
Tue 29 MayDisplayed time zone: Amsterdam, Berlin, Bern, Rome, Stockholm, Vienna change
14:00 - 15:30 | |||
14:00 17mFull-paper | Analyzing Requirements and Traceability Information to Improve Bug Localization Technical Papers A: Michael Rath Technische Universität Ilmenau, A: David Lo Singapore Management University, A: Patrick Mäder Technische Universität Ilmenau DOI Pre-print | ||
14:17 17mFull-paper | Towards Extracting Web API Specifications from Documentation Technical Papers A: Jinqiu Yang , A: Erik Wittern IBM Research, A: Annie T.T. Ying EquitySim, A: Julian Dolby IBM Thomas J. Watson Research Center, A: Lin Tan University of Waterloo | ||
14:34 17mFull-paper | Evaluating How Developers Use General-Purpose Web-Search for Code Retrieval Technical Papers A: Md Masudur Rahman University of Virginia, USA, A: Jed Barson University of Virginia, A: Sydney Paul , A: Joshua Kayani , A: Federico Andrés Lois , A: Sebastián Fernandez Quezada , A: Chris Parnin NCSU, A: Kathryn Stolee North Carolina State University, A: Baishakhi Ray Columbia University, New York Pre-print | ||
14:51 17mFull-paper | Learning to Mine Aligned Code and Natural Language Pairs from Stack Overflow Technical Papers A: Pengcheng Yin , A: Bowen Deng Carnegie Mellon University, A: Edgar Chen Carnegie Mellon University, A: Bogdan Vasilescu Carnegie Mellon University, A: Graham Neubig Carnegie Mellon University | ||
15:08 7mShort-paper | A Search System for Mathematical Expressions on Software Binaries Technical Papers A: Ridhi Jain , A: Sai Prathik Saba Bama , A: Venkatesh Vinayakarao IIITD, A: Rahul Purandare IIIT-Delhi DOI Pre-print | ||
15:15 15mOther | Discussion phase Technical Papers |