Research
Publications
Teaching
Bio
Home

Research

Research Interest

My interest lies in reasoning under uncertainty for analysis and prediction over heterogeneous data using probabilistic graphical models and other machine learning techniques. My research includes developing probabilistic and cut-based formulations for collective relational clustering for applications such as entity resolution in structured databases and document collections, and word sense disambiguation from multilingual corpora. More recently, I have been exploring applications of relational clustering in cross domain transfer of learning.

Refereed Conference and Workshop Publications

  1. "Learning Dirichlet Processes from Partially Observed Groups", Avinava Dubey, Indrajit Bhattacharya, Mrinal Das, Tanveer Faruquie, and Chiranjib Bhattacharyya, To appear in Proceedings of IEEE International Conference on Data Mining (ICDM), Dec 2011 (Acceptance Rate: 18%)

  2. "Exploiting Coherence for the simultaneous discovery of latent facets and associated sentiments", Himabindu Lakkaraju, Chiranjib Bhattacharyya, Indrajit Bhattacharya and Srujana Merugu, SIAM International Conference on Data Mining (SDM), April 2011 (Acceptance Rate: 25%) (Best Research Paper Award)

  3. "Building Re-usable Dictionary Repositories for Real-world Text Mining", Shantanu Godbole, Indrajit Bhattacharya, Ajay Gupta and Ashish VermaACM International Conference on Information and Knowledge Management (CIKM), Toronto, October 2010 (Acceptance Rate: 13%).

  4. "A Cluster-level Semi-supervision Model for Inter-active Clustering", Avinava Dubey, Indrajit Bhattacharya and Shantanu GodboleEuropean Conference on Machine Learning (ECML-PKDD), Barcelona, September 2010 (Acceptance Rate: 16%).

  5. "Cross-Guided Clustering: Transfer of Relevant Supervision across Domains for Improved Clustering", Indrajit Bhattacharya, Shantanu Godbole, Sachindra Joshi and Ashish Verma,IEEE International Conference on Data Mining (ICDM), Miami, December 2009 (Acceptance Rate: 9%).

  6. "Enabling Analysts in Managed Services for CRM Analytics", Indrajit Bhattacharya, Shantanu Godbole, Ajay Gupta, Ashish Verma, Jeff Achtermann and Kevin English, ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Paris, July, 2009.

  7. "Structured Entity Identification and Document Categorization: Two Tasks with One Joint Model", Indrajit Bhattacharya, Shantanu Godbole, and Sachindra Joshi, ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Las Vegas, August, 2008 (Acceptance Rate: 10%).
    (12 citations)

  8. "Online Collective Entity Resolution", Indrajit Bhattacharya and Lise Getoor,NECTAR Track, Conference on Artifical Intellegence (AAAI), 2007.

  9. "Query-Time Entity Resolution", Indrajit Bhattacharya, Louis Licamele and Lise Getoor, The 12th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Philadelphia, USA, August 2006 (Acceptance Rate: 20%).
    (27 citations)
  10. "Relational Clustering for Entity Resolution Queries", Indrajit Bhattacharya, Louis Licamele and Lise Getoor, ICML 2006 Workshop on Statistical Relational Learning (SRL), Pittsburgh, USA, June 2006.
    (1 citations)
  11. "A Latent Dirichlet Model for Unsupervised Entity Resolution", Indrajit Bhattacharya and Lise Getoor, The 6th SIAM Conference on Data Mining (SIAM SDM), Bethesda, Maryland, April 2006
    (Best Research Paper Award) (Acceptance Rate: 16%). (101 citations)
  12. "Relational Clustering for Multi-type Entity Resolution", Indrajit Bhattacharya and Lise Getoor, The 11th ACM SIGKDD Workshop on Multi Relational Data Mining (MRDM), Chicago, August 2005.
    (34 citations)
  13. "Similarity Searching in Peer-to-Peer Databases", Indrajit Bhattacharya, Srinivas Kashyap and Srinivas Parthasarathy, The 25th International Conference on Distributed Computing Systems (ICDCS), Columbus, Ohio, June 2005 (Acceptance Rate: 14%).
    (29 citations)
  14. "Deduplication and Group Detection using Links", Indrajit Bhattacharya and Lise Getoor, The 10th ACM SIGKDD Workshop on Link Analysis and Group Detection (LinkKDD), Seattle, August 2004.
    (50 citations)
  15. "The University of Maryland Senseval-3 system descriptions", Clara Cabezas, Indrajit Bhattacharya and Philip Resnik, The 3rd International Workshop on the Evaluation of Systems for the Semantic Analysis of Text (SENSEVAL-3), Barcelona, July 2004
    (4 citations)
  16. "Unsupervised Sense Disambiguation using Bilingual Probabilistic Models", Indrajit Bhattacharya, Lise Getoor and Yoshua Bengio, The 42nd Annual Meeting of the Association for Computational Linguistics, (ACL-04), Barcelona, July 2004 (Acceptance Rate: 25%).
    (27 citations)
  17. "Iterative Record Linkage for Cleaning and Integration", Indrajit Bhattacharya and Lise Getoor, The 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD), Paris, June 2004 (Acceptance Rate: 25%).
    (139 citations)

Journal Publications and Book Chapters

  1. "Cross-Guided Clustering: Transfer of Relevant Supervision across Tasks", Indrajit Bhattacharya, Shantanu Godbole, Sachin Joshi and Ashish Verma, Accepted for publication in ACM Transactions on Knowledge Discovery from Data.
  2. "Collective Relational Clustering", Indrajit Bhattacharya and Lise Getoor, Chapter in: Constrained Clustering: Advances in Algorithms, Theory, and Applications, (eds Basu,Davidson,Wagstaff), Chapman & Hall/CRC Data Mining and Knowledge Discovery Series, June 2008.
  3. "Query-time Entity Resolution", Indrajit Bhattacharya and Lise Getoor, Journal of Artificial Intelligence Research (JAIR), Volume 30, pages 621-657, 2007.
    (27 citations)
  4. "Collective Entity Resolution in Relational Data", Indrajit Bhattacharya and Lise Getoor, ACM Transactions on Knowledge Discovery from Data (ACM-TKDD), Volume 1, Issue 1, March 2007.
    (143 citations)
  5. "Collective Entity Resolution in Relational Data", Indrajit Bhattacharya and Lise Getoor, IEEE Data Engineering Bulletin, Special Issue on Data Quality, June 2006.
    (2 citations)
  6. "Entity Resolution in Graphs", Indrajit Bhattacharya and Lise Getoor, Chapter in Mining Graph Data, Lawrence B. Holder and Diane J. Cook, Editors, Wiley, 2006.
    (17 citations)
  7. "An Object Oriented Fuzzy Data Model for Similarity Detection in Image Databases", Arun K. Majumdar, Indrajit Bhattacharya and Amit K. Saha, IEEE Transactions on Knowledge and Data Engineering, Vol. 14(5), September/October 2002.
    (11 citations)
  8. "Quantified Computation Tree Logic", Anindya Patthak, Indrajit Bhattacharya, Anirban Dasgupta, Pallab Dasgupta and Partha Pratim Chakrabarti, Information Processing Letters, Vol. 82(3), May 2002.
    (5 citations)

PhD Thesis

  • "Collective Entity Resolution In Relational Data", Indrajit Bhattacharya, PhD thesis from University of Maryland, College Park, Dec 2006. 17 citations

Technical Reports and Unpublished Manuscripts

  1. "Entity Resolution in Graph Data", Indrajit Bhattacharya and Lise Getoor, University of Maryland Technical Report CS-TR-4758, October 2005.
    (9 citations)
  2. "Latent Dirichlet Allocation Model for Entity Resolution", Indrajit Bhattacharya and Lise Getoor, University of Maryland Technical Report CS-TR-4740, August 2005.
  3. "Similarity Searching in Peer-to-Peer Databases", Indrajit Bhattacharya, Srinivas Kashyap and Srinivas Parthasarathy, University of Maryland Technical Report CS-TR-4558, January 2004.

Students

  • Arunachalam E.V. (PhD) (jointly with Shirish Shevade)
  • Ranganath B.N.(PhD)
  • Lavanya Sita Tekumalla (PhD)
  • Sachin Kale (ME 2011) (jointly with KV Raghavan and Deepak D'Souza)
  • Varun (ME)
  • Nishanth (ME)