Metasearch information fusion using linear programming
RAIRO - Operations Research - Recherche Opérationnelle, Tome 46 (2012) no. 4, pp. 289-303.

For a specific query merging the returned results from multiple search engines, in the form of a metasearch aggregation, can provide significant improvement in the quality of relevant documents. This paper suggests a minimax linear programming (LP) formulation for fusion of multiple search engines results. The paper proposes a weighting method to include the importance weights of the underlying search engines. This is a two-phase approach which in the first phase a new method for computing the importance weights of the search engines is introduced and in the second stage a minimax LP model for finding relevant search engines results is formulated. To evaluate the retrieval effectiveness of the suggested method, the 50 queries of the 2002 TREC Web track were utilized and submitted to three popular Web search engines called Ask, Bing and Google. The returned results were aggregated using two exiting approaches, three high-performance commercial Web metasearch engines and our proposed technique. The efficiency of the generated lists was measured using TREC-Style Average Precision (TSAP). The new findings demonstrate that the suggested model improved the quality of merging considerably.

DOI : 10.1051/ro/2012019
Classification : 90C05, 90C90, 68P20
Mots clés : linear programming, search engine, metasearch, information fusion, information retrieval
@article{RO_2012__46_4_289_0,
     author = {Amin, Gholam R. and Emrouznejad, Ali and Sadeghi, Hamid},
     title = {Metasearch information fusion using linear programming},
     journal = {RAIRO - Operations Research - Recherche Op\'erationnelle},
     pages = {289--303},
     publisher = {EDP-Sciences},
     volume = {46},
     number = {4},
     year = {2012},
     doi = {10.1051/ro/2012019},
     mrnumber = {2995738},
     zbl = {1270.90027},
     language = {en},
     url = {http://www.numdam.org/articles/10.1051/ro/2012019/}
}
TY  - JOUR
AU  - Amin, Gholam R.
AU  - Emrouznejad, Ali
AU  - Sadeghi, Hamid
TI  - Metasearch information fusion using linear programming
JO  - RAIRO - Operations Research - Recherche Opérationnelle
PY  - 2012
SP  - 289
EP  - 303
VL  - 46
IS  - 4
PB  - EDP-Sciences
UR  - http://www.numdam.org/articles/10.1051/ro/2012019/
DO  - 10.1051/ro/2012019
LA  - en
ID  - RO_2012__46_4_289_0
ER  - 
%0 Journal Article
%A Amin, Gholam R.
%A Emrouznejad, Ali
%A Sadeghi, Hamid
%T Metasearch information fusion using linear programming
%J RAIRO - Operations Research - Recherche Opérationnelle
%D 2012
%P 289-303
%V 46
%N 4
%I EDP-Sciences
%U http://www.numdam.org/articles/10.1051/ro/2012019/
%R 10.1051/ro/2012019
%G en
%F RO_2012__46_4_289_0
Amin, Gholam R.; Emrouznejad, Ali; Sadeghi, Hamid. Metasearch information fusion using linear programming. RAIRO - Operations Research - Recherche Opérationnelle, Tome 46 (2012) no. 4, pp. 289-303. doi : 10.1051/ro/2012019. http://www.numdam.org/articles/10.1051/ro/2012019/

[1] L. Akritidis, D. Katsaros and P. Bozanis, Effective rank aggregation for metasearching. J. Syst. Soft. 84 (2011) 130-143.

[2] G.R. Amin and A. Emrouznejad, An extended minimax disparity to determine the OWA operator weights. Comput. Ind. Eng. 50 (2006) 312-316.

[3] G.R. Amin and A. Emrouznejad, Finding relevant search engines results : a minimax linear programming approach. J. Oper. Res. Soc. 61 (2010) 1144-1150. | Zbl

[4] G.R. Amin and H. Sadeghi, Application of Prioritized Aggregation Operators in Preference Voting. Int. J. Intell. Syst. 25 (2010) 1027-1034. | Zbl

[5] R.A. Baeza-Yates and B. Ribeiro-Neto, Modern information retrieval : the concepts and technology behind search, 2nd edition. ACM Press Books (2010).

[6] J. Bar-Ilan, M. Mat-Hassan and M. Levene, Methods for comparing rankings of search engine results. Comput. Netwo. 50 (2006) 1448-63. | Zbl

[7] S.K. Deka and N. Lahkar, Performance evaluation and comparison of the five most used search engines in retrieving web resources. Online Inf. Rev. 34 (2010) 757-771.

[8] E.D. Diaz, A. De and V. Raghavan, A comprehensive OWA-based framework for result merging in metasearch. Lect. Notes Comput Sci. 3642 (2005) 193-201.

[9] A. De, E.D. Diaz and V. Raghavan, A fuzzy search engine weighted approach to result merging for metasearch. Lect. Notes Comput Sci. 4482 (2007) 95-102.

[10] A. Emrouznejad, MP-OWA : The Most Preferred OWA Operator. Knowl-Based Syst. 21 (2008) 847-851.

[11] M. Farah and D. Vanderpooten, An outranking approach for rank aggregation in information retrieval. Proceedings of the 30th ACM SIGIR conference on Research and development in information retrieval. Amsterdam, The Netherlands (2007) 591-598.

[12] E. Herrera-Viedma, J. Lopez Gijon, S. Alonso, J. Vilchez, C. Garcia and L. Villen, Applying Aggregation Operators for Information Access Systems : An Application in Digital Libraries. Int. J. Intell. Syst. 23 (2008) 1235-1250. | Zbl

[13] Y. Lu, W. Meng, L. Shu, C. Yu and K.L. Liu, Evaluation of Result Merging Strategies for Metasearch Engines. Lect. Notes Comput. Sci. 3806 (2005) 53-66.

[14] W. Meng, C. Yu and K.L. Liu, Building efficient and effective metasearch engines. ACM Comput. Surv. 34 (2002) 48-89.

[15] H. Sadeghi, Assessing metasearch engine performance. Online Inf. Rev. 33 (2009) 1058-1065.

[16] H. Sadeghi, Empirical challenges and solutions in construction of a high-performance metasearch engine. Online Inf. Rev. 36 (2012) 713-723.

[17] S. Shekhar, K.V. Arya, R. Agarwal and R. Kumar, A WEBIR Crawling Framework for Retrieving Highly Relevant Web Documents : Evaluation Based on Rank Aggregation and Result Merging Algorithms, International Conference on Computational Intelligence and Communication Networks (CICN) (2011) 83-88.

[18] M. Shokouhi and L. Si, Federated Search. Found. Trends Inf. Retr. FTIR 5 (2011) 1-102.

[19] T.P.C. Silva, E.S. De Moura, J.M.B. Cavalcanti, A.S. Da Silva, M.G. De Carvalho and M.A. Gonçalves, An evolutionary approach for combining different sources of evidence in search engines. Inform. Syst. 34 (2009) 276-289.

[20] E. Voorhees, Overview of TREC 2002, in Proceedings of the 11th Text REtrieval Conference (TREC), Gaithersburg, MD, USA (2002) 1-15.

[21] S. Wu, Applying statistical principles to data fusion in information retrieval. Expert Syst. Appl. 36 (2009) 2997-3006.

[22] S. Wu, Linear combination of component results in information retrieval. Data Knowl. Eng. 71 (2012) 114-126.

[23] S. Wu and F. Crestani, Data fusion with estimated weights, CIKM '02 : Proceedings of the eleventh international conference on Information and knowledge management (2002) 648-651.

[24] S. Wu, Y. Bi, X. Zeng and L. Han, The Experiments with the Linear Combination Data Fusion Method in Information Retrieval. Lect. Notes Comput. Sci. 4976 (2008) 432-437.

[25] J.T. Yao, V. Raghavan and Z. Wu, Web information fusion : A review of the state of the art. Inform Fusion 9 (2008) 446-449.

[26] X.S. Xie and G. Zhang, Study of Optimizing the Merging Results of Multiple Resource Retrieval Systems by a Particle Swarm Algorithm. International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC) (2011) 39-42.

[27] S. Zhou, M. Xu and J. Guan, LESSON : A system for lecture notes searching and sharing over Internet. J. Syst. Soft. 83 (2010) 1851-1863.

[28] G.T. Zhou, K.M. Ting, F.T. Liu and Y. Yin, Relevance feature mapping for content-based multimedia information retrieval. Pattern Rec. 45 (2012) 1707-1720

Cité par Sources :