Xiaohui Yu
3050 TEL Building
Tel: +1 (416) 736-2100 ext 33887
I am an associate professor in the School of Information Technology, York University. I received a BSc degree from Nanjing University, China, an MPhil from the Chinese University of Hong Kong (under the supervision of Prof. Ada Fu), and a PhD from the University of Toronto (under the supervision of Prof. Nick Koudas). I am affiliated with the Graduate Program in Computer Science at York University.
My current research interests include:
In the past, I have also worked on spatio-temporal query processing and high-dimensional data indexing.
X. Yu, H. Shi. CI-Rank: Ranking Keyword Search Results Based on Collective Importance, in Proceedings of the 28th IEEE International Conference on Data Engineering (ICDE 2012), Washington, D.C., April 1-5, 2012.
Y. Liu, X. Yu, X. Huang, A. An. Combining integrated sampling with SVM ensembles for learning from imbalanced datasets, Inf. Process. Manage. 47(4): 617-631 (2011)
X. Yu, Y. Liu, X. Huang, A. An. Mining Online Reviews for Predicting Sales Performance: A Case Study in the Movie Domain, in IEEE Transactions on Knowledge and Data Engineering (TKDE), in press.
X. Yu, J. Dong. Indexing High-Dimensional Data for Main-Memory Similarity Search, in Information Systems 35 (2010), pp. 825-843, Elsevier, November 2010. DOI:10.1016/j.is.2010.05.001
Y. Liu, X. Yu, X. Huang, A. An. S-PLSA+: Adaptive Sentiment Analysis with Application to Sales Performance Prediction, to appear in Proceedings of SIGIR 2010, July 19-23, 2010, Geneva, Switzerland. (poster)
X. Yu, Y. Liu, X. Huang, A. An. A Quality-Aware Model for Sales Prediction Using Reviews, in Proceedings of the 19th International World Wide Web Conference (WWW 2010), Raleigh, North Carolina, April 26-30, 2010. (poster)
X. Yu, H. Shi. Query Segmentation Using Conditional Random Fields, in Proceedings of the First International Workshop on Keyword Search on Structured Data (KEYS 2009), co-located with SIGMOD 2009, Providence, RI, June 28, 2009.
K. Pu, X. Yu. FRISK: Query Cleaning and Processing in Action, to appear in Proceedings of 25th International Conference on Data Engineering (ICDE 2009), Shanghai, China, March 29-April 4, 2009.
Y. Liu, X. Huang, A. An, and X. Yu. Predicting the Helpfulness of Online Reviews, to appear in Proceedings of 8th IEEE International Conference on Data Mining (ICDM 2008), Pisa, December, 2008. (Acceptance rate for full papers: 9.7%.)
Y. Liu, X. Huang, A. An, and X. Yu. HelpMeter: A Nonlinear Model for Predicting the Helpfulness of Online Reviews, to appear in Proceedings of 2008 IEEE/WIC/ACM International Conference on Web Intelligence (WI 2008), Sydney, December, 2008.
Y. Liu, X. Yu, X. Huang, A. An. Blog Data Mining: the Predictive Power of Sentiments, a chapter in L. Cao, P.S. Yu, C. Zhang, H. Zhang (eds.): Data Mining for Business Applications, Springer. To appear in late 2008.
K. Pu, X. Yu. Keyword Query Cleaning, in the 34th International Conference on Very Large Data Bases (VLDB 2008), Auckland, New Zealand, August 2008. (pdf, errata)
M. Hadjieleftheriou, X. Yu, N. Koudas, D. Srivastava. Selectivity Estimation of Set Similarity Selection Queries, in the 34th International Conference on Very Large Data Bases (VLDB 2008), Auckland, New Zealand, August 2008. (pdf)
X. Yu, Y. Liu. Reasoning about Similarity Queries in Text Retrieval Tasks, in Proceedings of the 17th International World Wide Web Conference (WWW 2008), Beijing, April 2008.
Y. Liu, X. Huang, A. An, X. Yu. ARSA: A Sentiment-Aware Model for Predicting Sales Performance Using Blogs, in Proceedings of the 30th Annual International ACM SIGIR Conference (SIGIR 2007), Amsterdam, July 2007. Acceptance rate: 18%. (pdf)
J. Dong, X. Yu. CSR+-tree: Cache-conscious Indexing for High-dimensional Similarity Search, in Proceedings of the 19th International Conference on Scientific and Statistical Database Management (SSDBM 2007), Banff, Canada, July 2007. (pdf)
C. Zuzarte, X. Yu. Fast Approximate Computation of Statistics on Views. In Proceedings of the ACM SIGMOD Conference (SIGMOD 2006), Chicago, IL, June 2006. (industry talk)
X. Yu, N. Koudas, C. Zuzarte. HASE: A Hybrid Approach to Selectivity Estimation for Conjunctive Predicates. In Proceedings of the 10th International Conference on Extending Database Technology (EDBT 2006), Munich, Germany, March 2006. Acceptance rate: 16%. (ps, pdf, ppt)
S. Guha, N. Koudas, D. Srivastava, X. Yu. Reasoning About Approximate Match Query Results. In Proceedings of the 22nd International Conference on Data Engineering (ICDE 2006), Atlanta, USA, April 2006. Acceptance rate (full paper): 12.9%. (ps, pdf, ppt)
X. Yu, C. Zuzarte, K. Sevcik. Towards Estimating the Number of Distinct Value Combinations for a Set of Attributes. In Proceedings of the ACM 14th Conference on Information and Knowledge Management (CIKM 2005), Bremen, Germany, November 2005. Acceptance rate: 18%. (ps, pdf, ppt)
X. Yu, K. Q. Pu, N. Koudas. Monitoring k-Nearest Neighbor Queries over Moving Objects. In Proceedings of the 21st International Conference on Data Engineering (ICDE 2005), Tokyo, Japan, April 2005. Acceptance rate: 12.9%. (ps, pdf, ppt)