Dr Sharif Ullah Khan
Home

 Projects List

 

No

Year

 Title

17

2010 - to date

NUST Knowledge Repository

16

2009 - to date

National Digital Repository for Pakistan Science Abstracts

15

2009 - 2010

Evaluation of Semantic (RDF) Storage Systems

14

2008 - 2009

Alignment of digital documents with Hierarchical Domain Ontology

13

2008 - 2009

Bridging Hierarchical Ontologies

12

2008 - 2009

Semantic Based Mapping and Query Rewriting in Information Integration

11

2008 -2009

Generation and Storage of Context Aware Metadata for Digital Documents

10

2007 -2008

Semantic-based Information Retrieval in Digital Documents

9

2007 -2008

Ontology Driven Relevance Reasoning for Source Selection in Data Integration

8

2007 - 2008

Transforming Database Relations to Source Ontology in Data Integration

7

2006 - 2007

Ontology-driven Query Expansion

6

2005 - 2006

Integrating Biomedical Data Sources on the Internet

5

2004 - 2005

Optimization of Semantic Caching for XML Databases

4

2004 - 2005

Expertise Matching System

3

2004 - 2005

Efficient Data Storage for Dynamically Grouped CQs in XML Databases

2

2004 - 2005

Optimization of Web Caching

1

1999 - 2002

Optimization of Continual Queries

 

 

NUST Knowledge Repository

(2010 - to date)

National University of Sciences and Technology (NUST), academic information is increasing and the sizes of collections are expanding. It needs an appropriate and permanent storage for preserving and future reuse. Digital repositories provide seamless integration of information for log term preservation and leverage new approaches to teaching, research and industry. Real challenges in establishing institutional knowledge repositories are data gathering, security and preservation, interoperability, scalability and information retrieval. We are focusing on these challenges using open source digital library software.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

 

National Digital Repository for Pakistan Science Abstracts

(2009 - to date)

A vital service of Pakistan Scientific and Technological Information Centre (PASTIC)  is abstracting and indexing service for abstracts of research articles, published in Pakistani S&T journals which is named as “Pakistan Science Abstracts (PSA)”. Taxonomy-based Web digital repository has been proposed to PSA in order to facilitate PASTIC in (i) disseminating information, (ii) maximizing its information scope (iii) assisting authors/institutional libraries through flexible repository services and (iv) analyzing information for future national planning.

Funding

  • Pakistan Science Foundation (PSF), Islamabad, Pakistan

  • National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

 

Evaluation of Semantic (RDF) Storage Systems

(2009 - 2010)

Discovery of semantic associations, existing among concepts in ontologies, is an important task in various analytical activities, such as national securities and business intelligence. Main memory ontologies can provide better performance in discovering semantic association since data is directly accessible. Benchmarks have being proposed to validate their claim to performance. However these benchmarks provide limited information on semantic stores. This project copes with the deficiencies of exiting benchmarks. The further details can be seen here.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. Scalability Evaluation of Native RDF Storage Systems

Alignment of digital documents with Hierarchical Domain Ontology

(2008 - 2009)

Effective information retrieval in digital libraries requires semantic alignments of documents with taxonomy. The alignments provide the semantic description of documents. The proposed project aligns documents using the hierarchical structure of taxonomy. It refines the results of the existing semantic key phrase extraction algorithm.

Funding

  • Higher Education Commission (HEC), Islamabad, Pakistan

  • National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. I. Fatima, S. Khan and K. Latif, “Refinement Methodology for Automatic Document Alignment using Taxonomy in Digital Libraries”. In the proceedings of 3rd IEEE International Conference on Semantic Computing (ICSC2009), pp:281-286, September 14-16, 2009, Berkeley, CA, USA. IEEE Xplore.

 

Ontology matching for Hierarchical Classification schemes

(2008 - 2009)

Hierarchical classification ontologies play a key role to organize the documents and are widely used by the community. While mapping the ontologies, relationships among concepts are taken into consideration as a major aspect. However these relations are extinct in hierarchical classification ontologies because concepts in a hierarchy are associated with each other only through “is-a” relation. This project cope  with heterogeneities and inconsistencies found in hierarchical structure.

Funding

  • Higher Education Commission (HEC), Islamabad, Pakistan

  • National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. M. Safyan and S. Khan; “Bridging Hierarchical Ontologies for Interoperability and Query Reformulation”, In the proceedings of 2008 International Conference on Advanced Computer Theory and Engineering (ICACTE 2008), pp:832 – 836, December 2008, Phuket Island, Thailand. IEEE Xplore

Semantic Based Mapping and Query Rewriting in Information Integration

(2008 - 2009)

Data integration systems are facing problems because (i) sources are semantically heterogeneous and are autonomous; (ii) their schemas and database design are different and (iii) their query languages are also different. This project highlights some issues of semantic-based query rewriting and proposes a solution to (a) provide uniform access to all sources and (b) relieve users to understand local schemas.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. I. Qasim, S. Khan and K. Latif, “Semantic Mapping between Global and Source Ontologies for Scalable Data Integration System”, In the proceedings of 2nd International Conference on Information and Communication Technologies (ICICT), August 2008, Bannu, Pakistan.

  2. A. Aslam and S. Khan, “Semantic Based Query Rewriting in Data Integration”, In the proceedings of 2nd International Conference on Information and Communication Technologies (ICICT), August 2008, Bannu, Pakistan

  3. A. Aslam, S. Khan and K. Latif, “Semantic Based Query Rewriting in Distributed Systems”, In the proceedings of 4th IEEE international conference on Emerging Technology (ICET 2008), pp:292-297, October 2008. Rawalpindi Pakistan. IEEE Xplore.

  4. I. Qasim, S. Khan and K. Latif. “Semantic Mapping between Global and Source Ontology using WordNet”. In the proceedings of 2nd IEEE International Conference on Computer, Control & Communication (IEEE-IC4 2009), pp:1 – 5, February 2009. Karachi, Pakistan. IEEE Xplore.

Extraction and Generation of Semantic Annotations for Digital Repository

(2008 - 2009)

The semantic-based information retrieval techniques extract keywords with semantic relationship using domain ontology from the digital documents. Domain ontology restricts the information extraction to its defined scope and ignores concepts and relationships from documents that are not defined in domain ontology. Our proposed engine automatically extracts concepts and the relationships exist between the concepts from documents without employing domain ontology. Secondly, the engine builds an association between concepts and components of documents to enable physical binding among the document sections/components.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. N. Ahmed, S. Khan, K. Latif and A. M. Khattak, “Extracting Semantic Annotation and their Correlation with Document”. In the proceedings of 4th IEEE international conference on Emerging Technology (ICET 2008), pp: 32-37, October 2008, Rawalpindi Pakistan. IEEE Xplore

  2. A.M. Khattak, J. Mustafa, N. Ahmed, K. Latif and S. Khan, “Intelligent Search in Digital Documents”, In the proceedings of 2008 IEEE/WIC/ACM International Conferences on Web Intelligence (WI), pp: 558 – 561, December 2008, Sydney, Australia. IEEE Xplore.

Semantic-based Information Retrieval in Digital Documents

(2007 - 2008)

Retrieving required information in digital documents is difficult due to semantic heterogeneity. Keyword–based Semantic similarity helps in handling the issue by improving the recall of the information; however, it cannot improve the precision of the information. In order to improve the precision, we adopt thematic semantic similarity approach for information retrieval. RDF triples are employed instead of keywords in maintaining sources metadata and formulating user queries for capturing the context of the keywords. This improves the information precision.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. J. Mustafa, S. Khan and K. Latif. Intelligent Informational Retrieval.  In Proceedings of 1st National Conference on Security, Computing, & Communication (SCC), pp: 58-62, May 2008, Kohat, NWFP, Pakistan.

  2. J. Mustafa, S. Khan and K. Latif. Ontology Based Semantic Information Retrieval. In Proceedings of 4th IEEE International Conference on Intelligent Systems (IS'08), pp: 22.14 – 22.19, September 2008, Varna, Bulgaria. IEEE Xplore.

  3.  A.M. Khattak, J. Mustafa, N. Ahmed, K. Latif and S. Khan, “Intelligent Search in Digital Documents”, In the proceedings of 2008 IEEE/WIC/ACM International Conferences on Web Intelligence (WI), pp: 558 – 561, December 2008, Sydney, Australia. IEEE Xplore.

Ontology Driven Relevance Reasoning for Source Selection in Data Integration

(2007-2008)

Data sources are integrated for knowledge discovery. Online data sources join and leave integration systems arbitrarily because they are autonomous. All available data sources may not contribute to a user query result necessarily. Therefore, executing a user query against all the data sources consumes resources and makes the query expensive. Relevant data sources should be selected through relevance reasoning. Existing techniques take comparatively more time in relevance reasoning as the number of sources increase and lack semantics to sort out fine-grained semantic heterogeneities. This project extends these techniques by including (a): bitmap index for efficient reasoning and (b): semantics to sort out fine-grained semantic heterogeneities.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. M. Bilal and S. Khan. Ontology-Driven Relevance Reasoning in Data Integration Techniques. In the proceedings of 5th international workshop on the Frontiers of Information Technology (FIT 2007), Islamabad, Pakistan, December 17-18, 2007

  2. M. Bilal and S. Khan. Ontology Driven Relevance Reasoning Architecture for Data Integration Techniques. In Proceedings of 4th IEEE International Conference on Intelligent Systems (IS'08), pp: 22.8 – 22.13, September 2008, Bulgaria. IEEE Xplore.

Transforming Database Relations to Source Ontology in Data Integration

(2007 - 2008)

Scalable and flexible data integration of autonomous and distributed data sources requires to represent the sources' descriptions into a conceptual model i.e., ontology to make the sources interoperable. Sources' metadata are either incomplete or not available. The essential metadata of the sources can be extracted from the database relations. Extracting the metadata from database relations and transforming them into source ontology is tedious and error prone task. The contribution of this project is a methodology for: (a) extracting metadata from source relations, and (b) transforming the extracted metadata to ontology.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. K. Sonia and S. Khan. Building Local Ontology from Database Relations in Data Integration. In the proceedings of IEEE 3rd international conference on Emerging Technology (ICET 2007), pp: 108 – 113, November 2007. Islamabad, Pakistan, IEEE Xplore.

  2. K. Sonia and S. Khan. Transforming Relational Model to Source Ontology for Data Integration. In Proceedings of 4th IEEE International Conference on Intelligent Systems (IS'08), pp: 22.2 – 22.7, September 2008, Varna, Bulgaria. IEEE Xplore.

  3. K. Sonia and S. Khan. R2O Transformation System: Relation to Ontology Transformation for Scalable Data Integration. In Proceedings of the 12th International Database Engineering & Applications Symposium (IDEAS'08), pp: 291-295 September 2008, Coimbra, Portugal. IEEE Xplore.

Ontology-driven Query Expansion

(2006-2007)

Semantic queries are generated with the support of a semantic model (i.e. domain ontology) through utilizing relationships between the existing concepts in data sources. This research expands the user given queries through hierarchical relationships in ontology to transform them into semantic queries. They provide transparent access to data sources. It is evident from the evaluation of results that recall is improved. Hence information loss is minimized.

Funding

  • Higher Education Commission (HEC), Islamabad, Pakistan

  • National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. W. Ali and S. Khan. Global Query Generation over Diverse Data Sources Using Ontology. In the proceedings of First national conference on Information and Communication Technologies (NCICT 2007), Bannu, Pakistan, June 2007.

  2. 27) W. Ali and S. Khan, “Ontology Driven Query Expansion in Data Integration”, In the proceedings of 4th International Conference on Semantic, Knowledge and Grid (SKG), pp:57 – 63, December 2008, Beijing China,. IEEE Xplore

Integrating Biomedical Data Sources on the Internet

(2005 - 2006)

A large number of biomedical data sources are publicly available on the Internet. These data sources are intermediaries between experimental observation and innovative synthesis. Unfortunately, these sources have been structured with diverse interests, subject areas and/or institutional convenience. The generated information are impressive in scale but limited in usefulness to the organization. Its decentralized, autonomous and heterogeneous nature have made very difficult its accessibility and integration by scientists for innovative synthesis. A real challenge is integrating the biomedical sources in a meaningful way.

Funding

Institute de Recherche en Informatique de Toulouse (IRIT), UMR 5505 CNRS, Universite Paul Sabatier (UPS), Toulouse, France.

Publication

  1. S. Khan and F. Marvon. Ontology-based Query Reformulation for Scalable Data Integration. Technical Report IRIT/RR-2006-07-FR. Institute de Recherche en Informatique de Toulouse (IRIT), UMR 5505 CNRS, Universite Paul Sabatier (UPS), Toulouse, France, June 2006.

  2. S. Khan and F. Marvon. Query Processing in Biomedical Informatics. Technical Report IRIT/RR-2006-07-FR. Institute de Recherche en Informatique de Toulouse (IRIT), UMR 5505 CNRS, Universite Paul Sabatier (UPS), Toulouse, France, March 2006.

  3. S. Khan and F. Marvon. Scalable Integration of Biomedical Sources. In the proceedings of the Workshop on Ontology, Grid and Semantic Integration for Biology (OGSB), July, 2006. Bordeaux, France.

  4. S. Khan and F. Marvon. Integrating Biomedical Sources on the Internet. In the proceedings of ISCA 19th International Conference on Parallel and Distributed Computing Systems (PDCS), pp: 165-170, San Francisco, California, USA, September, 2006.

  5. S. Khan and F. Marvon. Identifying Relevant Sources in Query Reformulation. In the proceedings of the 8th International Conference on Information Integration and Web-based Applications & Services (iiWAS2006), pp: 357-366, Yogyakarta, Indonesia, December 2006.

  6. S. Khan. The Role of Ontology in Query Mediation, In the proceedings of the 4th International Workshop on Multi-Agent Systems and Semantic Grid (MASSG06) Rawalpindi, Pakistan, December 2006.

  7. K. Munir, M. Odeh, R.McClatchey, S. Khan and I. Habib, Semantic Information Retrieval from Distributed Heterogeneous Data Sources, In the proceedings of 4th international workshop on the Frontiers of Information Technology (FIT 2006), Islamabad, Pakistan, December, 2006.

Optimization of Semantic Caching for XML Databases

(2004 - 2005)

The availability of information on the Internet has increased exponentially the Internet users in the last few years. This made the web servers overloaded. Semantic caches store the semantics and results of the previously asked queries close to the users, in order to answer new queries. The caching reduces the burden of the web servers and optimizes their performance. This project proposed the SMOCE Cache, a framework for semantic caching of XML database. Specialized techniques such as indexing and broadening of semantic regions have been developed for the optimization of semantic caching.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. A. Hamid and S. Khan. Optimized Matching for Semantic Caches Through Indexing. In the proceedings of IADIS Virtual Multi Conference on Computer Science and Information Systems (MCCIS 2005), April, 2005, Idea Group.

  2. A. Hamid and S. Khan. Optimization of Semantic Caches for XML Databases. In the proceedings the First International Conference on Information and Communication Technology (ICICT 2005), pp: 201-205, Karachi, Pakistan. August 2005. IEEE Xplore.

Expertise Matching System

(2004 - 2005)

Internet is a vital source to disseminate and share information to the masses. This has made information available in abundance on the Web. However, finding relevant information is difficult if not impossible. This difficulty is bilateral between information providers and seekers in terms of information presentation and accessibility respectively. This project proposed an online Peer-to-Peer Expertise Matching system. The approach provides a highly scalable and self-organizing system. This helps individuals in presenting and accessing the information in a consistent format on the Web. It makes the sharing of information among the autonomous organizations successful.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. S. Khan and S. M. Nabeel. OPEMS: Online Peer-to-Peer Expertise Matching System. In the proceedings the First International Conference on Information and Communication Technology (ICICT 2005), pp: 148-151, Karachi, Pakistan. August 2005, IEEE Xplore.

Efficient Data Storage for Dynamically Grouped CQs in XML Databases

(2004 - 2005)

Globalization of the Web made it difficult, if not impossible, to retrieve data in a structured format because data are not available in a single format over the Web. This project enhances the existing continual queries (CQ) optimization techniques form structured data to semistructured data for efficient retrieval of the data over the Web. The project proposes a technique to handles the proliferation of CQs’ results that happen due to dynamic grouping

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

 

Optimization of Web Caching

(2004 - 2005)

This project was based on content classification of webpages into categories such as static, periodic and dynamic. The project distributes the contents along with the webpage template in the hierarchical architecture of the Web caching in order to improve network bandwidth, save storage consumption in caches; and maintains load balance in the caching system.

Funding

National University of Sciences and Technology (NUST), Islamabad, Pakistan

Publications

  1. M. A. Iqbal and S. Khan. Content-based Hierarchical Web Cache. In the Proceedings International Conference on Computing and Informatics (ICOCI). Kuala Lumpur, Malaysia, June, 2006, IEEE CS

Optimization of Continual Queries

(1999-2002)

Continual queries are persistent queries that are issued once and then are run at regular intervals or when source data change until a termination condition is satisfied. The systems need to support a large number of queries due to the scale of the Internet. The contribution of this research are: (a) design of a new architecture for a scalable continual queries system, (b) design of a scalable and dynamic grouping technique to optimise the evaluation of continual queries, (c) design of a simple model for differential evaluation of continual queries to bypass the complete evaluation after an initial evaluation; and (d) development of a technique for differential evaluation of grouped continual queries

 

Funding

Ministry of Education, Govt. of Pakistan, Islamabad, Pakistan

 

Publications

  1. S. Khan and P. Mott. LeedsCQ: A Scalable Continual Queries System. Technical Report 2002.03, School of Computing, the University of Leeds, February 2002.

  2.  S. Khan and P. Mott. A Scalable and dynamic Grouping of Continual Queries. Technical Report 2001.17, School of Computing, the University of Leeds, September 2001.

  3.  S. Khan and P. Mott. Differential Evaluation of Continual Queries. Technical Report 2001.11, School of Computing, the University of Leeds, May 2001.

  4.  S. Khan and P. Mott. LeedsCQ: A Scalable Continual Queries System. In Proceedings of the International Conference on Database and Expert Systems Applications (DEXA), pp: 607-617, Aix-en Provence, France, September 2002, Springer.

  5. S. Khan and P. Mott. A Scalable and dynamic Grouping of Continual Queries. In Proceedings of the International Conference on Advances in Information Systems (ADVIS), pp: 31-42, Izmir, Turkey, October 2002, Springer.

 © Copyright SEECS 2009, All Rights Reserved

Last Updated (June 2010)