Webservice
Goals
The web service aims to extract textual content from a MediaWiki so that it can be integrated, in whole or in part, into a third-party website. The extracted data could also be processed on these websites to produce diagrams, graphics, maps, etc.
In addition, the web service must allow data harvesting from external sources (such as GRIN, Mansfeld, Prota4U, ...) to supplement and improve the content already available on the MW.
Content Export
The export of textual content has two objectives:
- Display text on a remote website, either as a whole or only in part.
- Propose a structure for exported data that allows processing similar to relational database queries.
How?
1) Explore all the possibilities MediaWiki offers for exporting pages.
2) Work on standardizing the page structure on our MW.
3) Make the structure uniform for data of the same type (taxobox, phylogeny, ...).
4) Develop web services based on a REST architecture that return an XML file for a page, using a common structure.
5) Use these web services to display data from the wiki on third-party websites.
6) Export content through these web services:
- allow exporting a whole page.
- allow exporting a specific data type, e.g. a particular template.
7) Get (XML?) content from the web services and display it as graphs/diagrams.
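The export steps above can be sketched roughly as follows. This is only an illustration under several assumptions: the wiki address `https://wiki.example.org/api.php` is a placeholder, the helper names (`build_page_query`, `find_template`, `template_to_dict`, `to_xml`) and the XML layout are hypothetical, and the sample taxobox values are made up. The first function builds a request for the standard MediaWiki Action API (whole-page export); the others pull one template out of the wikitext (data-type export) and serialize it with a common structure. The parser is deliberately naive and would not cope with nested templates or links containing `|`.

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlencode


def build_page_query(api_url, title):
    """Return an Action API URL that fetches the wikitext of one page."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "rvslots": "main",
        "titles": title,
        "format": "json",
    }
    return api_url + "?" + urlencode(params)


def find_template(wikitext, name):
    """Return the raw body of the first {{name ...}} call, or None."""
    start = wikitext.find("{{" + name)
    if start == -1:
        return None
    depth, i = 0, start
    while i < len(wikitext) - 1:
        pair = wikitext[i:i + 2]
        if pair == "{{":
            depth += 1
            i += 2
        elif pair == "}}":
            depth -= 1
            i += 2
            if depth == 0:
                # Strip the outer braces and return what is inside.
                return wikitext[start + 2:i - 2]
        else:
            i += 1
    return None  # unbalanced braces


def template_to_dict(body):
    """Split 'Name | key = value | ...' into a dict (top-level pipes only)."""
    fields = {}
    for part in body.split("|")[1:]:
        if "=" in part:
            key, value = part.split("=", 1)
            fields[key.strip()] = value.strip()
    return fields


def to_xml(template_name, fields):
    """Serialize the fields using an assumed common XML structure."""
    root = ET.Element("template", {"name": template_name})
    for key, value in fields.items():
        child = ET.SubElement(root, "field", {"name": key})
        child.text = value
    return ET.tostring(root, encoding="unicode")


# Made-up sample wikitext, standing in for a page fetched from the wiki.
sample = "Intro text.\n{{Taxobox\n| genus = Solanum\n| species = S. tuberosum\n}}\nMore text."
fields = template_to_dict(find_template(sample, "Taxobox"))
xml_output = to_xml("Taxobox", fields)
query_url = build_page_query("https://wiki.example.org/api.php", "Solanum tuberosum")
```

A third-party site could then either embed the whole page text or consume only the XML for one template, which is what makes uniform template structures (step 3) worthwhile.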
Import Content
Importing content from external websites will meet two needs:
- Give existing groups useful information to enhance collaboration.
- Supplement the information on the MW with data extracted from international databanks.
How?
1) Define the common structure shared by all the reference websites (maybe HTML).
2) Discover the specific data structures of GRIN, MANSFELD and GBIF.
3) Create a web service for the actual import of data into the MediaWiki.
4) Define which data are of interest and import them into the MW.
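One possible shape for the import steps above is a common record type plus a per-source field mapping. Everything here is an assumption made for illustration: the raw keys attributed to each source (`taxon`, `origin`, `species_name`, `range`), the `CommonRecord` class, and the `ExternalData` template name are hypothetical and do not reflect the real formats of GRIN or Mansfeld, which step 2 still has to discover.

```python
from dataclasses import dataclass, field


@dataclass
class CommonRecord:
    """Assumed common structure shared by all reference sources."""
    source: str
    name: str
    attributes: dict = field(default_factory=dict)


# Hypothetical mappings from source-specific keys to common keys.
# Keys missing from a mapping are treated as "not of interest".
FIELD_MAPS = {
    "GRIN": {"taxon": "name", "origin": "distribution"},
    "MANSFELD": {"species_name": "name", "range": "distribution"},
}


def normalize(source, raw):
    """Convert one raw source record into a CommonRecord."""
    mapping = FIELD_MAPS[source]
    name = None
    attributes = {}
    for raw_key, value in raw.items():
        common_key = mapping.get(raw_key)
        if common_key is None:
            continue  # skip data that is not of interest
        if common_key == "name":
            name = value
        else:
            attributes[common_key] = value
    if name is None:
        raise ValueError("record has no name field")
    return CommonRecord(source, name, attributes)


def to_wikitext(record, template="ExternalData"):
    """Render the record as a template call ready to insert in a wiki page."""
    lines = ["{{" + template,
             "| source = " + record.source,
             "| name = " + record.name]
    for key in sorted(record.attributes):
        lines.append("| " + key + " = " + record.attributes[key])
    lines.append("}}")
    return "\n".join(lines)


# Made-up raw record; "internal_id" is dropped because it is unmapped.
raw = {"taxon": "Example species", "origin": "Example region", "internal_id": "123"}
record = normalize("GRIN", raw)
wikitext = to_wikitext(record)
```

Keeping the mappings in data rather than in code means that adding a new reference source would mostly be a matter of adding one more entry to `FIELD_MAPS`.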
Technological choice
Not defined at this time: maybe PHP in conjunction with jQuery and a REST architecture, with data encoded in JSON.
Ultimately, the web services must be able to recognize requests from both partially controlled (MW) and uncontrolled (third-party sites) sources, identify the relevant information, and then retrieve, format, and return it to the sender of the request.
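The request flow described above (recognize the sender, find the relevant information, format it, return it) might look like the following minimal sketch. Since the technology is not yet chosen, Python is used here purely for illustration; the `PAGES` store, the `TRUSTED_ORIGINS` set, the `title` and `part` parameters, and the response layout are all hypothetical.

```python
import json

# Hypothetical in-memory stand-in for content extracted from the wiki.
PAGES = {
    "Example page": {
        "summary": "Short text.",
        "taxobox": {"genus": "Example"},
    }
}

# Origins considered partially controlled (the MW itself); assumed value.
TRUSTED_ORIGINS = {"https://wiki.example.org"}


def classify_origin(origin):
    """Label the sender as the wiki or an uncontrolled third party."""
    return "wiki" if origin in TRUSTED_ORIGINS else "third_party"


def handle_request(params, origin):
    """Return (HTTP status, JSON body) for one export request."""
    title = params.get("title")
    part = params.get("part")
    if not title:
        return 400, json.dumps({"error": "missing 'title' parameter"})
    page = PAGES.get(title)
    if page is None:
        return 404, json.dumps({"error": "unknown page"})
    if part is None:
        data = page  # whole page
    elif part in page:
        data = {part: page[part]}  # only the requested part
    else:
        return 404, json.dumps({"error": "unknown part"})
    body = {
        "requested_by": classify_origin(origin),
        "title": title,
        "data": data,
    }
    return 200, json.dumps(body, sort_keys=True)


status, body = handle_request(
    {"title": "Example page", "part": "taxobox"},
    "https://other.example.com",
)
```

In a real deployment the origin label could drive decisions such as rate limiting or which fields are exposed; here it is only echoed back in the response.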