User:Pathoschild/Automated metadata reporting

From Wikisource

(Don't mind this page, just a note to self.)

Goal

Automated maintenance and creation of index pages, author pages, report pages, and other lists based on {{header}} and {{author}} data.

Method

  1. Read all pages on Wikisource (or a local copy thereof), extracting the metadata from the HTML markup;
  2. create a comprehensive database of metadata;
  3. regularly read new pages and edits, update the database, and update pages appropriately;
  4. update author pages with works attributed to them but not listed;
  5. create customizable index/report pages using an editable subpage, mockup:
<report>
	<title>Wikisource:Ancient_authors</title>
	<query>SELECT author_name,author_birth,author_death FROM authors WHERE category LIKE 'Ancient_authors'</query>
	<format>* [[Author:<author_name/>|]] (<author_birth/>–<author_death/>)</format>
</report>
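
One way a report definition like this mockup might be rendered, as a minimal sketch only. It assumes the metadata already sits in an SQLite database with an `authors` table, and that each empty tag in `<format>` names a column of the query result. The helper names `fill` and `render_report` are made up for illustration.

```python
import sqlite3
import xml.etree.ElementTree as ET

# The mockup report, with the SQL string literal in single quotes for SQLite.
REPORT = """<report>
	<title>Wikisource:Ancient_authors</title>
	<query>SELECT author_name,author_birth,author_death FROM authors WHERE category LIKE 'Ancient_authors'</query>
	<format>* [[Author:<author_name/>|]] (<author_birth/>–<author_death/>)</format>
</report>"""


def fill(template, row):
    """Replace each empty child tag (e.g. <author_name/>) with that column's value."""
    parts = [template.text or ""]
    for child in template:
        parts.append(str(row[child.tag]))  # tag name doubles as column name
        parts.append(child.tail or "")     # literal text following the tag
    return "".join(parts)


def render_report(conn, report_xml):
    """Run the report's query and return (page title, wikitext body)."""
    report = ET.fromstring(report_xml)
    conn.row_factory = sqlite3.Row  # makes rows indexable by column name
    rows = conn.execute(report.findtext("query")).fetchall()
    body = "\n".join(fill(report.find("format"), row) for row in rows)
    return report.findtext("title"), body
```

Calling `render_report(conn, REPORT)` on a database whose `authors` table has the four columns used in the query returns the target page title and one wikitext list line per matching author. Parsing the definition as XML keeps the placeholder handling trivial, since each placeholder is already an element.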

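Steps 1 to 3 of the method could look roughly like the sketch below. It makes two loud assumptions: the element ids in `FIELD_IDS` are a guess at what the rendered {{header}} emits, not confirmed markup, and the database is SQLite with a `works` table keyed by title. `HeaderParser`, `extract_metadata` and `store` are illustrative names.

```python
import sqlite3
from html.parser import HTMLParser

# Assumption: the rendered {{header}} wraps each field in an element with one
# of these ids; adjust the mapping to whatever the real markup uses.
FIELD_IDS = {"header_title_text": "work_title", "header_author_text": "work_author"}
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta", "wbr"}


class HeaderParser(HTMLParser):
    """Collect the text inside each element listed in FIELD_IDS."""

    def __init__(self):
        super().__init__()
        self.metadata = {}
        self._field = None  # field currently being captured
        self._depth = 0     # nesting depth inside that field's element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return  # void tags have no closing tag, so they do not nest
        if self._field is not None:
            self._depth += 1
            return
        field = FIELD_IDS.get(dict(attrs).get("id"))
        if field:
            self._field, self._depth = field, 1
            self.metadata[field] = ""

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or self._field is None:
            return
        self._depth -= 1
        if self._depth == 0:
            self._field = None

    def handle_data(self, data):
        if self._field is not None:
            self.metadata[self._field] += data


def extract_metadata(html):
    """Return the header fields found in one rendered page."""
    parser = HeaderParser()
    parser.feed(html)
    parser.close()
    return {key: value.strip() for key, value in parser.metadata.items()}


def store(conn, metadata):
    """Insert or update one work; re-running on an edited page refreshes its row."""
    conn.execute("CREATE TABLE IF NOT EXISTS works (work_title PRIMARY KEY, work_author)")
    conn.execute(
        "INSERT OR REPLACE INTO works (work_title, work_author) VALUES (?, ?)",
        (metadata.get("work_title"), metadata.get("work_author")),
    )
```

Running `store(conn, extract_metadata(html))` for every new or edited page keeps the database current, since the same title simply overwrites its previous row.
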
Various ideas

Dynamic report pages
<report>
	<title>Author:<author_name/></title>
	<query>SELECT author_name,work_title,work_date FROM authors LEFT JOIN works ON author_name=work_author ORDER BY author_name</query>
	<format>* [[<work_title/>|]] (<work_date/>)</format>
</report>
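
Because the title here contains `<author_name/>`, a single query would fan out into one page per author. A rough sketch of that grouping, assuming an SQLite database with `authors` and `works` tables as named in the query (`build_author_pages` is a made-up name):

```python
import sqlite3
from collections import defaultdict

QUERY = (
    "SELECT author_name, work_title, work_date "
    "FROM authors LEFT JOIN works ON author_name = work_author "
    "ORDER BY author_name"
)


def build_author_pages(conn):
    """Return {page title: wikitext}, one entry per author in the result set."""
    pages = defaultdict(list)
    for author_name, work_title, work_date in conn.execute(QUERY):
        lines = pages[f"Author:{author_name}"]  # creates the page even if empty
        if work_title is not None:  # LEFT JOIN yields NULLs for authors without works
            lines.append(f"* [[{work_title}|]] ({work_date})")
    return {title: "\n".join(lines) for title, lines in pages.items()}
```

Authors without any recorded works still get an entry (with an empty body), which is the point of using `LEFT JOIN` rather than an inner join.
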
Indexes report
  • incomplete indexes (missing a ToC, publisher, date, etc.)
  • indexes ordered by size / number of pages
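
Both index reports might reduce to simple queries once index metadata is in the database. The sketch below assumes a hypothetical SQLite table `indexes(index_title, has_toc, publisher, pub_date, page_count)`; none of these names come from the notes above.

```python
import sqlite3


def incomplete_indexes(conn):
    """Return one wikitext line per index that lacks a ToC, publisher, or date."""
    rows = conn.execute(
        "SELECT index_title, has_toc, publisher, pub_date "
        "FROM indexes ORDER BY index_title"
    )
    lines = []
    for title, has_toc, publisher, pub_date in rows:
        fields = (("ToC", has_toc), ("publisher", publisher), ("date", pub_date))
        missing = [name for name, value in fields if not value]  # NULL, 0 or ''
        if missing:
            lines.append(f"* [[{title}]] (missing: {', '.join(missing)})")
    return lines


def indexes_by_size(conn):
    """Return one wikitext line per index, largest page count first."""
    rows = conn.execute(
        "SELECT index_title, page_count FROM indexes ORDER BY page_count DESC"
    )
    return [f"* [[{title}]] ({pages} pages)" for title, pages in rows]
```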