Wikisource:Scan Lab
If you need help with a scan, add your request in the relevant section below as a new sub-section. If you can, include all the details someone will need to work on the request without further questioning. You can use {{ping project|Scan Lab}} to send an immediate notification to all subscribed Scan Lab members. Once you have been answered, ping only that user when you reply with {{re|Their username}} (do not ping the whole project on every comment).
If your request has been completed, you should acknowledge that your issue is resolved and close the section with {{section resolved|1=~~~~}}
.
Participants
[edit]Add your name to Module:Mass notification/groups/Scan Lab to be notified via {{ping project|Scan Lab}}. Also add your name below with details of any particular tasks you can help with.
Participant | Can help with | Instructions |
---|---|---|
Inductiveload |
|
|
Xover |
|
|
Mpaa |
|
Requests for downloading scans
[edit]If you would like scans that already exist online to be transferred to Wikisource, leave a message here. This includes batch transfers from the Internet or Hathi Trust for multi-volume works. Please include necessary bibliographic information so that scans can be uploaded to Commons with proper information and license templates. Author, country, and date of first publication. A suggested file name on Commons can also be helpful.
Jane Austen Juvenilia Volume 2 and 3
[edit]Notifying all members of Scan Lab (more info · opt out): The scans of the manuscripts of Austen's Juvenelia are available on here and here. They're both in the PD, but I have absolutely no clue as how to download them. The images are higher resolution than the ones on the BL website, but they're in the zoomify flash format. Languageseeker (talk) 02:58, 2 February 2022 (UTC)
- Languageseeker: I know Volume the Second is in the public domain; it’s already been transcribed here. Are we sure that Volume the Third is in the public domain? It could easily fall into a copyright trap, so I just want to make sure. TE(æ)A,ea. (talk) 22:59, 8 February 2022 (UTC)
- @TE(æ)A,ea. The British Library has it listed as "Public Domain in most countries other than the UK." Languageseeker (talk) 23:07, 8 February 2022 (UTC)
- So it looks like it was definitely published in 1951 (which would imply copyright expiry in 2001 in the UK as 50 years after publication), which makes the UK copyright claim weird. If true that would postdate the URAA date ... MarkLSteadman (talk) 00:06, 9 February 2022 (UTC)
- That is volume 3 (Evelyn and Kitty the Bower). Volume 1 was published in 1933 (so it was in the PD on the URAA date). MarkLSteadman (talk) 00:19, 9 February 2022 (UTC)
- So it looks like it was definitely published in 1951 (which would imply copyright expiry in 2001 in the UK as 50 years after publication), which makes the UK copyright claim weird. If true that would postdate the URAA date ... MarkLSteadman (talk) 00:06, 9 February 2022 (UTC)
- @TE(æ)A,ea. The British Library has it listed as "Public Domain in most countries other than the UK." Languageseeker (talk) 23:07, 8 February 2022 (UTC)
Mooresville, Indiana High School yearbooks, 1914–1930
[edit]Notifying all members of Scan Lab (more info · opt out): These scans exist in the form of galleries on the Mooresville High School Alumni Association's Facebook page, and extracting them by hand is tedious enough that I'm hoping someone can do it with a bot. The procedure I have in mind is:
- Go to the Facebook galleries, which are more conveniently linked at https://mooresvillelib.org/mooresville-high-school-digitized-yearbooks/
- Except the 1920 yearbook, because there are two albums for it and the one linked on the library website is missing a page. For 1920, use this album: https://www.facebook.com/media/set/?set=a.2615532255167809&type=3
- Extract the images.
- Upload the images to Commons, numbered sequentially, to allow for for later image extraction (and scan repair if needed; it turned out their scan of the 1909 yearbook was missing some pages).
- Combine the images into a PDF or DJVU and upload that to Commons as well.
- Use commons:Category:Mooresville High School Yearbook, 1911 as a model for categorization, metadata, license tags and naming convention.
- Main file: File:The Levenite, Mooresville High School, 1911.pdf; page image: File:The Levenite, Mooresville High School, 1911 01.jpg
- The 1930 yearbook should be tagged {{PD-US-no notice}}.
- Note that the yearbooks aren't all called "The Levenite"; they changed the title each year. This matters for filling out the {{book}} template anyway, so I will also request that the title be used in the file name.
Thanks! —CalendulaAsteraceae (talk • contribs) 01:59, 22 September 2023 (UTC)
Penny Cyclopedia volumes 1 to 27
[edit]The IA scans currently linked on the page are unusable (blank pages where there should be content), so I checked HathiTrust ([here's the search I used]). There are four complete sets of scans attached to [this record] (ignoring the supplements for now), but I'm not sure at the moment which ones would be the best to import. Arcorann (talk) 02:14, 24 December 2023 (UTC)
- I've found pretty good scans of volumes 4 and 24 which are already on Commons, and I've added the links to the Penny Cyclopedia page. I don't have a Hathi Trust account, so I can't help you there. Ciridae (talk) 05:21, 27 December 2023 (UTC)
Journal of the Optical Society of America
[edit]Volumes 1-40 of this fairly esteemed journal are out of copyright. Vol. 30, issue 12 and Vol. 33, issue 7 are here already, but there are *a lot* that are not here: https://archive.org/details/pub_optical-society-of-america-journal If you upload them, I can tidy the pile up at commons and get them ready to go here. For copyright concerns: https://onlinebooks.library.upenn.edu/webbin/cinfo/jopticalsocamerica --RaboKarbakian (talk) 20:43, 8 February 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): would someone with access please download the scan from HathiTrust? Thanks! —Beleg Âlt BT (talk) 17:34, 22 April 2024 (UTC)
- Beleg Âlt: Here you go: File:The Singing Bone.pdf. TE(æ)A,ea. (talk) 22:46, 17 May 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): would someone with access please download the scan from HathiTrust? Thanks! —Beleg Âlt BT (talk) 16:31, 6 May 2024 (UTC)
- @Beleg Tâl: -- c:File:The Coming of Cassidy and the Others - Clarence E. Mulford.pdf -- Hrishikes (talk) 14:20, 4 July 2024 (UTC)
Per this discussion, I'd like to replace the current PDF scan with a DJVU scan. However, both IA-Upload and Any2Djvu are stalling out on me, and pdf2djvu.com gave pretty shoddy results. Could someone please upload this scan as a new version of File:Dictionary of Hymnology 1908.djvu? Thanks!
- The IAA-Upload failure looks to be: https://phabricator.wikimedia.org/T215647 caused by the number of pages. MarkLSteadman (talk) 03:34, 4 July 2024 (UTC)
- @Beleg Tâl: -- c:File:A Dictionary of Hymnology - John Julian.djvu -- Hrishikes (talk) 16:38, 5 July 2024 (UTC)
- @Hrishikes you're the best!! —Beleg Tâl (talk) 17:59, 5 July 2024 (UTC)
- @Beleg Tâl: -- c:File:A Dictionary of Hymnology - John Julian.djvu -- Hrishikes (talk) 16:38, 5 July 2024 (UTC)
Finding scans
[edit]Requests for locating scans for existing works at Wikisource, or works you wish to add yourself but cannot find scans for. For general text requests, see Wikisource:Requested texts.
The Criterion Volume 2 and 3
[edit]Notifying all members of Scan Lab (more info · opt out): Would it be possible to locate Volumes 2 and 3 of The Criterion? I'm especially trying to complete The Woman Who Rode Away that began in Volume 3. Languageseeker (talk) 18:36, 23 December 2022 (UTC)
- Languageseeker: Volume 2 is available here. I can’t find volume 3; however, it is available on microfilm. TE(æ)A,ea. (talk) 22:05, 26 December 2022 (UTC)
- Volume 2 now at Index:The Criterion - Volume 2.djvu (and v1 replaced too, it was previously a reprint). Inductiveload—talk/contribs 15:54, 29 December 2022 (UTC)
- @TE(æ)A,ea. @Inductiveload Thank you both for these. Hopefully, Volume 3 turns up at some point. Soon we can also look for Volume 5. Hooray for PD day! Languageseeker (talk) 17:35, 29 December 2022 (UTC)
- Languageseeker: The Criterion, volume 3, is available here. Some extra work will need to be done: the pages are two-to-one (scan), the 102–103 spread is duplicated, the 340–341 spread is duplicated, and the first page of the index (spread) is duplicated. I still have the reel, so if you need anything from this volume or from volumes 1 or 2 of The Criterion or volume 4 of The New Criterion, I can go through the reel. TE(æ)A,ea. (talk) 16:19, 13 January 2023 (UTC)
- @TE(æ)A,ea. @Inductiveload Thank you both for these. Hopefully, Volume 3 turns up at some point. Soon we can also look for Volume 5. Hooray for PD day! Languageseeker (talk) 17:35, 29 December 2022 (UTC)
- Volume 2 now at Index:The Criterion - Volume 2.djvu (and v1 replaced too, it was previously a reprint). Inductiveload—talk/contribs 15:54, 29 December 2022 (UTC)
Scan repair
[edit]Request repair work on existing scans here.
When requesting page insertion, rearrangement or deletion, always include the page numbers (as marked on the pages) as well as the position of the page within the scan file. This makes it much easier for the repairing user to locate the defect in the file and fix it, as well as allowing a double-check against mistakes.
Please do not use this page to request repairs on works that you don’t really care about: the backlog at Category:Index - File to fix is a known backlog. If you want to help with those, you can add {{missing pages}} to those indexes if they do not already have it, along with details of the missing pages.
Notifying all members of Scan Lab (more info · opt out): This scan is missing two pages (xxvi–xxvii). Also, it would be nice if the images for this volume and the second volume could be regenerated, as they are of quite poor quality. TE(æ)A,ea. (talk) 22:21, 3 December 2023 (UTC)
- @TE(æ)A,ea.: Done (missing pages xxvi–xxvii). Existing text moved. M-le-mot-dit (talk) 15:10, 25 October 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): Pages 482 and 483 of this volume were missing in the original scan; pageholders have been introduced, so all that is necessary is the replacement. That replacement can come from Index:Alumnioxonienses02univ.pdf, which exists solely for the purpose of supplying that gap. So, the missing pages from the PDF should be added in over the pageholders from the DJVU; the transclusion fixed; and the PDF deleted. TE(æ)A,ea. (talk) 23:46, 3 December 2023 (UTC)
- Not sure I follow, pages 482 and 483 (djvu/99 and djvu/100) seem to be legit images and the 2 missing pages should be inserted between djvu/100 and djvu/101. Or ...? Mpaa (talk) 18:09, 4 December 2023 (UTC)
This file claims to be Volume 135 and is residing in the list of volumes as Volume 135 but it is actually Volume 136, probably (but not verified) a duplicate of Index:The Atlantic Monthly Volume 135.djvu. Can the file be replaced with https://babel.hathitrust.org/cgi/pt?id=uc1.32106019602660 ?--RaboKarbakian (talk) 15:47, 29 March 2024 (UTC)
- Also, while you are at it:
- --RaboKarbakian (talk)
File was renamed at Commons, and needs re-aligning.
https://en.wikisource.org/w/index.php?search=intitle%3A%2FA+dictionary+of+the+language+of+Mota.djvu%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1&ns100=1&ns102=1&ns104=1&ns106=1&ns114=1 ShakespeareFan00 (talk) 17:40, 1 May 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): A bit of a different one this time. This work contains several copyrighted images that need to be blanked out in the scan. The affected pages are listed here: Index talk:Sm all cc.pdf#Possible copyright violation. —Beleg Tâl (talk) 15:17, 14 May 2024 (UTC)
- Beleg Tâl: I’ve redacted all of the images marked as copyrighted in the text from the talk page and re-uploaded the file. The images should be gone once you clear the caches. TE(æ)A,ea. (talk) 17:18, 17 May 2024 (UTC)
- @Beleg Tâl (CC: @Beleg Âlt): Is this resolved? Xover (talk) 16:38, 1 June 2024 (UTC)
- Hmm, the mediawiki software doesn't want to load the page images for me, but I trust TE(æ) to have done a good job :) —Beleg Tâl (talk) 13:17, 3 June 2024 (UTC)
- @TE(æ)A,ea.: It looks like the new PDF you uploaded makes MediaWiki choke. Could you try to regenerate it using different tools or options? Xover (talk) 09:18, 4 June 2024 (UTC)
- Xover: MediaWiki (maybe just Commons?) has been causing me a lot of problems lately in terms of PDFs, so many times I’m not sure if any of the files work. TE(æ)A,ea. (talk) 00:17, 16 June 2024 (UTC)
- @TE(æ)A,ea.: It looks like the new PDF you uploaded makes MediaWiki choke. Could you try to regenerate it using different tools or options? Xover (talk) 09:18, 4 June 2024 (UTC)
- Hmm, the mediawiki software doesn't want to load the page images for me, but I trust TE(æ) to have done a good job :) —Beleg Tâl (talk) 13:17, 3 June 2024 (UTC)
- @Beleg Tâl (CC: @Beleg Âlt): Is this resolved? Xover (talk) 16:38, 1 June 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): The pages here are offset when they are loaded. The PDF is correct, the text layer is correct, and if you call OCR the right pages are referenced; however, the wrong pages show us visually. I don’t where this problem originates. TE(æ)A,ea. (talk) 00:16, 16 June 2024 (UTC)
- @TE(æ)A,ea. it seems ok to me. Mpaa (talk) 20:15, 6 July 2024 (UTC)
- Mpaa: Yes, this has been fixed. Feel free to close this request. TE(æ)A,ea. (talk) 18:40, 8 July 2024 (UTC)
This scan is in the Monthly Challenge, but is missing the images facing pages 16 and 304. Can those images be found and inserted (and black verso) into the correct locations in the scan? There is a list of illustrations beginning on this page. --EncycloPetey (talk) 16:46, 14 August 2024 (UTC)
- @EncycloPetey: Done. 2 images inserted (without text layout). Pages after djvu 77 should be moved or deleted.--M-le-mot-dit (talk) 12:41, 28 August 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): The OCR layer is offset from the pages, but not in a consistent way.
- Pages 27-65, OCR off by one page
- Pages 66-123, OCR off by two pages
- Pages 124+, OCR off by three pages
(Page numbers refer to the scan page, not the book page. OCR is shifted forwards, toward the front of the book.) —Beleg Tâl (talk) 14:43, 23 September 2024 (UTC)
I have yet to create to create an index for this file, as I did not realise that the front page was Google's digital scan statement (the page images weren't all showing up properly on Internet Archive, before I uploaded to Commons). Could you please delete the first page of the djvu?
Thanks, TeysaKarlov (talk) 23:49, 26 October 2024 (UTC)
- @TeysaKarlov: Done. Content replaced by a pdf conversion (from the same source). Original djvu file has a poor definition and many images are missing. // M-le-mot-dit (talk) 10:04, 27 October 2024 (UTC)
- @M-le-mot-dit Thanks for the improvements, and quick turnaround! Regards, TeysaKarlov (talk) 19:18, 27 October 2024 (UTC)
File:The Best continental short stories of and the yearbook of the continental short story 1924-25.pdf
[edit]This file has a lot of pages missing, see Index:The Best continental short stories of and the yearbook of the continental short story 1924-25.pdf. There is a better scan now available at https://babel.hathitrust.org/cgi/pt?id=uc1.b3123528 but the sequenced pages 264 to 317 of the scan are duplicated and need to be removed. --Jan Kameníček (talk) 11:51, 30 October 2024 (UTC)
- @Jan.Kamenicek: Done // M-le-mot-dit (talk) 16:38, 30 October 2024 (UTC)
Notifying all members of Scan Lab (more info · opt out): The current scans for The Story of My Experiments with Truth/Volume 1 is missing pages (without placeholders) and duplicates others. I've uploaded a corrected file here: File:Gandhi, 1927, The Story of My Experiments With Truth, Vol 1.pdf. I need assistance in moving the current project over to the new scans while keeping the already proofread pages. Thanks! — Qx3Jw (talk) 14:40, 30 October 2024 (UTC)
See also
[edit]- Commons:Graphic Lab at Wikimedia Commons - they can help with general image problems
- Image extraction - guidance for extracting images from scans
- Requested texts - general text requests. Many of these also need scans to be located.
- Category:Index - File to fix - contains indexes that have various defects. Please do add templates like {{missing pages}} if needed to indicate what the problems are, but please do not bring the files here unless you would like it fixed to allow work in the near future.