⚓ T301291 PDF and Djvu files on Commons failed to be processed (no thumbnails, zero pages) but otherwise valid
Page Menu
Phabricator
Create Task
Maniphest
T301291
PDF and Djvu files on Commons failed to be processed (no thumbnails, zero pages) but otherwise valid
Open, Needs Triage
Public
Actions
Edit Task
Edit Related Tasks...
Create Subtask
Edit Parent Tasks
Edit Subtasks
Merge Duplicates In
Close As Duplicate
Edit Related Objects...
Edit Commits
Edit Mocks
Mute Notifications
Protect as security issue
Assigned To
None
Authored By
Mitar
Feb 8 2022, 9:15 PM
2022-02-08 21:15:21 (UTC+0)
Tags
Commons
(Incoming)
MediaWiki-extensions-PdfHandler
(Backlog)
Referenced Files
None
Subscribers
Aklapper
Albertoleoncio
Ammarpad
Arcorann
Balajijagadesh
Base
Draco_flavus
View All 16 Subscribers
Description
I went through Wikimedia Commons dump and checked for all invalid PDF and Djvu files (those with no thumbnails, 0x0 size, and zero pages) and tested them. Those which were really invalid I tried to replace with a fixed version, or if I could not find a fixed version, I marked them for speedy deletion.
But I have found some files which look invalid on Commons which seems to load fine (at least in Firefox for PDF, and ddjvu for Djvu files). Maybe there is some issue with how they are processed on the backend?
Here is the list:
(processing of thumbnails started, but then it died)
See also (and possibly duplicate with):
T297942
T298417
T299521
Related Objects
Mentions
Duplicates
Mentioned In
T364445: DJVU file generated is apparently 0x0 pixels
T342510: 175MB PDF on Portuguese Wikisource failed to be processed but otherwise valid
T302677: Metadata of a PDF in image table dump does not match the website
T297942: Specific PDF on Commons has no image thumbnails, dimensions shown as 0x0 pixels
Mentioned Here
T297942: Specific PDF on Commons has no image thumbnails, dimensions shown as 0x0 pixels
T298417: Undeleted djvu files show incorrect metadata: 0x0 size, no page number info
T299521: PDF file has 0x0 image size in Commons after uploading a new version while the page number is correct
Duplicates Merged Here
T342510: 175MB PDF on Portuguese Wikisource failed to be processed but otherwise valid
Event Timeline
Mitar
created this task.
Feb 8 2022, 9:15 PM
2022-02-08 21:15:21 (UTC+0)
Restricted Application
added subscribers:
Base
Aklapper
View Herald Transcript
Feb 8 2022, 9:15 PM
2022-02-08 21:15:22 (UTC+0)
Stang
subscribed.
Feb 8 2022, 10:16 PM
2022-02-08 22:16:04 (UTC+0)
Bugreporter
updated the task description.
(Show Details)
Feb 9 2022, 1:10 AM
2022-02-09 01:10:05 (UTC+0)
Mitar
added a comment.
Feb 9 2022, 6:51 AM
2022-02-09 06:51:42 (UTC+0)
Comment Actions
What is this wikimirror.org? Why change links to that?
So this list is exhaustive. I went through all PDFs and Djvu files on Wikimedia Commons as of previous week. Not just a random example. if we fix these, then all of them will be fixed. :-)
Mitar
mentioned this in
T297942: Specific PDF on Commons has no image thumbnails, dimensions shown as 0x0 pixels
Feb 9 2022, 6:52 AM
2022-02-09 06:52:51 (UTC+0)
Bugreporter
updated the task description.
(Show Details)
Feb 9 2022, 9:52 AM
2022-02-09 09:52:02 (UTC+0)
Yann
subscribed.
Feb 12 2022, 8:35 PM
2022-02-12 20:35:16 (UTC+0)
mau
subscribed.
Feb 13 2022, 9:49 AM
2022-02-13 09:49:28 (UTC+0)
Comment Actions
the same happens with
(which is newer, so it was not in the list)
Mitar
added a comment.
Feb 15 2022, 7:48 PM
2022-02-15 19:48:23 (UTC+0)
Comment Actions
No, this one seems just a slightly broken PDF. I just fixed it.
mau
added a comment.
Feb 15 2022, 8:45 PM
2022-02-15 20:45:03 (UTC+0)
Comment Actions
that's odd, I saved the pdf file starting from a Word document. (Ok, at a second thought that's not odd at all :-) ) Thanks!
Mitar
added a comment.
Feb 15 2022, 8:51 PM
2022-02-15 20:51:59 (UTC+0)
Comment Actions
So I fixed it using
mutool clean
. But the ones I listed above cannot be fixed this way. And this is what I am reporting. So
mutool clean
does not fix it, looking at
MediaBox
values show reasonable page sizes (including the first page), and even metadata (
example for the first file above
shows page size available:
"name": "pdf-PageSize",
"value": [
"name": 0,
"value": "612 x 792 pts (letter)"
},
"name": 1,
"value": "697 x 855 pts"
But Mediawiki does not show width and height. So something is wrong.
Mitar
added a comment.
Feb 15 2022, 8:52 PM
2022-02-15 20:52:47 (UTC+0)
Comment Actions
@mau
If you made this PDF yourself, could I recommend removing the first blank page? Because otherwise the first thumbnail does not show anything.
mau
added a comment.
Feb 15 2022, 8:54 PM
2022-02-15 20:54:34 (UTC+0)
Comment Actions
@Mitar
probably it's even better to substitute the first page with the actual cover for the book, indeed. I proceed :-)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 2:45 PM
2022-02-27 14:45:34 (UTC+0)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 2:49 PM
2022-02-27 14:49:15 (UTC+0)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 3:32 PM
2022-02-27 15:32:21 (UTC+0)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 4:56 PM
2022-02-27 16:56:21 (UTC+0)
Mitar
updated the task description.
(Show Details)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 5:01 PM
2022-02-27 17:01:52 (UTC+0)
Mitar
updated the task description.
(Show Details)
Mitar
updated the task description.
(Show Details)
Feb 27 2022, 5:23 PM
2022-02-27 17:23:50 (UTC+0)
Mitar
mentioned this in
T302677: Metadata of a PDF in image table dump does not match the website
Mar 1 2022, 5:14 PM
2022-03-01 17:14:45 (UTC+0)
Want
subscribed.
May 19 2023, 9:14 AM
2023-05-19 09:14:04 (UTC+0)
Comment Actions
I ran into the same problem. I don't know if this can be considered a solution, because these steps have to be done on the server side, but I solved my problem:
step – repair thumbnails for files of the core MediaWiki
php maintenance/refreshImageMetadata.php --verbose --mime image/vnd.djvu --force
step – do null edit of the index pages by Extension:Proofread_Page (need for actualization info about the pages count for special page)
php maintenance/refreshLinks.php --namespace 252
Albertoleoncio
mentioned this in
T342510: 175MB PDF on Portuguese Wikisource failed to be processed but otherwise valid
Jul 23 2023, 7:50 PM
2023-07-23 19:50:29 (UTC+0)
Aklapper
added a project:
MediaWiki-extensions-PdfHandler
Jul 24 2023, 9:15 AM
2023-07-24 09:15:42 (UTC+0)
Aklapper
merged a task:
T342510: 175MB PDF on Portuguese Wikisource failed to be processed but otherwise valid
Aklapper
added a subscriber:
Albertoleoncio
Draco_flavus
subscribed.
Sep 14 2023, 6:28 PM
2023-09-14 18:28:34 (UTC+0)
Comment Actions
I think, I encountered the same problem with the file:
. On wikisource it seems to be useless:
Arcorann
subscribed.
May 20 2024, 6:20 AM
2024-05-20 06:20:24 (UTC+0)
Uzume
mentioned this in
T364445: DJVU file generated is apparently 0x0 pixels
Aug 11 2024, 12:20 PM
2024-08-11 12:20:38 (UTC+0)
Uzume
subscribed.
Jan 4 2025, 2:05 PM
2025-01-04 14:05:52 (UTC+0)
Ammarpad
subscribed.
Oct 28 2025, 11:13 AM
2025-10-28 11:13:04 (UTC+0)
Comment Actions
The listed pdf files seem to have been fixed now? Maybe the problem only remain for djvu files now.
Seppl2013
subscribed.
Edited
Jan 5 2026, 8:32 AM
2026-01-05 08:32:24 (UTC+0)
Comment Actions
MediaWiki does not support unbundled DjVu files any more all non bundled/indexed files need to be converted with
djvmcvt -b "$DJVU_PATH" "$BUNDLED_FILE"
See
for a tool that can help with the mass migration of your wiki that can also help you created migration scripts.
Script example:
#!/bin/bash
# DjVu bundling script
# Generated for: /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1948_Hermsdorf.djvu
# Date: 2026-01-05T09:24:30.031857
set
-e
# Exit on error
# Define variables
DJVU_PATH
/var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1948_Hermsdorf.djvu
DJVU_DIR
/var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08
BASENAME
Stadtroda-Kreis-AB-1948_Hermsdorf.djvu
BACKUP_FILE
/var/www/mediawiki/sites/genwiki.genealogy.net/djvu/backup/Stadtroda-Kreis-AB-1948_Hermsdorf.zip
BUNDLED_FILE
/var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1948_Hermsdorf_bundled.djvu
# Step 1: Create backup ZIP
cd
$DJVU_DIR
echo
'Creating backup ZIP...'
zip -j
$BACKUP_FILE
$BASENAME
Stadtroda-Kreis-AB-1942-010_0001.djbz
Stadtroda-Kreis-AB-1942-001_0001.djvu
Stadtroda-Kreis-AB-1942-002_0001.djvu
Stadtroda-Kreis-AB-1942-003_0001.djvu
Stadtroda-Kreis-AB-1942-004_0001.djvu
Stadtroda-Kreis-AB-1942-005_0001.djvu
Stadtroda-Kreis-AB-1942-006_0001.djvu
Stadtroda-Kreis-AB-1942-007_0001.djvu
Stadtroda-Kreis-AB-1942-008_0001.djvu
Stadtroda-Kreis-AB-1942-009_0001.djvu
Stadtroda-Kreis-AB-1942-010_0001.djvu
Stadtroda-Kreis-AB-1942-020_0001.djbz
Stadtroda-Kreis-AB-1942-011_0001.djvu
Stadtroda-Kreis-AB-1942-012_0001.djvu
Stadtroda-Kreis-AB-1942-013_0001.djvu
Stadtroda-Kreis-AB-1942-014_0001.djvu
Stadtroda-Kreis-AB-1942-015_0001.djvu
Stadtroda-Kreis-AB-1942-016_0001.djvu
Stadtroda-Kreis-AB-1942-017_0001.djvu
Stadtroda-Kreis-AB-1942-018_0001.djvu
Stadtroda-Kreis-AB-1942-019_0001.djvu
Stadtroda-Kreis-AB-1942-020_0001.djvu
Stadtroda-Kreis-AB-1942-030_0001.djbz
Stadtroda-Kreis-AB-1942-021_0001.djvu
Stadtroda-Kreis-AB-1942-022_0001.djvu
Stadtroda-Kreis-AB-1942-023_0001.djvu
Stadtroda-Kreis-AB-1942-024_0001.djvu
Stadtroda-Kreis-AB-1942-025_0001.djvu
Stadtroda-Kreis-AB-1942-026_0001.djvu
Stadtroda-Kreis-AB-1942-027_0001.djvu
Stadtroda-Kreis-AB-1942-028_0001.djvu
Stadtroda-Kreis-AB-1942-029_0001.djvu
Stadtroda-Kreis-AB-1942-030_0001.djvu
Stadtroda-Kreis-AB-1942-039_0001.djbz
Stadtroda-Kreis-AB-1942-031_0001.djvu
Stadtroda-Kreis-AB-1942-032_0001.djvu
Stadtroda-Kreis-AB-1942-033_0001.djvu
Stadtroda-Kreis-AB-1942-034_0001.djvu
Stadtroda-Kreis-AB-1942-035_0001.djvu
Stadtroda-Kreis-AB-1942-036_0001.djvu
Stadtroda-Kreis-AB-1942-037_0001.djvu
Stadtroda-Kreis-AB-1942-038_0001.djvu
Stadtroda-Kreis-AB-1942-039_0001.djvu
# Step 2: Verify backup was created
if
! -f
$BACKUP_FILE
then
echo
'Error: Backup ZIP not created'
exit
fi
echo
'Backup created: '
$BACKUP_FILE
# Step 3: Convert to bundled format
echo
'Converting to bundled format...'
djvmcvt -b
$DJVU_PATH
$BUNDLED_FILE
# Step 4: Verify bundled file was created
if
! -f
$BUNDLED_FILE
then
echo
'Error: Bundled file not created'
exit
fi
echo
'Bundled file created: '
$BUNDLED_FILE
# Step 5: Sleep for CIFS sync (if needed)
sleep
# Step 6: Remove original files
echo
'Removing original files...'
rm -f
$DJVU_PATH
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-010_0001.djbz
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-001_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-002_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-003_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-004_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-005_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-006_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-007_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-008_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-009_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-010_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-020_0001.djbz
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-011_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-012_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-013_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-014_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-015_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-016_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-017_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-018_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-019_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-020_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-030_0001.djbz
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-021_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-022_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-023_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-024_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-025_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-026_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-027_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-028_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-029_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-030_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-039_0001.djbz
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-031_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-032_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-033_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-034_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-035_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-036_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-037_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-038_0001.djvu
rm -f /var/www/mediawiki/sites/genwiki.genealogy.net/images/0/08/Stadtroda-Kreis-AB-1942-039_0001.djvu
# Step 7: Move bundled file to original location
echo
'Moving bundled file to original location...'
mv
$BUNDLED_FILE
$DJVU_PATH
echo
'Bundling complete!'
echo
'Backup saved at: '
$BACKUP_FILE
docker
exec
genwiki39-mw php maintenance/refreshImageMetadata.php --force --mime
image/vnd.djvu --start
Stadtroda-Kreis-AB-1948_Hermsdorf.djvu --end
Stadtroda-Kreis-AB-1948_Hermsdorf.djvu
Seppl2013
added a comment.
Jan 5 2026, 9:51 AM
2026-01-05 09:51:34 (UTC+0)
Comment Actions
since
im offering to help fixing all djvu files on commons. Whom do i need to contact?
Seppl2013
added a comment.
Jan 8 2026, 4:35 PM
2026-01-08 16:35:04 (UTC+0)
Comment Actions
To get more attention to this i posted
Seppl2013
added a comment.
Edited
Jan 10 2026, 12:53 PM
2026-01-10 12:53:09 (UTC+0)
Comment Actions
Some files just seem to be broken
wget https://upload.wikimedia.org/wikipedia/commons/2/25/%D0%A3%D0%BA%D0%B0%D0%B7%D0%B0%D1%82%D0%B5%D0%BB%D1%8C_%D1%81%D1%82%D0%B0%D1%82%D0%B5%D0%B9_%D0%BC%D0%BE%D1%80%D1%81%D0%BA%D0%BE%D0%B3%D0%BE_%D1%81%D0%B1%D0%BE%D1%80%D0%BD%D0%B8%D0%BA%D0%B0_1848_-_1872_%D0%B3._1875%282%29.djvu
mv "Указатель_статей_морского_сборника_1848_-_1872_г._1875(2).djvu" "morskoy_sbornik.djvu"
file morskoy_sbornik.djvu
morskoy_sbornik.djvu: DjVu multiple page document
djvudump morskoy_sbornik.djvu
*** [1-15108] Fehlerhafte IFF-Datei (ungültige Abschnitts-ID)
*** (IFFByteStream.cpp:248)
*** 'int DJVU::IFFByteStream::get_chunk(GUTF8String &, int *, int *)'
ReneeWrites
subscribed.
Edited
Jan 11 2026, 3:03 PM
2026-01-11 15:03:48 (UTC+0)
Comment Actions
This problem still persists with PDFs. If there is a working solution for PDFs (that I missed in the above discussion), I would love to know which steps I can take to fix them!
ReneeWrites
added a comment.
Jan 11 2026, 3:55 PM
2026-01-11 15:55:21 (UTC+0)
Comment Actions
I think I found a working solution using Mutool. Will update if the thumbnail breaks again after a few days, if not I will apply this solution to the other files.
RoyZuo
subscribed.
Jan 21 2026, 8:38 PM
2026-01-21 20:38:26 (UTC+0)
Comment Actions
Not sure how relevant this might be, but we just encountered some weird issues after a user uploaded new versions to try fixing some pdf that has the problem of 0x0 no thumbnail.
Seppl2013
added a comment.
Jan 24 2026, 8:50 AM
2026-01-24 08:50:10 (UTC+0)
Comment Actions
I think this task should be split - one subtask for the PDF problem and one subtask for the DjVu and only the common ground to be discussed her.
Balajijagadesh
subscribed.
Mar 8 2026, 4:07 AM
2026-03-08 04:07:01 (UTC+0)
Comment Actions
This pdf file showed correctly after uploading. But recently it is showing 0 x 0 pixel.
Log In to Comment
Content licensed under Creative Commons Attribution-ShareAlike (CC BY-SA) 4.0 unless otherwise noted; code licensed under GNU General Public License (GPL) 2.0 or later and other open source licenses. By using this site, you agree to the Terms of Use, Privacy Policy, and Code of Conduct.
Wikimedia Foundation
Code of Conduct
Disclaimer
CC-BY-SA
GPL
Credits