Check if file is image python

Update

I also implemented the following solution in my Python script here on GitHub.

I also verified that damaged files (jpg) frequently are not 'broken' images, i.e., a damaged picture file sometimes remains a legitimate picture file: the original image is lost or altered, but you can still load it with no errors. File truncation, however, always causes errors.

End Update

You can use the Python Pillow (PIL) module, which supports most image formats, to check whether a file is a valid and intact image file.

If you also aim to detect broken images, @Nadia Alramli correctly suggests the im.verify() method, but this does not detect all possible image defects; e.g., im.verify() does not detect truncated images (which most viewers load with a greyed area).

Pillow is able to detect these types of defects too, but you have to apply an image manipulation or an image decode/recode in order to trigger the check. In the end, I suggest using this code:

from PIL import Image

try:
    im = Image.open(filename)
    im.verify()  # I also run verify; I don't know if it catches other types of defects
    im.close()   # reopening is necessary after verify()
    im = Image.open(filename)
    im.transpose(Image.FLIP_LEFT_RIGHT)  # forces a full decode of the pixel data
    im.close()
except Exception:
    # manage exceptions here
    pass

In case of image defects this code will raise an exception. Please consider that im.verify() is about 100 times faster than performing the image manipulation (and I think that flip is one of the cheaper transformations). With this code you can verify a set of images at about 10 MBytes/sec with standard Pillow or 40 MBytes/sec with the Pillow-SIMD module (modern 2.5 GHz x86_64 CPU).
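The two-pass check described above can be wrapped in a small helper. This is only a sketch: the name is_intact_image is hypothetical, and it uses im.load() as the step that forces a full decode instead of the flip used above, on the assumption that any full decode triggers the same truncation check.

```python
from PIL import Image


def is_intact_image(path):
    """Return True if Pillow can verify and then fully decode the file.

    Hypothetical helper: the name and the choice of load() as the
    decode trigger are assumptions made for this sketch.
    """
    try:
        # Pass 1: cheap structural check, no pixel decoding.
        with Image.open(path) as im:
            im.verify()
        # Pass 2: reopen (required after verify) and decode all pixel data.
        with Image.open(path) as im:
            im.load()
    except Exception:
        # Pillow raises several exception types for bad files
        # (OSError, SyntaxError, ...), so this sketch catches broadly.
        return False
    return True
```

Returning a boolean keeps the caller simple; if you need to know why a file failed, catch the exception yourself and log it instead.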

For other formats (xcf, ..) you can use the ImageMagick wrapper Wand; see the Wand documentation for usage and installation. The code is as follows:

import wand.image

im = wand.image.Image(filename=filename)
im.flip()  # flips in place, which forces the image data to be processed
im.close()

However, in my experiments Wand does not detect truncated images; I think it loads the missing parts as a greyed area without any warning.

I read that ImageMagick has an external command, identify, that could do the job, but I have not found a way to invoke it programmatically and I have not tested this route.
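One way to call an external command such as identify from Python is the standard subprocess module. The following is a minimal sketch, assuming ImageMagick is installed and that identify exits with a non-zero status for files it cannot read; the helper name identify_ok and its command parameter are hypothetical, and whether identify catches truncated images is not verified here.

```python
import shutil
import subprocess


def identify_ok(path, command="identify"):
    """Run ImageMagick's identify on a file and report its exit status.

    Returns True if the command exits with status 0, False otherwise,
    and None when the command is not installed. Hypothetical helper.
    """
    if shutil.which(command) is None:
        return None  # ImageMagick (or the given command) is not available
    result = subprocess.run(
        [command, path],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0
```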

I suggest always performing a preliminary check: verifying that the file size is not zero (or very small) is very cheap:

import os

statfile = os.stat(filename)
filesize = statfile.st_size
if filesize == 0:
    # manage here the 'faulty image' case
    pass
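The same preliminary check can be written as a reusable function that also covers the "very small" case. This is a sketch: the name passes_size_check and the min_bytes threshold are hypothetical, and a sensible threshold depends on the formats you expect.

```python
import os


def passes_size_check(path, min_bytes=1):
    """Return False for missing, empty, or suspiciously small files.

    min_bytes is an illustrative threshold, not a recommended value.
    """
    try:
        return os.path.getsize(path) >= min_bytes
    except OSError:
        # The file does not exist or cannot be accessed.
        return False
```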

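Python has many modules in its standard library. One that is often overlooked is imghdr, which lets you identify what image type is contained in a file, byte stream or path-like object. Note that imghdr was deprecated in Python 3.11 and removed in Python 3.13, so it is only available on older versions.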

imghdr can recognize the following image types:

rgb
gif
pbm
pgm
ppm
tiff
rast
xbm
jpeg / jpg
bmp
png
webp
exr

Here is how you would use imghdr to detect the image type of a file:

>>> import imghdr
>>> path = 'python.jpg'
>>> imghdr.what(path)
'jpeg'
>>> path = 'python.png'
>>> imghdr.what(path)
'png'

All you need to do is pass a path to imghdr.what(path) and it will tell you what it thinks the image type is.

An alternative method would be to use the Pillow package, which you can install with pip if you don't already have it.

Here is how you can use Pillow:

>>> from PIL import Image
>>> img = Image.open('/home/mdriscoll/Pictures/all_python.jpg')
>>> img.format
'JPEG'

This method is almost as easy as using imghdr. In this case, you need to create an Image object and then call its format attribute. Pillow supports more image types than imghdr, but the documentation doesn't really say if the format attribute will work for all those image types.

Anyway, I hope this helps you in identifying the image type of your files.


I am currently using PIL.

from PIL import Image
try:
    im = Image.open(filename)
    # do stuff
except IOError:
    # filename not an image file
    pass

However, while this sufficiently covers most cases, some image files, like xcf, svg and psd, are not being detected. Psd files throw an OverflowError exception.

Is there some way I could include them as well?

I have just found the built-in imghdr module. From the Python documentation:

The imghdr module determines the type
of image contained in a file or byte
stream.

This is how it works:

>>> import imghdr
>>> imghdr.what('/tmp/bass')
'gif'

Using a module is much better than reimplementing similar functionality.

In addition to what Brian is suggesting you could use PIL’s verify method to check if the file is broken.

im.verify()
Attempts to determine if the file is
broken, without actually decoding the
image data. If this method finds any
problems, it raises suitable
exceptions. This method only works on
a newly opened image; if the image has
already been loaded, the result is
undefined. Also, if you need to load
the image after using this method, you
must reopen the image file.

In addition to the PIL image check, you can also add a file name extension check like this:

filename.lower().endswith(('.png', '.jpg', '.jpeg', '.tiff', '.bmp', '.gif'))

Note that this only checks whether the file name has a valid image extension; it does not actually open the file to see if it is a valid image. That is why you also need to use PIL or one of the libraries suggested in the other answers.

Often the first few bytes of a file are a magic number that identifies its format. You could check for this in addition to your exception checking above.
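A magic-number check can be sketched with nothing but the standard library. The signature table below covers only a few common formats and is illustrative, not exhaustive; the names MAGIC_NUMBERS, sniff_image_type and sniff_image_file are hypothetical. A match only says that the file starts like an image, not that it is intact.

```python
# Leading byte signatures ("magic numbers") for a few common image formats.
MAGIC_NUMBERS = {
    b"\x89PNG\r\n\x1a\n": "png",
    b"\xff\xd8\xff": "jpeg",
    b"GIF87a": "gif",
    b"GIF89a": "gif",
    b"BM": "bmp",          # short signature, so false positives are possible
    b"II*\x00": "tiff",    # little-endian TIFF
    b"MM\x00*": "tiff",    # big-endian TIFF
}


def sniff_image_type(header):
    """Return a format name if the bytes start with a known signature, else None."""
    for signature, name in MAGIC_NUMBERS.items():
        if header.startswith(signature):
            return name
    # WebP is a RIFF container whose form type at offset 8 is "WEBP".
    if header[:4] == b"RIFF" and header[8:12] == b"WEBP":
        return "webp"
    return None


def sniff_image_file(path):
    """Read only the first 16 bytes of a file and sniff its type."""
    with open(path, "rb") as f:
        return sniff_image_type(f.read(16))
```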

One option is to use the filetype package.

Installation

python -m pip install filetype

Advantages

Fast: does its work by loading only the first few bytes of your image (a check on the magic number)
Supports different MIME types: images, videos, fonts, audio, archives.

Example

filetype >= 1.0.7

import filetype

filename = "/path/to/file.jpg"

if filetype.is_image(filename):
    print(f"{filename} is a valid image...")
elif filetype.is_video(filename):
    print(f"{filename} is a valid video...")
