Fix for issue #121 #129

ArjohnKampman · 2015-10-30T08:28:33Z

When an incorrect jpeg segment length is specified, try to recover by scanning for the next segment marker rather than bailing out with an exception.

…fied, try to recover by scanning for the next segment marker rather than bailing out with an exception

drewnoakes · 2015-10-31T12:03:01Z

Thanks for this. In general the library tries to avoid scanning in response to unexpected data as eventually you'll find something that matches even if it's not correct, after which all bets are off and it's not uncommon to start emitting imaginary junk-filled tags.

Some ideas to limit this issue are:

Only apply this scanning if you can deduce that the image was created by software known to have this problem (this happens a lot with camera makernote handling based upon the make and model)
Only scan a limited number of bytes to reduce the likelihood of finding a matching byte pair

However there is clearly a use case for this. I wonder whether it'd make sense to introduce a configuration option that sets the desired compatibility level. For cases where values are presented directly on a user interface, junk results are more of a problem.

Before merging this I'll think this through a little more and let any feedback gather. I'll also run this code over the image database to see whether it fixes more problems than it creates on a real-world data set. It certainly sounds like that's the case in your scenario.

If the library goes in this direction, then such permissive parsing should be enabled in several places.

ArjohnKampman · 2015-11-02T11:08:14Z

I generally appreciate being able to switch between a 'strict' and a 'lenient' mode for data parsing, but in the case of this error I think that most people will want the lenient mode. If this was an error in the data of one of the segments then the parser could recover from that by skipping to the next segment. But this particular error affects the structure of the data in the file and there is no other meta-structure to fall back to. So in this case, the strict mode would mean to stop processing completely and ignore everything else in the file. If the error occurs near the start of the file, you'll end up with little or no metadata at all.

Considering that the actual image data is often/always at the end of the file and that almost all viewers/tools are able to handle such broken data, a lenient parsing approach seems to be what is generally used for processing JPEG data. Some software will have better error recovery solutions than others though. Recording such parsing errors in a way like this is currently done in class Directory would be ideal, but I couldn't find a way to do this in JpegMetadataReader.

Fix for issue #121

…a-extractor#129

drewnoakes · 2016-03-21T10:29:35Z

Thanks. Coming back to this and running it over a lot of images, it seems pretty harmless and fixes a real world problem. Thanks for your PR, and your patience in getting it merged.

ArjohnKampman · 2016-03-21T11:54:15Z

Thanks for the merge. The last (maven) release is almost a year old, are you planning to do a new release some time soon?

[Issue drewnoakes#121] when an incorrect jpeg segment length is speci…

4bac919

…fied, try to recover by scanning for the next segment marker rather than bailing out with an exception

drewnoakes added a commit that referenced this pull request Mar 21, 2016

Merge pull request #129 from ArjohnKampman/master

8c14064

Fix for issue #121

drewnoakes merged commit 8c14064 into drewnoakes:master Mar 21, 2016

drewnoakes added a commit to drewnoakes/metadata-extractor-images that referenced this pull request Mar 21, 2016

Fix issue drewnoakes/metadata-extractor#121 via PR drewnoakes/metadat…

be70337

…a-extractor#129

drewnoakes mentioned this pull request Mar 21, 2016

JpegProcessingException: Expected JPEG segment start identifier 0xFF, not 0x0 #121

Closed

drewnoakes added a commit to drewnoakes/metadata-extractor-dotnet that referenced this pull request Mar 24, 2016

Port drewnoakes/metadata-extractor#129.

4489a50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix for issue #121 #129

Fix for issue #121 #129

Uh oh!

ArjohnKampman commented Oct 30, 2015

Uh oh!

drewnoakes commented Oct 31, 2015

Uh oh!

ArjohnKampman commented Nov 2, 2015

Uh oh!

drewnoakes commented Mar 21, 2016

Uh oh!

ArjohnKampman commented Mar 21, 2016

Uh oh!

Uh oh!

Fix for issue #121 #129

Fix for issue #121 #129

Uh oh!

Conversation

ArjohnKampman commented Oct 30, 2015

Uh oh!

drewnoakes commented Oct 31, 2015

Uh oh!

ArjohnKampman commented Nov 2, 2015

Uh oh!

drewnoakes commented Mar 21, 2016

Uh oh!

ArjohnKampman commented Mar 21, 2016

Uh oh!

Uh oh!