Slashdot Log In
Content-Aware Image Resizing
Posted by
kdawson
on Sat Aug 25, 2007 06:30 PM
from the got-a-nice-gui-too dept.
from the got-a-nice-gui-too dept.
An anonymous reader writes "At the SIGGRAPH 2007 conference in San Diego, two Israeli professors, Shai Avidan and Ariel Shamir, have demonstrated a new method to shrink images. The method is called 'Seam Carving for Content-Aware Image Resizing' (PDF paper here) and it figures out which parts of an image are less significant. This makes it possible to change the aspect ratio of an image without making the content look skewed or stretched out. There is a video demonstration up on YouTube."
Related Stories
Firehose:Content-aware image resizing by Anonymous Coward
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
Full
Abbreviated
Hidden
Loading ... Please wait.

The paper via ACM (Score:5, Informative)
Re:The paper via ACM (Score:5, Informative)
Re:The paper via ACM (Score:5, Informative)
Shrink image:
Step 1: Run an edge detection algorithm.
Step 2: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
Step 3: Remove pixels along that path.
Step 4: Repeat steps 2 and 3 as necessary.
Extend image:
Step 1: Run an edge detection algorithm.
Step 2: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
Step 3: Insert pixels along that path (interpolated from neighbors)
Step 4: Repeat steps 2 and 3 as necessary.
Remove objects:
Step 1: Run an edge detection algorithm.
Step 2: Mask object by giving its pixels low/negative energy values.
Step 3: Find minimal energy (least amount of edges crossed) path from top to bottom or left to right (graph-cut algorithm).
Step 4: Remove pixels along that path.
Step 5: Repeat steps 3 and 4 as necessary.
Re:The paper via ACM (Score:5, Insightful)
Step 6: Extend image to match original size using the previous extend image algorithm
(Of course, I leave the obligatory Profit step as an exercise for the reader).
Video is on youtube.... (Score:3, Informative)
Clicky [youtube.com]
Tm
Impressive (Score:2)
I Think You'll Find (Score:3, Insightful)
nice! (Score:5, Interesting)
Other than that though, that's pretty awesome... I'm sure there's more instances where it doesn't look right than what they showed, but it's definitely cool how well it works as it stands!
I can imagine it would be extremely useful for ex-boyfriends or ex-girlfriends; just load up all their photos of them and their ex, wave the magic eraser, and *boom* you don't have to delete all your old vacation shots
I wonder how well it would work for the porn industry too; nice automatic resizing of breasts without ruining the picture! Fetishists will be SO happy!
Re: (Score:2, Interesting)
Re: (Score:2)
Re:nice! (Score:5, Funny)
Better never get a partner then at all if you are going to hate the person once it doesn't work longer.
But then I'm a regular slashdot visitor and don't have any exs so what do I know.
Practical uses (Score:2, Funny)
Re: (Score:2)
I see your reduced breasts and raise you a 'Seam Carvied Content-Aware Resized Image' midget porn. Guess who Elizabeth Hurley looks like now.
Slightly Strange (Score:3, Interesting)
Re: (Score:2, Interesting)
According to the video, the added backgro
Re: (Score:2)
Re: (Score:2)
Re:Slightly Strange (Score:5, Insightful)
It's not perfect of course. I'm guessing that if you had a picture of two people next to each other, one with a solid colored shirt, and the other with a striped colored shirt, that the solid colored shirt guy would get skinner than the striped when shrinking, and the reverse when enlarging. However, it's a neat idea, and I look forward to reading the paper.
A picture speaks a thousand words... (Score:4, Insightful)
There are probably a few situations where the 'unimportant' bits of an image are still as relevant as the rest. Sports photos for instance - especially those played on grass - would not give you a true picture (literally) of what's going on in the scene.
This'd be good for reference photos - like the animals at the start of the YouTube video, but applications where precision and distance are required wouldn't benefit. Nice bit of work though and I reckon with some smart scaling embedded too (rather than its 'folding effect'), it'd cater for most image retargetting requirements.
Re: (Score:2)
Re: (Score:2)
There are circumstances where it makes sense to abridge (or retarget) and others where it makes more sense to simply rescale. Since this appears to allow the content provider to choose the method that will
Re:A picture speaks a thousand words... (Score:5, Informative)
he uic bownfoxjumed verthelaz yelowdog
You get:
Th qik brwn fx jmpd ovr th lzy ylo dog
Which reduces the total size by the same amount, but retains more information than treating every bit of information the same.
Re:A picture speaks a thousand words... (Score:5, Interesting)
if you have 3 people in a picture and you crop it down to 2, you've erased a person, but you haven't changed who is seated next to whom. if you use this method and the middle person is erased, you make it appear as though the outer two people were in fact seated next to each other when they weren't.
we are used to the idea that a picture can be cropped (mentally considering what might be just outside the frame). We aren't yet used to the concept that the photo has effectively been cut and pasted together to create new relationships between the objects in the photo (though of course photoshop is getting us there).
to continue your analogy, if we take:
the quick brown fox jumped over the lazy dog
and drop letters, we can create:
the cow jumped over the dog
whereas "cropping" might let us say:
the quick brown fox jumped
I think it's clear that one of these is more misleading than the other, though in both cases you're just removing information. (in one case, some of that information happens to be spaces between letters/words)
Re: (Score:3, Insightful)
Re: (Score:3, Insightful)
There are probably a few situations where the 'unimportant' bits of an image are still as relevant as the rest. Sports photos for instance - especially those played on grass - would not give you a true picture (literally) of what's going on in the scene.
DP Approach (Score:4, Interesting)
or entropy of the background is as great as the foreground. Also the paper doesn't go into
too much details about the dynamic programming approach they used to find the path of least
energy, I guess that aspect of it is patentable. Another thing they could investigate is the
use of diagonal seams instead of just staggered vertical and horizontal seams.
All in all a very interesting read.
Re: (Score:3, Informative)
Also the paper doesn't go into too much details about the dynamic programming approach they used to find the path of least energy, I guess that aspect of it is patentable.
Not so much patentable, as "Easy enough for the reader to implement that it deser
Prior art (Score:2, Informative)
Before [wikipedia.org]
After [wikipedia.org]
Insignificant person removed.
Re: (Score:3, Informative)
Whao (Score:5, Funny)
Gimp! (Score:5, Interesting)
Paranoia! It's not just for Gimps (Score:2, Insightful)
Is that check going to cover the removal of their paper from above and the ACM archives, let alone OUR archives?
I can see the spam now (Score:3, Funny)
Re: (Score:2)
Open source alternative via the GIMP:
- use the "magic wand" tool to select your "magic wand tool"
- "convert selection to path"
- "stroke path"
Feel free to experiment by repeateDoes Anyone Find It Ironic (Score:5, Funny)
Finally! (Score:2)
(Yes, I know, this thread is worthless without pictures)
My Implementation (Score:5, Interesting)
Ariel Shamir (Score:3, Informative)
some code (Score:4, Interesting)
http://rafb.net/p/jinioy45.html [rafb.net]
(yeah my coding sucks but it produces awesome results and I reversed engineered the algorithm from youtube so please grovel...)
I'll improve it soon to remove an arbitrary number of line, horizontally or vertically
- no recalculation of gradient, only the gradient near the line needs to be recomputed
- precomputes a file that store the order of the pixel needing to be removed
I need help with something though, I understand how the algorithm can precompute a file which says in which order pixel should be removed, but I don't see how this can work in *both* direction. Suppose you want to reduce vertically and horizontally at the same time, the horizontal change should completely break the precomputed vertical changes. How would you handle that?
Re: (Score:3, Interesting)
original
http://img96.imageshack.us/my.php?image=testxq4.jp g [imageshack.us]
somewhat reduced
http://img361.imageshack.us/my.php?image=outew8.pn g [imageshack.us]
very reduced
http://img484.imageshack.us/my.php?image=outas2.pn g [imageshack.us]
Re: (Score:3, Funny)
Re:I For One (Score:4, Insightful)
I'm really impressed. Again, maybe not too hard to implement at first, but probably damn hard to get working perfectly, and I might just be ignorant (and I'm entitled too, it's far from my field of work), but I've not seen anyone doing it before.
Re:Let us be wholly thankful... (Score:4, Funny)
hehe... gaping... deep deep... rectum... i mean rectify... hehe
i need to get some sleep
Re: (Score:2)
"This technology could render very visually-convincing (but not computer/analytically convincing) image censorship or alteration. I am strongly reminded of this example of photo-editing from the 1940s:
http://www.newseum.org/berlinwall/commissar_vanish [newseum.org]
Re: (Score:2)
Re: (Score:3, Insightful)
Re:Not ready for Prime Time (Score:5, Insightful)
It has nothing to do with edge detection. The algorithm simply detects paths of minimal gradient which lead from one side of the image to the opposite side. This can be used to produce a "pretty picture" which shows the edges -- but this is merely fallout.
They showed what I thought were several realistic photos with complex backgrounds, and the algorithm did well overall, except on structures where people are closely attuned to exact detail -- such as human faces. If we weren't innately wired to process faces in incredible detail, we wouldn't even notice the distortion.
So it's not perfect. Can you show me something in this world that is? And I don't think there has been any mention of "prime time" application, whatever that means.
Re: (Score:3, Funny)
Re:Great - We can do this, but should we? (Score:5, Insightful)
By your reasoning
Cars can be used by criminals to travel faster.
A knife can be used to kill
Electricity can be used to kill
Computers can be used by the govt to collect more information abt us effectively
Is that really what we want?
see the flaw in the logic?
Re: (Score:3, Funny)