Download Report

This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
1
Blind Image Watermarking Using a Sample
Projection Approach
Mohammad Ali Akhaee*, Member, IEEE, Sayed Mohammad Ebrahim Sahraeian, Student Member, IEEE, and
Craig Jin, Member, IEEE
Abstract—This paper presents a robust image watermarking
scheme based on a sample projection approach. While we consider the human visual system in our watermarking algorithm,
we use the low frequency components of image blocks for data
hiding to obtain high robustness against attacks. We use four
samples of the approximation coefficients of the image blocks
to construct a line segment in the 2-D space. The slope of this
line segment, which is invariant to the gain factor, is employed
for watermarking purpose. We embed the watermarking code
by projecting the line segment on some specific lines according
to message bits. To design a maximum likelihood decoder,
we compute the distribution of the slope of the embedding
line segment for Gaussian samples. The performance of the
proposed technique is analytically investigated and verified via
several simulations. Experimental results confirm the validity
of our model and its high robustness against common attacks
in comparison with similar watermarking techniques that are
invariant to the gain attack.
Index Terms—Image watermarking, Maximum Likelihood detector, sample projection, gain attack.
I. I NTRODUCTION
Digital watermarking embeds information within a digital
work as a part of the media. Watermarking techniques falls into
three categories of robust, semi fragile, and fragile methods
according to their specific applications [1]–[3]. Robust watermarking mainly serves for identification purposes while the
fragile and semi fragile watermarking are usually employed
in authentication applications. Since a good watermarking
scheme should always be able to deal with some kinds of
attacks, studies in the watermarking research area mostly target
robust watermarking problems.
Several robust watermarking techniques have been proposed
so far. Cox et al. [4] have proposed an additive watermarking
approach based on spread spectrum concept which remains
highly robust against noise and cropping attacks. Several other
studies have improved this approach further [5]–[10]. Based
on this observation that boosting the watermarking power
increases the barrier against attacks, most of the effective
watremarking schemes try to match the characteristics of
the watermark to those of the image asset. Multiplicative
Copyright (c) 2010 IEEE. Personal use of this material is permitted.
However, permission to use this material for any other purposes must be
obtained from the IEEE by sending a request to [email protected].
M. A. Akhaee and Craig Jin are with the Computing and Audio Research Laboratory (CARLAB), Department of Electrical and Information
Engineering, University of Sydney, NSW, Australia, 2006 (e-mail: akhaee,
[email protected]).
S.M.E. Sahraeian is with the Genomic Signal Processing (GSP) Lab,
Department of Electrical and Computer Engineering, Texas A&M University,
College Station, TX 77843, USA (e-mail: [email protected]).
watermarking, as an example, has been introduced in [11] and
has been widely studied later on using local optimum decoders
in multiresolutional transform domains such as wavelet and
contourlet domains [12]–[16]. Besides, a universal optimal
detector for scaling based watermarking schemes is presented
in [17]. These schemes are highly robust against noise and
compression attacks. To satisfy robustness against geometric
attacks and reduce the watermark synchronization problem,
Tang and Hang [18] proposed a watermarking scheme which
employs a feature extraction and image normalization approach. Based on log polar mapping (LPM) and phase correlation, Zheng et al. [19] proposed an image watermarking
technique which embeds the watermark into the LPMs of the
image Fourier magnitude spectrum. This scheme is invariant
to rotation and remains comparatively robust to scaling attack.
One of the common but effective attacks in watermarking
systems is volumetric distortions ( i.e., any kind of amplitude scaling or gamma compensation). For image signals, it
may happen due to the scanning process where light is not
distributed uniformly over the paper. This simple attack is the
main drawback of most recent studies like [12], [14]–[16] and
even the quantization index modulation (QIM) algorithm [20]
which has attained great popularity due to its lossless performance with lattice-based codebooks. Three types of solutions
can tackle this problem, especially for the QIM method:
1) adopting auxiliary pilots through the watermarked signal
known at both the encoder and decoder [21]; 2) using spherical
codewords [22] with correlator decoders [23], or using angle
QIM (AQIM) [24], [25]; iii) introducing a domain in which the
embedding process is invariant to the gain attack [26]. Some
improvement/generalization on this scheme which is referred
to as rational dither modulation (RDM) are proposed in [27]–
[29].
The first solution against the gain attack decreases the
security of the algorithm, since the malicious attacker can
change either the watermark or the pilot signals. Besides, pilots
are deterministic objects in the main signal and can be easily
detected. Although the second and third approach keep the
security of the QIM algorithm, they cause high computational
cost. Besides, the low robustness of AQIM against additive
white Gaussian noise (AWGN) and the high peak to average
power ratio (PAPR) of RDM algorithm which happens due
to its momentarily large quantization step size are the main
drawbacks that should be addressed. Thus, no approach has
been presented so far that both proposes an optimal decoder
and remains invariant to the gain attack.
In this paper, we propose a novel gain invariant watermark-
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
2
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
ing scheme based on a sample projection scheme. Embedding
the watermark bits into the approximation coefficients of
the image blocks makes the algorithm highly robust against
noise and compression attacks. Any possible selection of four
approximation coefficients, that may be selected using a secret
key, constructs a line segment whose slope is considered for
data hiding. We embed the watermark bits by projecting the
line segment on some specific coding lines while preserving
its center of mass. In this way, the slope of the line segment
carries the watermark information while the distortion imposed
to its constructive samples is minimal. Since our embedding
process is linear, it can be denoted by multiplication of specific
embedding matrices. To implement the maximum likelihood
(ML) detector for data extraction, we should calculate the
distribution of the slope of the line segment. To this aim,
we consider the fact that the approximation coefficients of
most of the image blocks can be well-modeled by Gaussian distribution [15]. The performance of our watermarking
method is analytically investigated and verified via simulation
on artificial signal. Several experiments on sample images
confirm the effectiveness of the proposed scheme in resisting
against common noise attacks.
The rest of the paper is organized as follows. In Section
II, we describe the model of the system. The watermark
embedding and decoding process are introduced in Section
III. Section IV analyzes and evaluates the performance of the
proposed scheme. Experimental results are demonstrated in
Section V, and Section VI concludes the paper.
II. S YSTEM M ODELING
In this section, we first introduce the model considered
for our watermarking algorithm. To this aim, we calculate
the distribution of the watermarking variable. We assume
to have four samples of an independently and identically
distributed (i.i.d) Gaussian random variable as the host signal.
We show this signal as u = [u1 , u2 , u3 , u4 ] with the Gaussian
distribution of N (0, σu2 ). These four samples form two points
p = [u1 , u2 ] and q = [u3 , u4 ] in the 2-D space. We employ
2
c = uu43 −u
−u1 , the slope of the line going through these two points
as our watermarking variable. We can see that the numerator
and the denominator of c are distributed as N (0, 2σu2 ).
For independent Gaussian variables a ∼ N (0, σa2 ) and b ∼
N (0, σb2 ), their ratio c = ab which is the ratio of two zero-mean
independent Gaussian variables is Cauchy as:
σ
fC (c) =
a
1
σb
,
π c2 + ( σσab )2
(1)
As shown in Appendix A, for the case that the two Gaussian
variables a and b are correlated with the correlation coefficient
r=
E[(a − µa )(b − µb )]
,
σa σb
the probability density function (PDF) of the variable c is given
as:
√
σa σb 1 − r 2
1
,
(2)
fC (c) =
a 2
2
2
π σb2 (c − rσ
σb ) + σa (1 − r )
and its cumulative distribution function (CDF), FC (c), can be
computed as:
FC (c) =
1
1
σb c − rσa
+ tan−1 √
.
2 π
σa 1 − r 2
(3)
As we will see in the next sections, we use the probabilistic
characteristics of the variable c, the slope of the mentioned
line segment, to design an optimal decoder and later analyze
its performance.
III. P ROPOSED M ETHOD
In this section, we introduce our blind watermarking
scheme. As discussed in the previous section, we assume the
host signal as a four-sample i.i.d. Gaussian random signal.
In practical applications, these four samples can come from
approximation coefficients of the image blocks which satisfy
our i.i.d. Gaussian assumption according to KolmogorovSmirnov test results.
A. Watermark embedding
Let us represent the four samples of the host signal as
u = [u1 , u2 , u3 , u4 ]. We model these four samples as two
points p = [u1 , u2 ] and q = [u3 , u4 ] in the 2-D space. Fig.
1(a) illustrates these two points as well as the line segment
connecting them. We denote the slope of this line segment as
3 u2 +u4
θ. The center of this (p, q) line is located at [ u1 +u
, 2 ].
2
By translating the center of this line to the origin, we reach
to the points pc and qc , where
( ′ ) ( u1 −u3 )
( ′ ) ( u3 −u1 )
u1
u3
2
2
pc =
= u2 −u4 , qc =
= u4 −u
. (4)
2
u′2
u′4
2
2
Now, to embed the M -ary watermark code, we project this
line segment to one of the M coding lines (L1 to LM if θ > 0
or their counterparts L′1 to L′M if θ < 0), shown in Fig. 1
(b), depending on the watermark code. We use projection and
keep the center of the line segment to impose less distortion
and thereby cause more invisibility of the watermarked signal.
Fig. 1 (a) illustrates the projection steps in details. We call the
resulted points in the mapped line as p⊥ and q⊥ . The slope of
the ith coding line is αi (for Li ) if θ, the slope of the primary
line connecting (p, q), is positive and −αi (for L′i ) otherwise,
where αi is given by
π M − 2i + 1
−
β).
(5)
4
2
Here, β is the angle between two consecutive coding lines.
The position of p⊥ for the case that the slope of the mapped
line is k, can be computed using the intersection of two
following lines:
{
y = kx
(6)
y − u′2 = − k1 (x − u′1 ).
αi = tan(
With a similar scheme, we can find q⊥ . The obtained points
are as follows:
( u′ +ku′ )
( u′ +ku′ )
1
p⊥ =
2
k2 +1
ku′1 +k2 u′2
k2 +1
3
,
q⊥ =
4
k2 +1
ku′3 +k2 u′4
k2 +1
.
(7)
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
IMAGE WATERMARKING
3


 
u′′1
u1
u′′2 
 u2 
 ′′  = T(k)   ,
 u3 
 u3 
′′
u4
u4
as:
(9)
using the transfer matrix T(k), given as:
 2
k +2
 k
1

T(k) = 2
2k + 2  k 2
−k

k
k2
−k
2k 2 + 1
−k
1 
 . (10)
−k,
k2 + 2
k 
1
k
2k 2 + 1
Therefore, the watermarking embedding process can be
figured as implementation of (9) where k is defined based
on the watermark bit and θ, the slope of the primary (p, q)
line, as:
{
αi
when θ ≥ 0
k=
(11)
−αi when θ < 0.
(a)
In Fig. 1(b), we can see that for each coding word we have
two lines, a solid one (Li ) for positive θ and a dashed one
(L′i ) for negative θ.
In this way, we obtain the watermarked signal u′′ =
′′
[u1 , u′′2 , u′′3 , u′′4 ].
B. Watermark decoding
(b)
Fig. 1.
(a) Steps of the proposed watermarking embedding scheme:
translation to the origin, projection, and translation back. (b) Coding space.
Solid lines: coding lines with positive slopes (L1 · · · LM ); dashed lines:
the counterpart coding lines with negative slopes (L′1 · · · L′M ); dotted lines:
Decision boundaries.
As the final step, we just need to translate back the mapped
line to its original center to obtain points pw = [u′′1 , u′′2 ] and
qw = [u′′3 , u′′4 ] as:
)
( ′′ ) ( u′1 +ku′2
u1 +u3
u1
k2 +1 +
2
pw =
= ku′ +k2 u′
1
2
4
u′′2
+ u2 +u
k2 +1
2
)
(
( ′′ )
u′3 +ku′4
u1 +u3
u3
k2 +1 +
2
.
qw =
= ku′ +k2 u′
3
4
4
u′′4
+ u2 +u
k2 +1
2
To extract the hidden bits, an optimum decoder is implemented using M-Hypothesis test as follows. We denote the
received signal as y = u′′ + n, where y = [y1 , y2 , y3 , y4 ]
represents the watermarked signal contaminated with zero
mean AWGN n ∼ N (0, σn2 ).
For the i.i.d. Gaussian host signal u, the watermarked signal
u′′ is also Gaussian as it is obtained by a linear transformation
through the matrix T(k). Thus, the received samples y are
Gaussian with the variance of σy2 = σu′′2 + σn2 . Now, we can
use the model given in Section II. We depict the four samples
in y as two points pr = [y1 , y2 ] and qr = [y3 , y4 ] in the 2-D
space and calculate the slope of the line segment connecting
them as:
y4 − y2
u′′ − u′′2 + n4 − n2
c=
= 4′′
.
(12)
y3 − y1
u3 − u′′1 + n3 − n1
As we fixed the slope of the line connecting pw = [u′′1 , u′′2 ]
and qw = [u′′3 , u′′4 ] to k, defined in (11), we have:
k=
u′′4 − u′′2
⇒ u′′4 − u′′2 = k(u′′3 − u′′1 ).
u′′3 − u′′1
(13)
Therefore, by defining v = u′′3 − u′′1 , we can rewrite (12) as:
c=
kv + n4 − n2
.
v + n3 − n1
(14)
Using (9) and (10), we have
1
(−u1 − ku2 + u3 + ku4 ).
+1
Thus, v is a Gaussian random variable with mean µv = 0
2
2
and variance σv2 = 1+k
2 σu . Consequently, if we show the
numerator and the denominator of c in (14) respectively as a
and b, we will have a ∼ N (0, k 2 σv2 +2σn2 ) and b ∼ N (0, σv2 +
v = u′′3 − u′′1 =
(8)
By inserting (4) in (8) we can summarize the whole procedure
k2
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
4
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
2σn2 ). The correlation coefficient between a and b can also be
computed as:
r=√
kσv2
(k 2 σv2 + 2σn2 )(σv2 + 2σn2 )
.
can be written as:
P + (e|i) =P (tan (−ϕi +
β
β
) < c|i) + P (c < tan (−ϕi − )|i)
2
2
(
(
β )
β )
=FC|i tan (ϕi − ) − FC|i tan (−ϕi + )
2
2
(
(
β )
β )
+1 − FC|i tan (ϕi + ) + FC|i tan (−ϕi − ) ,
2
2
(21)
(15)
+P (tan (ϕi +
Now we have c = ab , where a and b are two correlated
zero mean Gaussian random variables. Thus, the distribution
function of c can be given as in (2).
Having the distribution of the slope function, we can simply
use the Maximum Likelihood detector as:
i∗ = arg max fC (c|i),
(16)
i∈1,··· ,M
where fC (c|i) represents the distribution function for the ith
coding line. Here, without loss of generality, we suppose that
the received c is positive. The same argument also holds for
the negative case. Thus, the conditional distribution function
is
√
σ
σ
1 − r|i2
a
b
|i
|i
1
fC (c|i) =
,
(17)
π σ 2 (c − r|i σa|i )2 + σa2 (1 − r2 )
b|i
σb|i
|i
where, c is the slope of the detected line, and FC|i (.) is its
CDF if the embedded line is Li . Substituting FC (c) from (3)
and defining
β
π M − 2i
= −
β,
2
4
2
β
π M − 2(i − 1)
= ϕi − = −
β,
2
4
2
ϕ′i
= ϕi +
ϕ′i−1
and di =
σb|i
σa|i
P + (e|i)
=
di tan ϕ′i−1 − r|i
1
√
tan−1
π
1 − r2
|i
where,
−
2
2αi2 2
σ + 2σn2 , σb2|i =
σ 2 + 2σn2 , (18)
1 + αi2 u
1 + αi2 u
2αi
σ2
αi σu2
1+α2i u
=√ 2
r|i =
.
σa|i σb|i
(αi σu2 + (1 + αi2 )σn2 )(σu2 + (1 + αi2 )σn2 )
(19)
σa2|i =
′
−1 di tan(−ϕi−1 )
1
tan
π
+ 1−
+
√
1 − r|i2
Pe+
M
1 ∑ +
=
P (e|i),
M i=1
− r|i
di tan(ϕ′i ) − r|i
1
√
tan−1
π
1 − r|i2
di tan(−ϕ′i ) − r|i
1
√
,
tan−1
π
1 − r2
(23)
|i
where σa|i , σb|i , and r|i are defined as in (18) and (19).
Moreover, by defining
Hi (ϕ) =
di tan ϕ − r|i
di tan(ϕ) + r|i
1
1
√
+ tan−1
,
tan−1 √
2
π
π
1 − r|i
1 − r|i2
we can simplify (23) as:
P + (e|i) = 1 + Hi (ϕ′i−1 ) − Hi (ϕ′i ).
(24)
For i = 1 and i = M , we can see that:
β
β
P + (e|1) = 1 − P (tan (−ϕ1 − ) < c < tan (ϕ1 + )|1)
2
2
= 1 − H1 (ϕ′1 ),
(25)
IV. P ERFORMANCE E VALUATION
Here, we analytically study the error probability of the
proposed watermarking scheme in the presence of AWGN.
First, we compute Pe+ , the error probability when the slope
of embedding line (p, q) is positive, given as:
(22)
, we have:
|i
These parameters are computed by substituting k as defined
in (11) in definitions of variances of a, b and their correlation
coefficient r.
As shown in Appendix B, the decision boundary between
each two coding line is their bisector. Considering this ML
decision for all pairs of coding lines, we deduce that fC (c|i)
is maximum if the angle of the received line with slope c
falls between the bisectors of ith coding line and its two
neighboring coding lines. Therefore, we obtain the decision
boundaries as shown in Fig. 1(b) with dotted lines. It is
notable that the deduced decoder is independent of β, the angle
between two consecutive coding lines.
β
β
) < c < tan (ϕi − )|i)
2
2
P + (e|M )
=
1 − P (c > tan (ϕM −
− P (c < tan (−ϕM +
β
)|M )
2
β
)|M ) = HM (ϕ′M −1 ),
2
(26)
Substituting (24),(25), and (26) in (20),we have:
=
M
M −1
1 ∑
M −1
1 ∑
+
Hi (ϕ′i−1 ) −
Hi (ϕ′i )
M
M i=2
M i=1
=
M −1
1 ∑
M −1
+
[Hi+1 (ϕ′i ) − Hi (ϕ′i )],
M
M i=1
(20)
where P + (e|i) represents the error probability when the
embedding line is Li , the ith positive slope coding line.
According to (16), for i = 2 · · · M − 1, the error probability
Pe+
(27)
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
IMAGE WATERMARKING
5
(a)
(b)
(c)
Fig. 2. Comparison between the theoretical error probability and experimental one for a Gaussian random variable with σu = 40. (a) one bit embedding
(M = 2), (b) two bits embedding (M = 4), and (c) three bits embedding (M = 8).
Fig. 3. Comparing the probability of error with DWR of 19 dB for different
bit rates.
With a similar argument we can see that the error probability
for the negative slope embedding line is the same as Pe+ ; that
is Pe− = Pe+ . Besides, it is easy to show that the positive
and the negative embedding are equally probable in general;
therefore, we can write the error probability of the decoder as
Pe =
1 + 1 −
P + Pe = Pe+ .
2 e
2
(28)
Here, we obtained a closed form solution for the error
probability of the decoder. In Fig. 2 we compared this theoretical error probability with the experimental case of Gaussian
random variable with σu = 40 for different capacities of
one bit (M = 2), two bits (M = 4), and three bits
(M = 8). In this figure, the probability of error is shown
versus various noise strengths, measured by the watermark-tonoise ratio (WNR), which is the ratio between the embedding
distortion and the attacking noise distortion. As, we can see
the theoretical and experimental results match perfectly.
Fig. 4. Comparing the probability of error for the proposed method and the
AQIM method [25] with DWRs of 19 dB and 22 dB.
V. E XPERIMENTAL R ESULTS
We performed several experiments to test the proposed algorithm and evaluate its performance against different attacks.
A. AWGN attack on the artificial signal
In the first experiment, to validate our method, we present
the performance results for an artificial Gaussian signal under
AWGN attack. We conducted the simulations for various bit
rates and noise levels. The strength of the watermarking in
terms of the document-to-watermark ratio (DWR) is fixed to
19dB and bit rates are set to one bit (M = 2), two bits (M =
4), and three bits (M = 8) per each four samples. The results
versus various WNRs are given in Fig. 3. As expected, the
performance degrades as M increases. However, we observe
that the proposed scheme is practically applicable for M = 2,
M = 4, and M = 8. Nevertheless, for further experiments we
just use M = 2 and M = 4.
In Fig. 4, we also compare the robustness for M = 2 with
an AQIM based method [25] in similar situations for DWRs of
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
6
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
Fig. 5. Comparing the probability of error for the proposed method and the
hyperbolic RDM scheme [28] under power-law attack.
19 dB and 22 dB. As we can see, the proposed method which
employs an optimal decoder has lower error probability values
even for very low WNRs.
B. Power-law attack on the artificial signal
Here, we show the robustness of the proposed watermarking
algorithm against the power-law attack which can model the
gamma correction attack (as a non-linear gain attack) that is
a common process on image and video signals [28]. If we
represent the watermarked signal with u′′ , the AWGN signal
with n, and the attacked signal with z, the power-law attack
is defined as z = λ(u′′ + n)γ , where λ is the gain factor and
γ is the exponential parameter.
Fig. 5 shows the performance of our sample projection
approach against this attack for λ = 0.7 and γ = 1.2 along
with the results obtained by [28] with different value of L
in similar conditions. Here, L determines the memory of the
system. As seen, we have high robustness against the powerlaw attack which is an important factor of our algorithm caused
form the invariance of the proposed sample projection scheme
to the gain attack.
C. Tests on the image signal
Then, we conduct several experiments to verify the performance of the proposed technique in the real application
of image watermarking. Throughout our experiments, we use
the Daubechies length-8 Symslet filters with two levels of
decomposition to compute the 2-D DWT. The watermark data
is embedded in the second level approximation coefficients
of each block. The results are obtained by averaging over 100
runs with 100 different pseudorandom binary sequences as the
watermark bits.
For this study, we use four natural images of size 512×512:
Plane, Pirate, Boat, and Bridge. The original test images and
their watermarked versions using the proposed method with
16 × 16 block size and 128 bits message length are shown in
Fig. 6. Original (top) and watermarked (bottom) test images; Left-right:
Plane, Pirate, Boat, and Bridge.
Fig. 6. Besides, β, the angle between two consecutive coding
lines used in (11) is set to β = 40o for M = 2 and β = 20o
for M = 4. These values are hand-optimized to achieve the
highest robustness while keep the watermark almost imperceptible. As we can see, the watermark invisibility is perfectly
satisfied. The mean peak-signal-to-noise-ratio (PSNR) of the
watermarked images are 40.39dB, 40.44dB, 39.89, and 40.85,
respectively. To quantitatively evaluate the imperceptibility of
the watermarked images, we use the mean structural similarity
index (MSSIM) [30] which is highly matched with the human
visual system and effectively captures local errors in the spatial
domain. The respective MSSIM values for four test images
are 0.9983, 0.9984, 0.9982, and 0.9984 which confirms the
imperceptibility of the proposed algorithm.
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
IMAGE WATERMARKING
7
TABLE III
BER(%) R ESULTS OF EXTRACTED WATERMARK
ATTACK .
(a)
0
0.05
0.1
w
0.15
4.00
8.00
16.00
32.00
0.00
0.00
0.00
0.00
0.83
2.24
0.00
0.00
2.17
5.62
0.00
0.00
3.93
8.02
0.00
0.00
0.2
0.25
5.88
10.71
0.00
0.00
7.54
13.43
0.00
0.00
TABLE IV
E FFECT OF BLOCK SIZE AND MESSAGE LENGTH ON THE WATERMARK
ROBUSTNESS (BER (%)).
(b)
Fig. 7. AWGN attack for various noise variances, (a) for M = 2 (binary
case) and (b) for M = 4 (2-bits).
(a)
Segment
Size
UNDER VARIABLE GAIN
Blk.
Size
Bits
MSSIM
JPEG
20%
AWGN
15
Gaus.
3×3
Crop
5%
Scale
0.75
Rot.
-5
8
8
8
8
16
16
16
32
32
128
256
512
1024
128
256
512
128
256
0.9995
0.9989
0.9980
0.9961
0.9983
0.9965
0.9932
0.9906
0.9850
10.92
11.38
13.60
16.93
1.92
2.58
4.96
1.09
6.27
9.14
9.89
12.22
14.82
4.27
4.81
7.99
1.70
7.13
10.61
11.25
12.67
12.71
1.18
2.31
2.33
0.12
0.48
4.14
4.38
4.87
5.24
4.29
4.71
5.89
9.49
11.35
3.81
3.57
4.36
4.09
0.19
0.12
0.07
0.00
0.00
7.95
8.07
8.54
9.09
2.60
2.75
3.56
4.90
5.44
(b)
Fig. 8. JPEG compression attack for various quality factors, (a) for M = 2
(binary case) and (b) for M = 4 (2-bits).
Image
Median
Filtering
3×3
M =2
Plane
Pirate
Boat
Bridge
1.50
4.05
3.34
3.80
0.38
1.58
1.74
1.00
1.15
2.38
2.43
2.66
1.15
2.82
2.84
2.86
M =4
TABLE I
BER(%) R ESULTS OF EXTRACTED WATERMARK UNDER MEDIAN AND
G AUSSIAN FILTERING ATTACKS .
Gaussian
Filtering
3×3 5×5 7×7
Plane
Pirate
Boat
Bridge
6.57
7.38
7.02
6.93
4.61
3.95
6.05
4.59
5.52
5.66
9.22
6.87
5.79
5.70
9.38
7.16
1) AWGN attack: As the first attack, we investigate the
effect of AWGN to the proposed watermarking scheme. Fig.
7 shows the bit error rate (BER) of the proposed method for
various images versus different noise powers. As we see, the
method has a great resistance against noise attack for both
M = 2 and M = 4.
2) JPEG attack: Secondly, the proposed technique is tested
against JPEG compression with different quality factors. As
demonstrated in Fig 8, since low frequency components of the
image blocks are used for watermarking, the proposed method
is highly robust against JPEG with different quality factor up
to 10%.
3) Filtering attack: Table I shows the BER results for
median filtering, and Gaussian low-pass filtering attacks for
different test images. It can be seen that the proposed scheme
is highly robust against these attacks.
4) Editing attack: In Table II, we also report the performance of our algorithm under several image editing attacks
such as cropping, scaling, and rotation. As we can see in this
table, the proposed scheme remains highly robust against these
attacks as well.
5) Gain attack: To show the robustness of the proposed
algorithm to constant and variable gain attack, we conduct
an experiment on the four test images. To model the variable
gain attack, we segment each image to small non-overlapping
blocks and multiply all the pixels in each block by the gain
factor of 1 + wδ where δ is a random variable uniformly
distributed between 0 and 1, and w is a constant determining
the strength of the attack . Thus, over each block, the gain
factor remains constant, while for each block we use different
gain factors. This modeling can realize the variable gain attack
and fading attack happens through scanning process. Results
for different segmenting block sizes and strength factors ws
are shown in Table III ( Results are averaged over the four test
images for M = 2). As we see, for block sizes of 16 and 32
our algorithm is invariant to the gain attack. This is because
our watermarking approach embeds information in blocks of
size 16; hence, the gain will be constant in each embedding
block, and thud in the approximation band. That is, duo to the
invariance of the sample projection scheme to the constant
gain factor, we will have 0% BER for segmentation of 16
and 32. For segments of size 8 and 4 we will have variable
gain factors in each embedding block. However, we can see
that even for such variable gains, our algorithm shows high
robustness.
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
8
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
TABLE II
BER(%) R ESULTS OF EXTRACTED WATERMARK UNDER CROPPING , SCALING , AND ROTATION
M =2
Image
Plane
Pirate
Boat
Bridge
Cropping
5%
10%
3.95
4.42
3.57
5.23
5.71
5.71
6.20
11.34
ATTACKS .
M =4
0.5
Scaling
0.75
1.5
-10
0.38
0.70
0.72
1.42
0.00
0.38
0.00
0.37
0.00
0.00
0.00
0.00
2.23
5.06
2.72
6.67
Rotation
-5
5
1.45
3.77
1.54
3.64
0.69
2.73
0.35
4.75
10
1.09
3.11
1.12
5.87
Cropping
5%
10%
4.00
5.57
4.68
4.80
5.97
5.72
6.84
10.94
0.5
Scaling
0.75
1.5
-10
2.54
4.08
4.25
4.74
0.00
0.60
0.48
0.76
0.00
0.81
0.00
0.24
5.16
6.30
7.53
5.50
Rotation
-5
5
3.83
5.31
4.53
4.70
10
2.37
4.23
2.58
3.37
2.26
4.45
3.20
5.62
TABLE V
C OMPARISON BETWEEN OUR WATERMARKING METHOD AND [24] : BER
(%) UNDER AWGN ATTACK .
σn
4
Method
[24]
Proposed
1
2
3
1.00
0.00
2.00
0.00
3.00
0.12
4.00
0.12
5
6
7
15.00
0.43
25.00
0.85
44.00
1.05
TABLE VI
C OMPARISON BETWEEN OUR WATERMARKING METHOD AND WANG ’ S
METHOD [31] : BER (%) UNDER M EDIAN F ILTERING ATTACK WITH
WINDOW SIZE 3 × 3.
Method
Barbara
Wang [31]
Proposed
Fig. 9.
24.95
7.00
Image
Baboon Peppers
31.65
9.88
29.35
3.25
Goldhill
25.60
1.65
Capacity of the proposed method for BER less than 5%.
6) Block size and bit rate effect: In this experiment, we
study the effect of changing the block size and message
length on the performance of the proposed algorithm. To this
aim, we vary the block size from 8 to 32, and the number
of blocks engaged, that is the message length, from 128 to
1024. The robustness for M = 2, averaged over the four test
images, against set of attacks as well as the MSSIM value,
measure of imperceptibility, is demonstrated in Table IV. Our
experiments on various block sizes and watermark message
lengths indicated that while other block size and bit rates also
can be picked based on the application, the case of 16 × 16
block size and 128 bit message length is the best compromise
between imperceptibility, capacity, and robustness.
7) Capacity: To show the capacity of the proposed scheme,
a graph of capacity versus noise standard deviation is shown
in Fig. 9. In this graph, we have considered a fixed BER of
5% as a typical sufficiently small error rate and investigated
the maximum message length in bits for 16 × 16 block size
and M = 2 which obtains this error rate. The average message
length over four test images as well as the average MSSIM
value for each case is shown in Fig. 9. As shown in this figure,
even for heavy noise powers of σn = 20 , the capacity of the
proposed method in obtaining the BER of less than 5% is near
140 bits.
8) Comparison With Other Watermarking Schemes: Finally, we compare our watermarking algorithm for M = 2
with two of the recent blind watermarking techniques, [24] and
[31], in the same situation, for AWGN and median filtering
attacks. The simulation results are shown in Tables V, VI.
As shown in these tables, the robustness of our method is
considerably better than these two techniques.
VI. C ONCLUSION
In this article, we presented a novel blind watermarking
approach with the optimal decoder. Watermark embedding is
performed by multiplication of M specific matrices to the vector of samples of size four. These matrices translate and project
the vector of samples on the predefined coding lines depending
on the message symbol. Assuming the host samples to be i.i.d
Gaussian, which is often valid for approximation coefficients
of image blocks [15], we obtained a closed form PDF of
noisy watermarked samples. Having this distribution function,
we designed an optimum ML decoder. We analytically studied
and verified the error probability of the proposed decoder in
a noisy environment. The proposed algorithm is applied to
image signals by using four approximation coefficients of the
image blocks which may be selected according to a secret
key. In addition to be invariant to the volumetric distortions,
several simulations showed that the proposed algorithm is
highly robust against common watermarking attacks such as
AWGN, compression, and filtering. However, the algorithm
is sensitive to the collusion attack. The future work may be
proposing a solution to this problem in order to make the
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
IMAGE WATERMARKING
9
algorithm robust to the this attack as well.
where,
A PPENDIX A
C OMPUTING THE P ROBABILISTIC C HARACTERISTICS OF
THE D ECISION VARIABLE
To compute the PDF and the CDF of the variable c = ab , for
correlated Gaussian variables a ∼ N (0, σa2 ) and b ∼ N (0, σb2 )
with correlation coefficient r, consider their joint distribution,
given as:
fab (a, b) =
1
√
2
2
1
a
2rab
b
− 2(1−r
2 ) ( σ 2 − σa σ + 2 )
2πσa σb 1 − r2
e
a
b
σ
b
. (29)
a
b
Using this distribution function, the CDF of c =
can be
computed as:
}
{a
≤c
FC (c) = P
{b
}
{
}
= P a ≤ bc, b > 0 + P a ≥ bc, b < 0
∫ ∞ ∫ bc
=
fab (a, b) da db
b=0 a=−∞
∫ 0
∫ ∞
+
fab (a, b) da db,
(30)
b=−∞
a=bc
and its probability density function can be defined as:
∫ ∞
′
fC (c) = FC (c) =
|b|fab (bc, b) db.
(31)
−∞
Substituting (29) in (31) and considering the fact that fab (a, b)
is an even function with respect to a and b, we have:
∫ ∞
2
− b2
1
σ02
√
√
fC (c) =
,
be 2σ0 db =
2πσa σb 1 − r2 0
πσa σb 1 − r2
(32)
where
1 − r2
σ02 = c 2
.
( σa ) − σ2rc
+ σ12
a σb
b
As a consequence,
fC (c) =
√
1
σa σb 1 − r 2
.
a 2
2
2
π σb2 (c − rσ
σb ) + σa (1 − r )
In addition, FC (c) can be computed as:
∫ c
1
1
σb c − rσa
FC (c) =
fC (t) dt = + tan−1 √
.
2
π
σa 1 − r 2
−∞
(33)
(34)
A PPENDIX B
C OMPUTING THE D ECISION B OUNDARIES OF THE ML
D ECODER
For each two coding lines Li and Lj we have:
fC (c|i) ≷ij fC (c|j).
(35)
Substituting (17) in (35) and after some simplifications, we
obtain:
m2 c2 + m1 c + m0 ≷ij 0,
(36)
m2
=
m1
=
m0
=
σb|j √
σb|i √
2
1 − r|i2 −
1 − r|j
σa|j
σa|i
√
√
2 − 2r
2r|i 1 − r|j
1 − r|i2
|j
σa|j √
σa|i √
2.
1 − r|i2 −
1 − r|j
σb|j
σb|i
(37)
To find the roots for this quadratic equation, we show these
coefficients using ϕi = arctan(αi ), the angle of the coding
σ2
line Li . Thus, by defining γ 2 = σu2 , we can write the
n
parameters in (18) and (19) as:
σa2|i = 2σn2 (γ 2 sin2 (ϕi ) + 1),
r|i =
σb2|i = 2σn2 (γ 2 cos2 (ϕi ) + 1)
sin(ϕi ) cos(ϕi )γ 2
sin(2ϕi )σu2
=√
.
σa|i σb|i
(γ 2 sin2 (ϕi ) + 1)(γ 2 cos2 (ϕi ) + 1)
(38)
Then, it is easy to see that:
√
√
2σn2 γ 2 + 1
2
1 − r|i =
.
σa|i σb|i
(39)
Using these equations, we can rewrite m2 in (37) as:
√
2σn2 γ 2 + 1
m2 = (σb2|j − σb2|i )
σa|i σb|i σa|i σb|j
[
]
2
= R cos (ϕj ) − cos2 (ϕi ) ,
= R sin(ϕi + ϕj ) sin(ϕi − ϕj ),
(40)
√ 2
4 2
4σ γ
γ +1
where R = σa nσb σa σb . Similarly m0 can be computed as
|i
|i
|i
|j
m0 = −m2 , and m1 can be rewritten as:
[
]
m1 = R sin(2ϕi ) − sin(2ϕj )
= 2R cos(ϕi + ϕj ) sin(ϕi − ϕj ).
(41)
Therefore, the roots of (36) can be found as:
c = tan(
ϕi + ϕj
) or
2
c=−
1
ϕ +ϕ
tan( i 2 j )
,
(42)
which means the decision boundary between each two coding
lines Li and Lj . is either their bisector or its perpendicular
line. Since we first realize the sign of c and then detect its
corresponding coded word, the only valid solution will be the
bisector line.
R EFERENCES
[1] J. Seitz, Digital Watermarking For Digital Media. Arlington, VA, USA:
Information Resources Press, 2005.
[2] C.-S. Lu, Multimedia Security: Steganography and Digital Watermarking Techniques for Protection of Intellectual Property. Hershey, PA,
USA: IGI Publishing, 2004.
[3] G. Langelaar, I. Setyawan, and R. Lagendijk, “Watermarking digital
image and video data. a state-of-the-art overview,” IEEE Signal Process.
Mag., vol. 17, no. 5, pp. 20 –46, Sep. 2000.
[4] I. Cox, J. Kilian, F. Leighton, and T. Shamoon, “Secure spread spectrum
watermarking for multimedia,” IEEE Trans. Image Process., vol. 6,
no. 12, pp. 1673 –1687, Dec. 1997.
[5] J. Hernandez, M. Amado, and F. Perez-Gonzalez, “Dct-domain watermarking techniques for still images: detector performance analysis and
a new structure,” IEEE Trans. Image Process., vol. 9, no. 1, pp. 55 –68,
Jan. 2000.
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication.
10
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, VOL. ??, NO. ?, APR 2011
[6] H. Altun, A. Orsdemir, G. Sharma, and M. Bocko, “Optimal spread
spectrum watermark embedding via a multistep feasibility formulation,”
IEEE Trans. Image Process., vol. 18, no. 2, pp. 371 –387, Feb. 2009.
[7] Q. Cheng and T. Huang, “An additive approach to transform-domain
information hiding and optimum detection structure,” IEEE Trans.
Multimedia, vol. 3, no. 3, pp. 273 –284, Sep. 2001.
[8] M. Kutter and S. Winkler, “A vision-based masking model for spreadspectrum image watermarking,” IEEE Trans. Image Process., vol. 11,
no. 1, pp. 16 –25, Jan. 2002.
[9] P. Moulin and A. Ivanovic, “The zero-rate spread-spectrum watermarking game,” IEEE Trans. Signal Process., vol. 51, no. 4, pp. 1098 – 1117,
Apr. 2003.
[10] S. Maity and S. Maity, “Multistage spread spectrum watermark detection
technique using fuzzy logic,” IEEE Signal Process. Lett., vol. 16, no. 4,
pp. 245 –248, Apr. 2009.
[11] M. Barni, F. Bartolini, A. De Rosa, and A. Piva, “A new decoder for
the optimum recovery of nonadditive watermarks,” IEEE Trans. Image
Process., vol. 10, no. 5, pp. 755 –766, May. 2001.
[12] ——, “Optimum decoding and detection of multiplicative watermarks,”
IEEE Trans. Signal Process., vol. 51, no. 4, pp. 1118 – 1123, Apr. 2003.
[13] Q. Cheng and T. Huang, “Robust optimum detection of transform domain multiplicative watermarks,” IEEE Trans. Signal Process., vol. 51,
no. 4, pp. 906 – 924, Apr. 2003.
[14] J. Wang, G. Liu, Y. Dai, J. Sun, Z. Wang, and S. Lian, “Locally optimum
detection for barni’s multiplicative watermarking in dwt domain,” Signal
Process., vol. 88, no. 1, pp. 117–130, 2008.
[15] M. A. Akhaee, S. M. E. Sahraeian, B. Sankur, and F. Marvasti, “Robust
scaling-based image watermarking using maximum-likelihood decoder
with optimum strength factor,” IEEE Trans. Multimedia, vol. 11, no. 5,
pp. 822 –833, Aug. 2009.
[16] M. A. Akhaee, S. M. E. Sahraeian, and F. Marvasti, “Contourlet-based
image watermarking using optimum detector in a noisy environment,”
IEEE Trans. Image Process., vol. 19, no. 4, pp. 967 –980, Apr. 2010.
[17] S. M. E. Sahraeian, M. A. Akhaee, and F. Marvasti, “Distribution
independent blind watermarking,” in Proc. IEEE Int. Conf. Image
Processing, Cairo, Egypt, Nov. 2009, pp. 125 –128.
[18] C.-W. Tang and H.-M. Hang, “A feature-based robust digital image
watermarking scheme,” IEEE Trans. Signal Process., vol. 51, no. 4,
pp. 950 – 959, Apr. 2003.
[19] D. Zheng, J. Zhao, and A. El Saddik, “Rst-invariant digital image
watermarking based on log-polar mapping and phase correlation,” IEEE
Trans. Circuits Syst. Video Technol., vol. 13, no. 8, pp. 753 – 765, Aug.
2003.
[20] B. Chen and G. Wornell, “Quantization index modulation: a class
of provably good methods for digital watermarking and information
embedding,” IEEE Trans. Inf. Theory, vol. 47, no. 4, pp. 1423 –1443,
May. 2001.
[21] J. Eggers, R. Buml, and B. Girod, “Estimation of amplitude modifications before scs watermark detection,” in Proc. SPIE: Security
Watermarking Multimedia Contents IV, vol. 46, San Jose, Ca, USA,
Jan. 2002, pp. 387–398.
[22] J. H. Conway, N. J. A. Sloane, and E. Bannai, Sphere-packings, lattices,
and groups. New York, NY, USA: Springer-Verlag New York, Inc.,
1998.
[23] M. Miller, G. Doerr, and I. Cox, “Applying informed coding and
embedding to design a robust high-capacity watermark,” IEEE Trans.
Image Process., vol. 13, no. 6, pp. 792 –807, Hun. 2004.
[24] C. Chen and X. Wu, “An angle qim watermarking algorithm based
on watson perceptual model,” in Proc. Int. Conf. Image and Graphics
Processing (ICIG07), Chengdu, Sichuan, China, 22-24 2007, pp. 324
–328.
[25] F. Ourique, V. Licks, R. Jordan, and F. Perez-Gonzalez, “Angle qim: a
novel watermark embedding scheme robust against amplitude scaling
distortions,” in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal
Processing, vol. 2, Philadelphia, PA, USA, Mar. 2005, pp. 797 – 800.
[26] F. Perez-Gonzalez, C. Mosquera, M. Barni, and A. Abrardo, “Rational
dither modulation: a high-rate data-hiding method invariant to gain
attacks,” IEEE Trans. Signal Process., vol. 53, no. 10, pp. 3960 – 3975,
Oct. 2005.
[27] F. Perez-Gonzalez and C. Mosquera, “Quantization-based data hiding
robust to linear-time-invariant filtering,” IEEE Trans. Inf. Forensics
Security, vol. 3, no. 2, pp. 137 –152, Jun. 2008.
[28] P. Guccione and M. Scagliola, “Hyperbolic rdm for nonlinear valumetric
distortions,” IEEE Trans. Inf. Forensics Security, vol. 4, no. 1, pp. 25
–35, Mar. 2009.
[29] M. A. Akhaee, A. Amini, G. Ghorbani, and F. Marvasti, “A solution
to gain attack on watermarking systems: Logarithmic homogeneous
rational dither modulation,” in Proc. IEEE Int. Conf. Audio, Speech,
and Signal Process., Dallas, TX, USA, May. 2010, pp. 1050 –1053.
[30] Z. Wang, A. C. Bovik, H. R. Sheikh, and E. P. Simoncelli, “Image
quality assessment: from error visibility to structural similarity,” IEEE
Trans Image Process, vol. 13, pp. 600–612, Apr 2004.
[31] Y. Wang, J. Doherty, and R. Van Dyck, “A wavelet-based watermarking
algorithm for ownership verification of digital images,” IEEE Trans.
Image Process., vol. 11, no. 2, pp. 77 –88, Feb. 2002.
Mohammad Ali Akhaee (S’07–M’10) was born in
Tehran, Iran, in 1982. He received the B.S. degree
in both electronic and communication engineering
from Amirkabir University of Technology and the
M.S. degree from Sharif University of Technology,
Tehran, Iran, in 2003 and 2005, respectively. He
received his PhD degree from Sharif University of
Technology in 2009. He has awarded the Endeavour
research fellowship from Australia in 2010. He is
now a post-doc researcher at the Computing and
Audio Research Laboratory at the University of
Sydney.
His research interest includes multimedia security, statistical signal processing, acoustic, and image processing.
Sayed Mohammad Ebrahim Sahraeian (S’08)
was born in Shiraz, Iran, in 1983. He received
the B.S. and M.S. degrees in electrical engineering
from Sharif University of Technology, Tehran, Iran,
in 2005 and 2008, respectively. He is currently
pursuing the Ph.D. degree in electrical and computer
engineering at Texas A&M University, College Station.
His research interests include image and multidimensional signal processing, genomic signal processing, bioinformatics, and computational biology.
Craig Jin (M’92) received the M.S. degree in
applied physics from the California Institute of Technology, Pasadena, in 1991 and the Ph.D. degree in
electrical engineering from the University of Sydney,
Sydney, Australia, in 2001. He is a Senior Lecturer
in the School of Electrical and Information Engineering, University of Sydney and also a Queen Elizabeth II Fellow. He is a Director of the Computing
and Audio Research Laboratory at the University
of Sydney and also a co-founder of three start-up
companies. His research focuses on spatial audio and
neuromorphic engineering and he is the author or co-author of more than 50
journal or conference papers in these areas and holds six patents. Dr. Jin has
received national recognition in Australia for his invention of a spatial hearing
aid.
Copyright (c) 2011 IEEE. Personal use is permitted. For any other purposes, Permission must be obtained from the IEEE by emailing [email protected].