ConvChain is a Markov chain of images that converges to input-like images. That is, the distribution of NxN patterns in the outputs converges to the distribution of NxN patterns in the input as the process goes on.
In the examples a typical value of N is 3.
How to construct such a process? We want the final probability of a given pattern to be proportional to the pattern's weight, where pattern's weight is the number of such patterns in the input. For this it is sufficient that a stronger condition is satisfied: the probability of a given state (image) S
should be proportional to the product of pattern weights over all patterns in S
.
p(S) ~ product of pattern weights over all patterns in S
Fortunately, there are general methods to build a Markov chain that has the desired probability distribution over states as its stationary distribution.
Additional definitions:
- For the ease of reasoning about the algorithm, it's convenient to introduce an energy function
E
,E(S) := - sum over all patterns P in S of log(weight(P))
so the probability distribution over states becomesp(S) ~ exp(-E(S))
. Note that this energy function is a generalization of the Ising model energy. In the Ising model patterns are 1x2 instead of NxN. - To expand possible applications, it's convenient to introduce a temperature parameter
T
so the probability distribution over states becomesp(S) ~ exp(-E(S)/T)
. Low temperatures make the distribution more concentrated in energy wells, high temperatures make the distribution more uniform. If one uses ConvChain to generate dungeons, low temperatures correspond to accurate newly built dungeons while high temperatures correspond to ruins. - For the speed of convergence, it's convenient for weights of all patterns to be nonzero. So let's redefine the
weight(P)
of a patternP
to be the number of patternsP
in the input if that number is more than zero and some small numbereps
otherwise,0 < eps < 1
.
- Read the input image and count NxN patterns.
- (optional) Augment pattern data with rotations and reflections.
- Initialize the image (for example, with independent random values) in some state
S0
. - Repeat the Metropolis step:
- Compute the energy
E
of the current stateS
. - Choose a random pixel and change its value. Let's call the resulting state
S'
. - Compute the energy
E'
of the stateS'
. - Compare
E'
toE
. IfE' < E
assign the current state to beE'
. Otherwise, assign the current state to beE'
with probabilityexp(-(E'-E)/T)
.
- Compute the energy
If there are more than 2 colors, Gibbs sampling may converge faster than Metropolis:
- Repeat the Gibbs step: change the current state
S
to a stateS'
according to the probability distributionp(S'|S) ~ exp(-E'/T)
.
ConvChain supports constraints, so you can easily combine it with other generators or handcrafted content.
In the language of WFC readme ConvChain satisfies strong condition 2 (Strong C2), but not condition 1 (C1).
If you freeze out the system as the Metropolis simulation goes on, you'll get a variant of the simulated annealing algorithm.
The detailed balance condition for ConvChain is exp(-E1/T)p(S2|S1) = exp(-E2/T)p(S1|S2)
, so both Gibbs p(S2|S1) ~ exp(-E2/T)
and Metropolis p(S2|S1) = min(1, exp(-(E2-E1)/T))
chains converge to the desired distribution over states.
- Stuart Geman and Donald Geman, Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images, 1984.
- Kris Popat and Rosalind W. Picard, Novel cluster-based probability model for texture synthesis, classiffcation, and compression, 1993.
- Rupert Paget and I. Dennis Longstaf, Texture Synthesis via a Non-parametric Markov Random Field, 1995.
- Vivek Kwatra, Irfan Essa, Aaron Bobick and Nipun Kwatra, Texture Optimization for Example-based Synthesis, 2005.
ConvChain is a console application that depends only on the standard library. Get .NET Core for Windows, Linux or macOS and run
dotnet run --configuration Release ConvChain.csproj
ConvChain.cs
contains the basic program, ConvChainFast.cs
contains an equivalent faster program (~100 times faster on a 4-core CPU), but in a less human-readable form.
- Kevin Chapelier made an interactive vanilla JavaScript port of the faster version and a 1KB adaptation of it.
- Amit Patel made a web port of the older slower version, with the main algorithm ported to TypeScript.
- buckle2000 made a Processing (Java) port and a MoonScript port.
- Kevin Chapelier adapted ConvChain to run on a GPU and made an interactive WebGL2 demo.