Log In  


Hello,
as any of you guys I need some decent picture compression for the Pico Off Road game I'm making. PX-9 is very good choice but one of my passions always were compression algorithms. So I've decided to implement one myself. Sometimes is better and sometimes is worse than other available algorithms here. Consider it as my addition to the community and use it wherever you need.

Algorithm

LZ77 is very simple compression method that works similar to RLE with the exception: while RLE copies the same character N times LZ77 copies chunk of already decoded data of length N. I've used 3 different blocks: direct color output, offset + length block and just length for coherent rows (sequence that starts on the same X position but in previous row). Compression allows self-overlapping (for example offset 3 and length 20 is valid, it just copies itself with offset).

For more information check https://en.wikipedia.org/wiki/LZ77_and_LZ78#LZ77

Limits

  • designed for PICO-8 pictures only (or any arrays of values in interval 0..15), don't try it on map or other binary data
  • picture size limit is 128x128
  • compression uses simple brute force search and can be very slow (up to a minute on fast PC), but who cares?
  • decompressor is 131 tokens long and is relatively fast
  • it's guaranteed that the compressed stream never contains ] character so it's safe to keep it in [[..]] raw lua string
  • compressed stream doesn't contain capital letters (A..Z)
  • size of the picture is not in the compressed stream so you have to know it ahead

Tips

LZ77 from it's nature is good for compressing repeated patterns. For example images that use 2 color gradients with Bayer-Matrix dithering have usually much better compression ratios than images with random dithering.

Large blocks of single color are also compressed with high rations but expect much longer compression times.

Example

Load cartridge, import yourimage.png, in function _init edit w,h (width and height of the area you want to compress) and run. Result is written to compressed.txt.
For example following image is ideal for LZ77 - compressed size is just 1157 bytes.

Code

Whole code including cartridge and test picture is on Git (license CC0) https://github.com/assemblerbot/pico8-lz77 .

If you want to copy-paste it, here it is:

Compressor

max_run_len=100

function lz77_out_a(value) -- 0..15
	if value<0 or value>15 then
		stop("value 𝘢 out of range! "..value)
	end

	return chr(value+32)
end

function lz77_out_b(value) -- 0..178
	if value<0 or value>178 then
		stop("value 𝘣 out of range! "..value)
	end

	if value<17 then
		return chr(value+48)
	else
		return chr(value+94-17)
	end
end

function lz77_out_c(value) -- 0..194
	if value<0 or value>194 then
		stop("value 𝘤 out of range! "..value)
	end

	if value<33 then
		return chr(value+32)
	else
		return chr(value+94-33)
	end
end

function lz77_comp(x0,y0,w,h,vget)
	local pixels={}
	for y=0,h-1 do
		for x=0,w-1 do
			add(pixels,vget(x0+x,y0+y))
		end
	end

	local out=""
	local i=1
	while i<=#pixels do
		local p=pixels[i]

		local run=0
		local ofs=0
		for j=max(1,i-195),i-1 do
			local k=0
			while i+k<=#pixels and pixels[i+k]==pixels[j+k] do k+=1 end
			if k>run or (k>=run-1 and i-j==w) then
				run=k
				ofs=i-j
			end
		end

		if run<2 or (run==2 and ofs~=w) then
			-- direct pixel output
			out=out..lz77_out_a(p)
			i+=1
		elseif ofs==w then
			-- row with coherence
			run=min(run,178-(max_run_len+1)+2)

			out=out..lz77_out_b(run-2+max_run_len+1)
			i+=run
		else
			-- run
			run=min(run,max_run_len+2)

			out=out..lz77_out_b(run-2)
			out=out..lz77_out_c(ofs-1)

			i+=run
		end
	end
	return out
end

Decompressor

function lz77_decomp(x0,y0,w,h,src,vset)
	local i,d=1,{}
	while i<=#src do
		local c=ord(src,i)
		if c<48 then
			add(d,c-32)
		else
			local ofs,run=w,c-46
			if c>=94 then
				run-=29
			end
			if run>=103 then
				run-=101
			else
				i+=1
				ofs=ord(src,i)-31
				if ofs>=63 then
					ofs-=29
				end
			end
			local pos=#d-ofs
			for j=1,run do
				add(d,d[pos+j])
			end
		end
		i+=1
	end
	for i=0,w*h-1 do
		vset(i%w+x0,i\w+y0,d[i+1])
	end
end

Main

function _init()
	local x,y=0,0 --source position
	local w,h=128,128 --source size

	local lz77=lz77_comp(x,y,w,h,sget)
	print("size="..#lz77)
	printh(lz77,"compressed.txt")
	--lz77_decomp(x,y,w,h,lz77,pset)
	flip()
	stop()
end 

Have fun! :-)

8


Thanks for posting,

I've been tinkering with various permutations on RLE and working on encoding runs of patterns instead of just solid colors, but haven't made the jump to something like this yet, so this will be interesting to dissect.

I've been doing some comparison tests between image compression algorithms on Pico-8, and I've found some surprising things. First is that the integrated code compression is pretty impressive, and its effects on actual space on cartridge are really big.

To illustrate, I just made a new algorithm that encodes runs of bytes instead of pixels, and running your test image through it, I get a raw string size of 2991 characters vs. just 1157 for your LZ77 codec, but the compressed size is actually smaller, at 923 bytes vs. 1064.

I guess it depends on whether or not you're going to store compressed data in string form or in binary, but if the former, it seems like optimizing file size is largely about figuring out what works the best with the compression systems already at play. This does make things more complicated, but I guess extra compression always helps.


That's very interesting. I have multiple versions of the algorithm too, so I'll test them against Pico-8 internal compressor. Thanks for info.


Heard about lz77 reading articles about GBA tilesets — thanks for making and sharing a pico8 version!


Just wanted to say thanks for this! <3



[Please log in to post a comment]