Merge with x264.git #1

Also remove unused AVX cruft.

TBM and BMI1 are supported by Trinity/Piledriver. The others (and BMI1) will probably appear in Intel's upcoming Haswell. Also update x86inc with AVX2 stuff.

Broke register preservation in x264_cpu_cpuid and x264_cpu_xgetbv. Did not cause any problems.

Broke if the first macroblock in the slice exceeded the set slice-max-size.

BGR/BGRA input was correct.

x264_cavlc_init needs to be stack-aligned now.

Some x264 asm assumed that the high 32 bits of registers containing "int" values would be zero. This is almost always the case, and it seems to work with gcc, but it is *not* guaranteed by the ABI. As a result, it breaks with some other compilers, like Clang, that take advantage of this in optimizations. Accordingly, fix all x86 code by using intptr_t instead of int or using movsxd where neccessary. Also add checkasm hack to detect when assembly functions incorrectly assumes that 32-bit integers are zero-extended to 64-bit.

Not necessary for x264, as -m amd64 already does the right thing, but used by external users of x86inc.

Not necessary with the CAVLC lookup table for zero run codes.

Helps avoid VBV predictors going nuts with very low-cost MBs. One particular case this fixes is zero-cost MBs: adaptive quantization decreases the QP a lot, but (before this patch), no cost penalty gets factored in for this, because anything times zero is zero.

Required for row re-encoding.

Extremely accurate, possibly 100% so (I can't get it to fail even with difficult VBVs). Does not yet support rows split on slice boundaries (occurs often with slice-max-size/mbs). Still inaccurate with sliced threads, but better than before.

Intel was nice enough to make tzcnt equal to "rep bsf", which is backwards-compatible. This means we don't actually have to add new functions to make it work.

Recent AMD CPUs' instruction decoders choke horribly on extremely long nops (i.e. with 4 prefixes). Won't affect much, since we don't use ALIGN much.

Fully reconstruct frames even without dump-yuv.

Lowers encoding latency around 14% in sliced threads mode with preset superfast. Additionally, even if there is no waiting time between frames, this improves parallelism, because hpel+deblock are done during the (singlethreaded) lookahead. For ease of debugging, dump-yuv forces all of the threads to wait and finish instead of setting b_full_recon.

Regression in r2183. Bizarrely seemed to work on many platforms, but crashed on win64 and may have been slower. Only affected sliced threads during encoding, but could cause crashes on x264 encoder close even without sliced threads.

Was using qp instead of qscale; could cause NANs (not to mention less accurate results).

The code does, in fact, handle CAVLC+8x8dct correctly already.

MSVS requires exported variables to be declared with the DATA keyword, and requires that imported variables be declared with dllimport. This does not fix x264 cli being unable to use a shared library built by ICL however.

Adds support for a bunch of targets, including: aarch64 (armv8) arm-linux-androideabi

Makes multilib compilation more convenient.

x264 never supported it and never will because nobody uses it.

New assembly function with SSE2, SSSE3 and XOP implementations for calculating absolute sum of differences.

Some use-cases of x264 involve encoding video with large constant areas of the frame. Sometimes, the caller knows which areas these are, and can tell x264. This API lets the caller do this and adds internal tracking of modifications to macroblocks to avoid problems. This is really only suitable without B-frames. An example use-case would be using x264 for VNC.

Split each lookahead frame analysis call into multiple threads. Has a small impact on quality, but does not seem to be consistently any worse. This helps alleviate bottlenecks with many cores and frame threads. In many case, this massively increases performance on many-core systems. For example, over 100% faster 1080p encoding with --preset veryfast on a 12-core i7 system. Realtime 1080p30 at --preset slow should now be feasible on real systems. For sliced-threads, this patch should be faster regardless of settings (~10%). By default, lookahead threads are 1/6 of regular threads. This isn't exacting, but it seems to work well for all presets on real systems. With sliced-threads, it's the same as the number of encoding threads.

Fix some integer overflows and check input parameters better. Also fix incorrect type specifiers for demuxer info printing.

Allow manual invocation of WIN64_SPILL_XMM even under INIT_MMX SSE version of mova is movaps rather than movdqa. YMM version of movnta. Add mp size for named arguments. Fix DEFINE_ARGS when used outside of a cglobal. Define a few more cpuflags. 3-argument wrappers for a few more instructions.

Limits VBV mispredictions after long periods of relatively constant video.

Implement a basic separable bilinear filter to rescale the quantizer offsets. Structure inspired by swscale, but floating-point instead of fixed-point. Not as optimized as it could be, but it's quite fast already. Example compression penalties on a 720p video game recording: First pass with 720p and second as 480p: ~-1.5% (vs. same res) First pass with 480p and second as 720p: ~-3% (vs. same res)

Turn off the sub8x8 partitions, try it, and turn them back on if it didn't help. Small compression improvement with p4x4 on (~0.1-0.5%). Also update related comments.

Fix a typo that made an early-skip less effective. Avoid a relatively unpredictable branch. Slightly changed output due to the typo-fix. ~50 cycles faster on Core i7.

People don't seem to like this so I'm just going to get rid of it.

This eliminates a memory leak when calling x264_encoder_close.

Backported from libav.

Previously it was policy to use -pthread, but OpenBSD now recommends -lpthread. its been libpthread anyway and policy has changed to stop using -pthread.

Useful to judge the resulting quality of a frame when VBV is enabled.

Allow fast skipping even if the pskip MV isn't zero.

Add the input frame opaque pointer to the arguments. This makes it easier to use with multiple simultaneous x264 encodes.

x264 would free mb_info before it was completely done using it.

This feature lets the callee know which decoded macroblocks have changed.

Prerequisite for another configure patch after this. Idea copied from libpthread.

ICL's preprocessor doesn't handle it correctly. This fix is similar to libav's fix in 0db2d9.

Lossless mode can't currently be enabled mid-stream.

The Apple A6 CPU doesn't support performance counters, so this test caused a crash.

This allows overriding the value from outside the file. This can be useful if x86inc.asm is used outside of x264.

The name "3dnowext" is more common than "3dnow2". Doesn't affect x264.

Doesn't actually change encoding behavior, but makes it more correct. Warning messages should now be accurate at higher bit depths and non-4:2:0. Technically, since it redefines x264_level_t, this is an API version increment.

Use the first macroblock of each slice instead of the last of the previous. Lets us pick a reasonable initial QP for the first slice too. Slightly improved compression.

Small compression improvement; up to ~0.5% in extreme cases. Helps more with small slice sizes (tiny resolutions or slice-max-size). Note that this changes the 2-pass stats file format.

Fixes a possible regression in r2228.

Allocate AVFrames correctly with avcodec_alloc_frame(). This caused crashes with newer libavcodecs that try to free frame extradata.

Solaris responds correctly to the same value as Cygwin, so let's use that.

Slightly wrong numbers in level table.

Doesn't actually affect x264, but it's more correct.

GAS doesn't seem to like spaces in vld1 anymore, so remove those.

This is obviously bad user input, but x264 shouldn't crash if it happens.

Use this in 8-bit loopfilter functions so they can be used if there is no aligned stack (e.g. x86-32 MSVC or ICC 10.x).

Now RET checks whether it immediately follows a branch, so the programmer dosen't have to keep track of that condition. REP_RET is still needed manually when it's a branch target, but that's much rarer. The implementation involves lots of spurious labels, but that's ok because we strip them.

Automatically use VEX-encoding in AVX/AVX2/XOP/FMA3/FMA4 functions for all instructions that exists in a VEX-encoded version. This change makes it easier to extend existing code to use AVX2. Also add support for AVX emulation of a few instructions that were missing before.

First AVX2 function for testing. Bump yasm version to 1.2.0 for AVX2 support.

It is no longer needed now that we've bumped the version requirement of yasm to 1.2.0.

The "CentaurHauls family 6 model 9 stepping 8" family of CPUs (flags: fpu vme de pse tsc msr cx8 sep mtrr pge mov pat mmx fxsr sse up rng rng_en ace ace_en) SIGILLs on long nop codes.

Regression in r2145. Assembly assumed array was [2][64] when it was actually [2][63]. Tiny (~0.1%) compression improvement.

Code assumed keyframe analysis would only pull one frame off the list; this isn't true with open-gop.

Synced from libav. The new name is more descriptive and will allow defining a separate public prefix for externally visible library symbols.

This allows defining externally visible library symbols. Signed-off-by: Diego Biurrun <diego@biurrun.de>

~4% faster PIC WIN64: ~3% faster and 16 byte shorter cabac_encode_bypass ~8% faster cabac_encode_terminal Benchmarked on Ivy Bridge UNIX64: One instruction less in cabac_encode_bypass

Reduces code size because movaps/movups is one byte shorter than movdqa/movdqu. Also merge MMX and SSE versions of memcpy_aligned into a single macro.

Smarter decision to improve fast-first-pass performance in 2-pass encodes. Dramatically improves CPU utilization on multi-core systems. Tested on a quad-core Ivy Bridge (12 threads, 1080p): Fast first pass: veryfast: ~7% faster faster: ~11% faster fast/medium: ~15% faster slow/slower: ~42% faster veryslow: ~55% faster CRF/1-pass: veryfast: ~9% faster (all others remained the same)

pmv wasn't checked properly in some cases, as well as zero vector. Output-changing portion of the following patch.

Branchlessly handle elimination of candidates in MMX roundclip asm. Add a new asm function, similar to roundclip, except without the round part. Optimize and organize the C code, and make both subme>=3 and subme<3 consistent. Add lots of explanatory comments and try to make things a little more understandable. ~5-10% faster with subme>=3, ~15-20% faster with subme<3.

About 15% faster on average.

Makes SATD 20-50% faster across all partition sizes but 4x4.

Speedup is most apparent for 8-bit (~30%), but gives some improvements for 10-bit too (~12%). 64-bit only for now.

The Bobcat has a 64-bit SIMD unit reminiscent of the Athlon 64; detect this and apply the appropriate flags. It also has an extremely slow palignr instruction; create a flag for this to avoid massive penalties on palignr-heavy functions. Improve Atom function selection and document exactly what the SLOW_ATOM flag covers. Add Atom-optimized SATD/SA8D/hadamard_ac functions: simply combine the ssse3 optimizations with the sse2 algorithm to avoid pmaddubsw, which is slow on Atom along with other SIMD multiplies. Drop TBM detection; it'll probably never be useful for x264. Invert FastShuffle to SlowShuffle; it only ever applied to one CPU (Conroe). Detect CMOV, to fail more gracefully when run on a chip with MMX2 but no CMOV.

Use Conroe-style movddup in AVX transforms; both Sandy Bridge and Bulldozer do movddup in the load unit, so it's totally free this way. On Sandy Bridge: ~6% faster sa8d_satd ~5% faster hadamard_ac ~9% faster 32-bit satd ~2% faster sa8d

There's quite a few others, but most of them don't help to fix or there's no easy way to avoid them.

Faster, fewer branch mispredictions.

Uses dlopen to load AvxSynth on Linux and OS X. Allows the use of --demuxer avs for AvxSynth, though the only source filter it can currently use is FFMS2. Add a local copy of avxsynth_c.h and its dependent headers in extras/ so that users don't need to actually have AvxSynth development headers installed to enable support for it (mirroring the AviSynth behavior). Based on a patch by 0x09 (tab@lavabit.com)

This reduces overhead and lets us use less branchy code for zigzag, dequant, decimate, and so on. Reorganize and optimize a lot of macroblock_encode using this new function. ~1-2% faster overall. Includes NEON and x86 versions of the new function. Using larger merged functions like this will also make wider SIMD, like AVX2, more effective.

Up to 10-15% faster overall.

Regression in r2273.

SWAP with >=3 named (rather than numbered) args PERMUTE followed by SWAP with 2 named args used to produce the wrong permutation

Regression in r2265 (only affected compilers with broken stack alignment, like ICL on win32).

Fixes building against newer libavcodecs from the Libav project.

Results vary between versions because of different rounding results.

Works in conjunction with slice-max-mbs and/or slice-max-size to avoid overly small slices. Useful with certain decoders that barf on extremely small slices. If slice-min-mbs would be violated as a result of slice-max-size, x264 will exceed slice-max-size and print a warning.

The H.264 spec technically has limits on the number of slices per frame. x264 normally ignores this, since most use-cases that require large numbers of slices prefer it to. However, certain decoders may break with extremely large numbers of slices, as can occur with some slice-max-size/mbs settings. When set, x264 will refuse to create any slices beyond the maximum number, even if slice-max-size/mbs requires otherwise.

Rescale the scale factor if the offset clips. This makes weightp more effective in fades to/from white (and an other situation that requires big offsets). Search more than 1 scale factor and more than 1 offset, depending on --subme. Try to find the optimal chroma denominator instead of hardcoding it. Overall improvement: a few percent in fade-heavy clips, such as a sample from Avatar: TLA.

OpenCL support is compiled in by default, but must be enabled at runtime by an --opencl command line flag. Compiling OpenCL support requires perl. To avoid the perl requirement use: configure --disable-opencl. When enabled, the lookahead thread is mostly off-loaded to an OpenCL capable GPU device. Lowres intra cost prediction, lowres motion search (including subpel) and bidir cost predictions are all done on the GPU. MB-tree and final slice decisions are still done by the CPU. Presets which do not use a threaded lookahead will not use OpenCL at all (superfast, ultrafast). Because of data dependencies, the GPU must use an iterative motion search which performs more total work than the CPU would do, so this is not work efficient or power efficient. But if there are spare GPU cycles to spare, it can often speed up the encode. Output quality when OpenCL lookahead is enabled is often very slightly worse in quality than the CPU quality (because of the same data dependencies). x264 must compile its OpenCL kernels for your device before running them, and in order to avoid doing this every run it caches the compiled kernel binary in a file named x264_lookahead.clbin (--opencl-clbin FNAME to override). The cache file will be ignored if the device, driver, or OpenCL source are changed. x264 will use the first GPU device which supports the required cl_image features required by its kernels. Most modern discrete GPUs and all AMD integrated GPUs will work. Intel integrated GPUs (up to IvyBridge) do not support those necessary features. Use --opencl-device N to specify a number of capable GPUs to skip during device detection. Switchable graphics environments (e.g. AMD Enduro) are currently not supported, as some have bugs in their OpenCL drivers that cause output to be silently incorrect. Developed by MulticoreWare with support from AMD and Telestream.

RDO: ~20% faster than C Bitstream: ~50% faster than C 1-2% faster overall, highest on preset superfast/fast/medium.

For when we want to mix simd sizes within one function.

AVX2 functions: mc_chroma intra_sad_x3_16x16 last64 ads hpel dct4 idct4 sub16x16_dct8 quant_4x4x4 quant_4x4 quant_4x4_dc quant_8x8 SAD_X3/X4 SATD var var2 SSD zigzag interleave weightp weightb intra_sad_8x8_x9 decimate integral hadamard_ac sa8d_satd sa8d lowres_init denoise

Also restructure some code to reduce code size of various functions, especially in high bit-depth.

Also fix the AVX implementation to correctly use the SSSE3 inline asm instead of SSE2.

Also rewrite the entire function to be faster and drop the AVX version which is no longer useful.

Also reduce the number of xmm registers used by mc_copy_* to avoid saving and restoring xmm6 and xmm7 on 64-bit Windows.

Also use loops instead of duplicating code; reduces code size by ~10kB with negligible effect on performance.

Also reduce the number of xmm registers used by sse2/ssse3 pixel_sad_x3.

~55% faster ads in benchasm, ~15-30% in real encoding. ~4% faster "placebo" preset overall.

~2x faster coeff_level_run. Faster CAVLC encoding: {1%,2%,7%} overall with {superfast,medium,slower}. Uses the same pshufb LUT abuse trick as in the previous ads_mvs patch.

Slices-max broke slice-max-size when slice-max wasn't used. Slice-min-mbs broke in rare cases near the end of a threadslice.

Likely didn't actually break in practice, but memcpy with src==dst is incorrect.

Prevents overflows that can occur in some cases.

The Mach-O bug was fixed in yasm 0.8.0 and we don't support versions that old. a.out was superseded by ELF on sane systems a few decades ago.

On modern CPUs movdqu isn't slower than movdqa when used on aligned data and using the same code in both cases saves cache. This was already done for the high bit-depth AVX2 implementation but the aligned version still exists as dead code so remove that.

Store XMM6 and XMM7 in the shadow space in functions that clobbers them. This way we don't have to adjust the stack pointer as often, reducing the number of instructions as well as code size.

Avoids the need for manual 32 byte array alignment on compilers that support -mpreferred-stack-boundary.

~2% faster trellis.

~7% faster using the pmulhrsw trick from mc_chroma.

20->16 cycles on Ivy Bridge

30->18 cycles

43->24 cycles

30->22 cycles

10->9 cycles

27 -> 19 cycles

quant_4x4: 13->6 cycles quant_4x4_dc: 14->8 cycles quant_8x8: 47->24 cycles quant_4x4x4: 48->25 cycles

28->15 cycles Also reorder instructions to use fewer registers, 3 cycles faster on Ivy Bridge with 64-bit Windows.

~5% faster than 32-bit.

Autoload the OpenCL library so that it's not required to run an openCL-enabled build of x264. Update X264_BUILD, which should have been changed with the first patch.

Also fix crash in the case of OpenCL error during encoding.

Also fix crash in high bit depth builds compiled with unaligned stack.

Bitstream-reallocation function didn't handle the case of filler.

This probably makes more sense to the user than setting vbv-maxrate = bitrate, as before.

Stops x264 from attempting to optimize global stream headers, ensuring that different segments of a video will have identical headers when used with identical encoding settings.

Don't omit the delta quant if it'd raise the quantizer to do so; this fixes a rare flickering issue caused by deblocking.

Prevents a crash if the misaligned exception mask bit is cleared for some reason. Misaligned SSE functions are only used on AMD Phenom CPUs and the benefit is miniscule. They also require modifying the MXCSR control register and by removing those functions we can get rid of that complexity altogether. VEX-encoded instructions also supports unaligned memory operands. I tried adding AVX implementations of all removed functions but there were no performance improvements on Ivy Bridge. pixel_sad_x3 and pixel_sad_x4 had significant code size reductions though so I kept them and added some minor cosmetics fixes and tweaks.

…ixels

This is also a valid value for WIN64.

Combine frame and mb data mallocs into a single large malloc. Additionally, on Linux systems with hugepage support, ask for hugepages on large mallocs. This gives a small performance improvement (~0.2-0.9%) on systems without hugepage support, as well as a small memory footprint reduction. On recent Linux kernels with hugepage support enabled (set to madvise or always), it improves performance up to 4% at the cost of about 7-12% more memory usage on typical settings.. It may help even more on Haswell and other recent CPUs with improved 2MB page support in hardware.

This format has been reverse engineered and x264's output has almost exactly the same bitstream as Panasonic cameras and encoders produce. It therefore does not comply with SMPTE RP2027 since Panasonic themselves do not comply with their own specification. It has been tested in Avid, Premiere, Edius and Quantel. Parts of this patch were written by Fiona Glaser and some reverse engineering was done by Joseph Artsimovich.

Windows, unlike most other operating systems, uses UTF-16 for Unicode strings while x264 is designed for UTF-8. This patch does the following in order to handle things like Unicode filenames: * Keep strings internally as UTF-8. * Retrieve the CLI command line as UTF-16 and convert it to UTF-8. * Always use Unicode versions of Windows API functions and convert strings to UTF-16 when calling them. * Attempt to use legacy 8.3 short filenames for external libraries without Unicode support.

Caused crashes under gdb in Windows and might cause other unknown problems.

If FFMS_ReadIndex is used with an empty index file it gets stuck in an infinite loop instead of returning NULL like it's supposed to do on failure. Explicitly check if the file is empty before calling it as a workaround.

…lchain

If only a static library is built, the user of the library that just tries to link to the lib using the flags provided by pkg-config might not know that only a static lib exists and that he'd have to pass --static to pkg-config to get the internal dependencies to be able to link the library. For a shared build, the internal dependencies are kept in Libs.private as before. This matches how libav's pkg-config files are generated.

It was used as a workaround for a bug that only existed in the GPAC repository for a few weeks back in 2010. There's no reason to keep it anymore.

This makes more sense for future implementations of templates with zmm registers.

Only warn if underflow occurs for reasons other than CRF-max, as CRF-max implies that VBV underflow is desired by the user.

~100 cycles faster with subme>=9

Do the reconfig when the next frame's encode begins. Fixes some rare crashes with frame-threading and encoder_reconfig.

Allows generation of hard-CBR streams without using NAL HRD. Useful if you want to be able to reconfigure the bitrate (which you can't do with NAL HRD on).

Also add some compatibility fixes.

It probably wasn't used or maintained for last few years.

Caused if the timebase is not specified in stats file. Found by Clang.

It's not possible to seek in pipes, so if we want to skip frames we have to read and discard unused ones. It's pointless to do bit-depth upconversions in those frames.

It's an old stand-alone application that isn't relevant to x264.

Also update AUTHORS file and my e-mail address in the headers of various files.

We don't need to wastefully allocate quant tables above QP_MAX_SPEC; they're never used.

Assembly based on code by Henrik Gramner and Loren Merritt.

Work around yasm's inefficiency with handling large numbers of variables in the global scope.

Android NDK does not expose sched_getaffinity.

Actually allocate less (instead of just initialize less) and fix comments.

Probably a regression in r2178.

Fixes possible corruption with MBAFF+sliced threads.

Fixes an issue with too many forced non-skips in mbaff+cavlc, as well as non-deterministic output with mbaff+cavlc+sliced-threads.

The full details of the return values of encoder_encode and encoder_headers were mistakenly removed a while ago; re-add them.

The H.264 spec says it shouldn't be set in these cases.

For when --frame-packing is set.

Makes it easier to detect typos.

Emulation requires a temporary register if arguments 1 and 4 are the same; this doesn't obey the semantics of the original instruction, so we can't emulate that in x86inc. ffmpeg has an x86util emulation for that case; I'll add it if x264's asm ever needs it. Also add pmacsdql emulation.

If the stack is known to be at least 32-byte aligned we can safely store ymm registers on the stack without doing manual alignment. Change ALLOC_STACK to always align the stack before allocating stack space for consistency. Previously alignment would occur either before or after allocating stack space depending on whether manual alignment was required or not.

Reduce the number of registers used from 7 to 6. Reduce the number of vector registers used by the AVX2 implementation from 8 to 7. Multiply fps_factor by 1/256 once per frame instead of once per macroblock row. Use mova instead of movu for dst since it's guaranteed to be aligned. Some cosmetics.

About 5.6x faster than C on Haswell.

checkasm --bench on a cortex-a9: var_8x16_c: 4306 var_8x16_neon: 791

checkasm --bench on a cortex-a9: var2_8x16_c: 5677 var2_8x16_neon: 1421

4% faster on main/medium, 15% faster on baseline/superfast on a cortex-a9.

Move the second core part of macroblock tree into an assembly function; SIMD-optimize roughly half of it (for x86). Roughly ~25-65% faster mbtree, depending on content. Slightly change how mbtree handles the tradeoff between range and precision for propagation. Overall a slight (but mostly negligible) effect on SSIM and ~2% faster.

Merge with x264.git #1

Are you sure you want to change the base?

Merge with x264.git #1

Commits on Feb 4, 2012

Commits on Feb 5, 2012

Commits on Feb 15, 2012

Commits on Mar 6, 2012

Commits on Mar 7, 2012

Commits on Mar 12, 2012

Commits on Mar 14, 2012

Commits on Mar 22, 2012

Commits on Mar 25, 2012

Commits on Mar 27, 2012

Commits on Apr 23, 2012

Commits on Apr 24, 2012

Commits on May 15, 2012

Commits on May 18, 2012

Commits on Jul 3, 2012

Commits on Jul 17, 2012

Commits on Jul 18, 2012

Commits on Jul 26, 2012

Commits on Jul 27, 2012

Commits on Sep 5, 2012

Commits on Sep 11, 2012

Commits on Sep 26, 2012

Commits on Nov 7, 2012

Commits on Nov 8, 2012

Commits on Nov 12, 2012

Commits on Nov 19, 2012

Commits on Dec 6, 2012

Commits on Dec 12, 2012

Commits on Jan 8, 2013

Commits on Jan 9, 2013

Commits on Feb 25, 2013

Commits on Feb 26, 2013

Commits on Mar 1, 2013

Commits on Apr 13, 2013

Commits on Apr 23, 2013

Commits on Apr 29, 2013

Commits on May 15, 2013

Commits on May 17, 2013

Commits on May 20, 2013

Commits on May 22, 2013

Commits on May 28, 2013

Commits on Jul 3, 2013

Commits on Jul 5, 2013

Commits on Aug 23, 2013

Commits on Aug 24, 2013

Commits on Aug 26, 2013

Commits on Aug 27, 2013

Commits on Sep 3, 2013

Commits on Oct 24, 2013

Commits on Oct 25, 2013

Commits on Oct 30, 2013

Commits on Jan 6, 2014

Commits on Jan 8, 2014

Commits on Jan 21, 2014

Commits on Feb 24, 2014

Commits on Mar 11, 2014

Commits on Mar 12, 2014

Commits on Mar 13, 2014

Commits on Mar 6, 2017