Programming | The Color Black

Time flies · 2019-03-24 11:59 by Black in Meta Programming

After finishing my masters thesis and completing civil service, I’ve been working for Disney Research since early 2010. And I’ve obviously not had much to say on this blog. Most of my early work was focused on software engineering, mostly tech transfer using C++. I’ve written the basis of a big code base that is used (and considering the resistance to update anything will likely be used for decades) in many projector / camera systems in disney parks. Qt was popular for GUI work back then, and I’ve used it in several projects targeted at normal humans (but since those were internal research projects, not many actually used it). I’ve worked on projects targeting mobile platforms, in both Objective-C and Unity (I’m even named on a patent for one of them). I’ve written plugins for Nuke, ToonBoom, AfterEffects and many others, code running on arduinos or servers with dozens of cores and multiple GPUs.

But lately, I’ve shifted to projects that focus on machine learning. I’m not a researcher, so I don’t focus on developing models and graphs, but over the past few years I’ve debugged, ported and improved a lot of the deep learning research code created here. My favorite framework is TensorFlow, and I spent most time with it, but I’ve also used PyTorch. TensorFlow’s graph based structure forces more discipline, and makes reasoning much easier than the python spaghetti code I’ve seen from torch users. The biggest NN project I’ve participated in so far was Denoising with Kernel Prediction. Implemented in TensorFlow, and using custom CUDA code for better performance, this denoiser surpasses anything before.

I still like C++, it is one of the most flexible and pragmatic languages that are widely used. But for deep learning, Python is the standard, and inertia ensures this will remain the case for many years. Which is unfortunate, the lack of type safety and static checking is a big annoyance, especially in non-trivial code bases as the ones I work in.
Using C++ for TensorFlow is possible but only done for very specific subset of tasks, such as embedding into other programs, or writing custom Ops with CPU or CUDA.

Comment

SVG embedding in XHTML with TextPattern · 2010-02-10 19:20 by Black in Graphics Programming

To store and transport scale independent graphics SVG is a standardised format. It is supported by most browsers somewhat, with the exception of Microsoft’s product. It would be nice if it could be used like the usual raster image formats in an <img /> tag. While Safari and other WebKit browsers support this usage, Firefox does not. Instead, SVG can be embedded directly into the source code of an XHTML file.

The above image is part of the source code of this page. It is included by a TextPattern plugin, and only slightly processed. Processing is needed to remove the <?xml /> header and to insert a viewBox attribute to allow scaling of the image with CSS. With that done, Firefox displays and scales the image nicely. Safari on the other hand causes trouble, it does not correctly infer the viewport height from the width and the aspect ratio.

To be allowed to embed SVG data, the mime type of the document has to be application/xhtml+xml or similar. This has to be changed for TextPattern by editing the header() call in publish.php. The plugin code itself is rather simple. Download a version ready to be pasted into TextPattern (Licensed under the MIT). Below the sourcecode.

svg_inline.php [1.74 kB]

function svg_inline($atts)
{
  extract(lAtts(array(
    'src'  => '',
  ),$atts));
 
  if ($src)
  {
    if ($src[0] == '/')
    {
      // Relative to Document Root
      $src = $_SERVER['DOCUMENT_ROOT'].$src;
    }
    $svg = file_get_contents($src, FILE_TEXT);
    if ($svg)
    {
      // Add this to publish.php
      //header("Content-type: application/xhtml+xml; charset=utf-8");
      $svg = preg_replace('/<\?xml [^>]*>/', '', $svg, 1);
      $svg = preg_replace('/(<svg[^>]*)width="([^"]*)"([^>]*)height="([^"]*)"([^>]*)>/',
        '$1$3$5 viewBox="0 0 $2 $4">', $svg, 1);
      return '<div class="svg">'.$svg.'</div>';
    }
    else
    {
      return 'Read error src='.$src;
    }
  }
  else
  {
    return 'Missing src';
  }
}

TextPattern plugins are php functions that take two arguments: an array containing the tag attributes, and the contents of the tag element. All this plugin function does is to read the specified svg file and return the filtered source. A simple <txp:svg_inline src="imagepath" /> results in a nicely embedded SVG.

Comment

Stereoscopic Camera for OpenGL · 2010-02-10 02:15 by Black in Graphics Programming

Even to create stereoscopic content digitally, cameras are used. But more than real cameras, a lot of freedom and control lies in the hands of the user. The position and projection can be freely decided without respecting things like the size of the camera, inexactness in the manufacturing of their optics or their weight.

For further reading, see Paul Bourke’s Page on stereo pair creation.

Camera Theory

Cameras in OpenGL are defined by filling the modelview matrix and the projection matrix with values. The modelview matrix defines the position of the camera relative to the origin or the object space, the projection matrix defines how coordinates in space are mapped to screen.

The projection matrix can be chosen freely, but normally two basic types of cameras are used: Orthographic and Perspective. Perspective cameras create projections very similar to how the human eye sees the world, objects appear smaller the further they are from the camera. Orthographic cameras project objects preserving parallel lines and their proportions. It is mostly used in technical drawings.

Stereo Pairs

A simple method to use perspective cameras to create stereoscopic footage is to converged their viewing axis. With hardware cameras, this is often used for macro recordings or recordings in closed rooms. The advantage is that the parallax plane is determined when recording, so post-processing needs are low. In addition, the cameras do not have to be as close together as in the next method. The biggest drawback is that the left and right sides of the image do not overlap and have to be cut away or ignored, and that the divergence behind the parallax plane is very strong and can easily lead to unfuseable content. This method should be avoided when ever possible.

A better method is to use perspective cameras with parallel axis. It requires the cameras to be relatively close together and well aligned, both of which is no problem to do in software. Unlike converged cameras, the maximal divergence at infinity is fixed, so even recordings containing far objects can work. The zero parallax plane lies at infinity. It can be moved by creating asymmetric view frustums, effectively horizontally moving both images.

For special visualizations, parallel cameras with converged axis can be used. And similar as with perspective converged cameras, extreme caution has to be taken to not create strongly diverging images. This method should only be used to show objects that are very close to the parallax plane.

Implementation with OpenGL

As part of ExaminationRoom, I implemented a flexible camera class. The source and header can be downloaded and used relatively freely. As all of my code on this page, they are licensed under the GPL and MIT licenses. This class is not meant to be used directly in an other project since a lot of code is specific to ER, but I am sure the core can be of use as example.

In my implementation, camera positions are defined by their position, their viewing direction, their up-vector and their separation (distance between the cameras). The projection is influenced by the field-of-view, the distance to the zero-parallax plane (the plane where separation of corresponding points is zero) and of course the type of the projection.

camera.h [6.01 kB]

private:
  Tool::Point   pos_;
  Tool::Vector  dir_;
  Tool::Vector  up_;
  float     sep_;
  float     fov_;
  float     ppd_;
  Tool::ScreenProject * spL_;
  Tool::ScreenProject * spR_;
  Camera::Type  type_;

The core of the class is the creation of the matrixes. The call to glFrustum sets the projection matrix, the modelview matrix is created with the utility method gluLookAt. The separation between the cameras has to be considered for both. The camera uses vertical field-of-view, so that the height of the image does not change between standard and widescreen viewport aspect ratios.

camera.cpp [9.43 kB]

void Camera::loadMatrix(float offsetCamera)
{
  GlErrorTool::getErrors("Camera::loadMatrix:1");
  GLint viewport[4];
  glGetIntegerv(GL_VIEWPORT, viewport);
  float aspect = (float)viewport[2]/viewport[3];
  float fovTan = tanf((fov_/2)/180*M_PI);
  if (type() == Camera::Perspective)
  {
    // http://local.wasp.uwa.edu.au/~pbourke/projection/stereorender/
 
    float fTop, fBottom, fLeft, fRight, fNear, fFar;
    // Calculate fNear and fFar based on paralax plane distance hardcoded factors
    fNear = ppd_*nearFactor;
    fFar = ppd_*farFactor;
    // Calculate fTop and fBottom based on vertical field-of-view and distance
    fTop = fovTan*fNear;
    fBottom = -fTop;
    // Calculate fLeft and fRight basaed on aspect ratio
    fLeft = fBottom*aspect;
    fRight = fTop*aspect;
 
    glMatrixMode(GL_PROJECTION);
    // Projection matrix is a frustum, of which fLeft and fRight are not symetric
    // to set the zero paralax plane. The cameras are parallel.
    glPushMatrix();
    glLoadIdentity();
    glFrustum(fLeft+offsetCamera, fRight+offsetCamera, fBottom, fTop, fNear, fFar);
    glMatrixMode(GL_MODELVIEW);
    glPushMatrix();
    glLoadIdentity();
    // Rotation of camera and adjusting eye position
    Vector sepVec = cross(dir_, up_); // sepVec is normalized because dir and up are normalized
    sepVec *= offsetCamera/nearFactor;
    // Set camera position, direction and orientation
    gluLookAt(pos_.x - sepVec.x, pos_.y - sepVec.y, pos_.z - sepVec.z,
          pos_.x - sepVec.x + dir_.x, pos_.y - sepVec.y + dir_.y, pos_.z - sepVec.z + dir_.z,
          up_.x, up_.y, up_.z);
    GlErrorTool::getErrors("Camera::loadMatrix:2");
  }

The perspective projection is used in most places. For ExaminationRoom, one of the feature requests was the ability to disable selected depth cues. A very strong cue is size relative to the environment. To disable this cue, parallel projection with converged cameras as described above is used instead. The values for the projection matrix were chosen so that the objects at the zero-parallax plane would not change their size when switching between the projection types. The projection matrix is derived from the normal orthographic projection created by OpenGL’s glOrtho by shearing it.

camera.cpp [9.43 kB]

  else if (type() == Camera::Parallel)
  {
    float fTop, fBottom, fLeft, fRight, fNear, fFar;
    // Calculate fNear and fFar based on paralax plane distance and a hardcoded factor
    // Note: the zero paralax plane is exactly in between near and far
    fFar = ppd_*farFactor;
    fNear = 2*ppd_ - fFar; // = ppd_ - (fFar-ppd_);
    // Set fTop and fBottom based on field-of-view and paralax plane distance
    // This is done to make the scaling of the image at the paralax plane the same
    // as in perspective mode
    fTop = fovTan*ppd_;
    fBottom = -fTop;
    // Set left and right baased on aspect ratio
    fLeft = fBottom*aspect;
    fRight = fTop*aspect;
 
    glMatrixMode(GL_PROJECTION);
    glPushMatrix();
    glLoadIdentity();
    // http://wireframe.doublemv.com/2006/08/11/projections-and-opengl/
    // Note: The code there is wrong, see below for correct code
    // Create oblique projection matrix by shearing an orthographic
    // Projection matrix. Those cameras are converged.
    const float shearMatrix[] = {
      1, 0, 0, 0,
      0, 1, 0, 0,
      -offsetCamera/nearFactor, 0, 1, 0,
      0, 0, 0, 1
    };
    glMultMatrixf(shearMatrix);
    glOrtho(fLeft, fRight, fBottom, fTop, fNear, fFar);
    glMatrixMode(GL_MODELVIEW);
    glPushMatrix();
    glLoadIdentity();
    // Rotation of camera
    // Note: The position of both left and right camera is at the same place
    //  because the offset is already calculated by the shearing, which also sets
    //  the zero paralax plane.
    gluLookAt(pos_.x, pos_.y, pos_.z,
          pos_.x + dir_.x, pos_.y + dir_.y, pos_.z + dir_.z,
          up_.x, up_.y, up_.z);
    GlErrorTool::getErrors("Camera::loadMatrix:3");
  }

Hopefully this is useful to someone :)

Comment

Lua String Writer · 2010-02-04 16:35 by Black in Programming Scripts

Lua strings are opaque byte streams. They are constant, and can only be manipulated by using the string api to create new strings. This can be expensive, especially when creating a string by appending new values at the end. While Lua contains optimizations for direct concatenation, successive appending has a high overhead.

This StringWriter class reduces the overhead by aggregating string concatenations in a table and executing them when requested. It was originally designed to serve as an efficient drop-in replacement for files as created by io.open, but it can also be used standalone.

The class itself is built with a protected shared metatable and state inside a table. The state itself is not protected (it would be possible by using individual metatables or an internal database in a weak table, but this is more elegant). The metatable contains entries to redirect reads to the method table, redirect new writes to nothing and prevent changing or reading the metatable. The concatenation operator is also overloaded, but since it has value semantic, and is not allowed to change the object itself, the implementation is less efficient than StringWriter:write(). Converting a StringWriter with tostring() gives the contained string, equivalent to StringWriter:get().

stringwriter.lua [4.77 kB]

-- MetaTable for string writers
local StringWriter_Meta = {
  ["__index"] = StringWriter_Methods;
  ["__newindex"] = function ()
      -- Don't allow setting values
    end;
  ["__metatable"] = StringWriter_ID;
  ["__tostring"] = StringWriter_Methods.get;
  ["__concat"] = function (this, str)
      str = tostring(str);
      local sw = StringWriter();
      sw.string_ = {};
      for _, v in ipairs(this.string_) do
        table.insert(sw.string_, v);
      end
      table.insert(sw.string_, str);
      sw.len_ = this.len_ + #str;
      sw.pos_ = sw.len_;
      return sw;
    end;
}

The method table itself contains all methods the StringWriter supports. It was modeled after the file class, so many methods are placeholders that do nothing. The methods that are supported are seeking and writing. Seeking simply sets an internal position value. Writing in the context of files means overwriting and extending. When the position is at the end, the contents that are to be written can simply be appended to the contents table. Otherwise, the string has to be baked, split, and recomposed.

stringwriter.lua [4.77 kB]

-- Methods for string writers
local StringWriter_Methods = {
  ["close"] = voidFunc;
  ["flush"] = voidFunc;
  ["lines"] = voidFunc;
  ["read"] = voidFunc;
  ["seek"] = function (this, base, offset)
      -- Only act on StringWriters
      if not StringWriter_Check(this) then
        return nil, "Invalid StringWriter";
      end;
      -- Default offset
      if type(base) == "number" then
        offset = base; -- Not done in file, but reasonable
      else
        offset = offset or 0;
      end
      -- Set position and return it
      if base == "set" then
        this.pos_ = math.clamp(offset,0, this.len_);
      elseif base == "end" then
        this.pos_ = math.clamp(#this.string_+offset,0, this.len_);
      else -- "cur"
        this.pos_ = math.clamp(this.pos_+offset,0, this.len_);
      end
      return this.pos_;
    end;
  ["setvbuf"] = voidFunc;
  ["write"] = function (this, ...)
      -- Only act on StringWriters
      if not StringWriter_Check(this) then return end;
      -- Concat all arguments (assuming they are valid)
      local s = table.concat({...});
      -- Concat argument string with current string
      if this.pos_ == -1 or this.pos_ == this.len_ then
        -- Just append
        table.insert(this.string_, s);
      else
        -- Insert, merge into a string
        local sFull = table.concat(this.string_);
        -- Split it up
        local sLeft = string.sub(sFull, 1, this.pos_);
        local sRight = string.sub(sFull, this.pos_+1+#s, -1)
        -- And put it back in
        this.string_ = {sLeft, s, sRight};
      end
      -- Update position
      this.pos_ = this.pos_ + #s;
      if this.pos_ > this.len_ then
        this.len_ = this.pos_;
      end;
    end;
  ["get"] = function (this)
      if not StringWriter_Check(this) then
        return nil, "Invalid StringWriter";
      else
        this.string_ = {table.concat(this.string_)};
        return this.string_[1];
      end;
    end;
}

StringWriter instances are created by a factory method. It initializes the state and sets the metatable.

stringwriter.lua [4.77 kB]

-- StringWriter factory
StringWriter = function ()
  local sw = {
    string_ = {""};
    len_  = 0;
    pos_  = 0;
  }
  setmetatable(sw, StringWriter_Meta);
  return sw;
end

I hope this code is useful for someone, use it as you wish, it is licensed under the MIT license.

Comment

Lua Table Persistence · 2010-01-27 14:56 by Black in Programming Scripts

Lua is a very flexible scripting language for embedding into programs. It’s standard API is very slim, it lacks all but basic functions. Adding them is easy though.

The persistence code here requires nothing but lua’s standard io.open for reading and writing files. It can handle loops, multiple references to the same table in both keys and values, and most standard value types.
Not supported are userdata, threads and many types of functions. Exporting simple lua functions works, but the exported byte code is not portable. The result from the export is itself lua code, it can be executed and returns data structures equivalent to those that were exported.

The core for the export is a simple recursion with a dispatcher method and writers for all types. When unsupported types are encountered, nil is written. This can cause problems on import when those unsupported values are used as table keys, but in most cases it is more desirable than to fail the export.

persistence.lua [5.50 kB]

-- Format items for the purpose of restoring
writers = {
  ["nil"] = function (file, item)
      file:write("nil");
    end;
  ["number"] = function (file, item)
      file:write(tostring(item));
    end;
  ["string"] = function (file, item)
      file:write(string.format("%q", item));
    end;
  ["boolean"] = function (file, item)
      if item then
        file:write("true");
      else
        file:write("false");
      end
    end;
  ["table"] = function (file, item, level, objRefNames)
      local refIdx = objRefNames[item];
      if refIdx then
        -- Table with multiple references
        file:write("multiRefObjects["..refIdx.."]");
      else
        -- Single use table
        file:write("{\n");
        for k, v in pairs(item) do
          writeIndent(file, level+1);
          file:write("[");
          write(file, k, level+1, objRefNames);
          file:write("] = ");
          write(file, v, level+1, objRefNames);
          file:write(";\n");
        end
        writeIndent(file, level);
        file:write("}");
      end;
    end;
  ["function"] = function (file, item)
      -- Does only work for "normal" functions, not those
      -- with upvalues or c functions
      local dInfo = debug.getinfo(item, "uS");
      if dInfo.nups > 0 then
        file:write("nil --[[functions with upvalue not supported]]");
      elseif dInfo.what ~= "Lua" then
        file:write("nil --[[non-lua function not supported]]");
      else
        local r, s = pcall(string.dump,item);
        if r then
          file:write(string.format("loadstring(%q)", s));
        else
          file:write("nil --[[function could not be dumped]]");
        end
      end
    end;
  ["thread"] = function (file, item)
      file:write("nil --[[thread]]\n");
    end;
  ["userdata"] = function (file, item)
      file:write("nil --[[userdata]]\n");
    end;
}

To be able to export tables that are referenced several times (be it a cycle in the data structure, or just one that is inserted several times), the structures that are to be written are examined first and the numbers or references to each table are counted.

All tables that have multiple references to them are created at the start in the export file before they are filled with content. This is required, since they could contain themselves or other multi-ref tables.

After all those temporary tables are created, they are filled with content. The writer for tables uses a lookup table for multi-ref tables, instead of creating the table constructor for them, they are assigned from the table created at the start. Last but not least, the passed arguments themselves are created in the same way.

persistence.lua [5.50 kB]

  store = function (path, ...)
    local file, e;
    if type(path) == "string" then
      -- Path, open a file
      file, e = io.open(path, "w");
      if not file then
        return error(e);
      end
    else
      -- Just treat it as file
      file = path;
    end
    local n = select("#", ...);
    -- Count references
    local objRefCount = {}; -- Stores reference that will be exported
    for i = 1, n do
      refCount(objRefCount, (select(i,...)));
    end;
    -- Export Objects with more than one ref and assign name
    -- First, create empty tables for each
    local objRefNames = {};
    local objRefIdx = 0;
    file:write("-- Persistent Data\n");
    file:write("local multiRefObjects = {\n");
    for obj, count in pairs(objRefCount) do
      if count > 1 then
        objRefIdx = objRefIdx + 1;
        objRefNames[obj] = objRefIdx;
        file:write("{};"); -- table objRefIdx
      end;
    end;
    file:write("\n} -- multiRefObjects\n");
    -- Then fill them (this requires all empty multiRefObjects to exist)
    for obj, idx in pairs(objRefNames) do
      for k, v in pairs(obj) do
        file:write("multiRefObjects["..idx.."][");
        write(file, k, 0, objRefNames);
        file:write("] = ");
        write(file, v, 0, objRefNames);
        file:write(";\n");
      end;
    end;
    -- Create the remaining objects
    for i = 1, n do
      file:write("local ".."obj"..i.." = ");
      write(file, (select(i,...)), 0, objRefNames);
      file:write("\n");
    end
    -- Return them
    if n > 0 then
      file:write("return obj1");
      for i = 2, n do
        file:write(" ,obj"..i);
      end;
      file:write("\n");
    else
      file:write("return\n");
    end;
    file:close();
  end;

Loading the exported data is simple, but the provided method performs some error checking.

persistence.lua [5.50 kB]

  load = function (path)
    local f, e = loadfile(path);
    if f then
      return f();
    else
      return nil, e;
    end;
  end;

I hope this code is useful for someone, use it as you wish, it is licensed under the MIT license.

Comment [1]

Source Code Management with Git · 2009-12-29 15:22 by Black in Programming

Git is a distributed SCM designed by Linus Torvalds to manage the development of the Linux Kernel. Since it’s licensed under the GPL, it can be used freely by anyone.

Just like backups are a necessity for anyone who uses a Computer (or should be…), source code management is a necessity for serious developers. Not only does it track the past state of the project (which allows tracking the introduction of bugs), but it also allows the management of separate branches. That way, development can continue to add new experimental features while production uses only stable and tested code.

Git is a distributed SCM tool, unlike CVS and Subversion it does not require a central server and by design there is no central authoritative repository. Every repository contains the full history. Every file is hashed and added to a database. Every commit contains a tree of file hashes, a commit message and a pointer to the ancestor commits. All that is hashed and added to the database, so a commit’s hash can be used to cryptographically verify the integrity of the complete previous history. For a more technical perspective on git’s inner workings, read Git for Computer Scientists (It really is quite cool in it’s simplicity). Here’s a one sided comparison of git with some alternatives.

I have started to use git beginning of 2008 for my work on ExaminationRoom, and while the start was a bit hairy, having a history of my code development as well as my comments have helped me a lot, even as only developer. I worked on three computers, so keeping the code synchronized was critical. That too was easy thanks to the SCM, even without a reachable central server (One of the computers had no internet access, it was only used to drive two Projectors for the experiments.)

I still use git these days, and can’t recommend it more. Although most other projects are World of Warcraft addons… All my public code can be cloned from my repositories

Comment

Links

Recent Articles

Navigation

Time flies · 2019-03-24 11:59 by Black in Meta Programming

SVG embedding in XHTML with TextPattern · 2010-02-10 19:20 by Black in Graphics Programming

Stereoscopic Camera for OpenGL · 2010-02-10 02:15 by Black in Graphics Programming

Camera Theory

Stereo Pairs

Implementation with OpenGL

Lua String Writer · 2010-02-04 16:35 by Black in Programming Scripts

Lua Table Persistence · 2010-01-27 14:56 by Black in Programming Scripts

Source Code Management with Git · 2009-12-29 15:22 by Black in Programming