In case anybody happens to need some C that can take a char* and turn any UTF-8 byte sequences in there into \uXXXX escapes (as I needed to; our cribbed UTF-8 encoder proved brittle), here's a little bit of code that may help. It's probably more verbose than it needs to be, but it works, and working code wins! 🙂
Just loop over the bytes in the string and push them into another string, or do like us and use Bill’s handy dandy growing buffer utility.
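For context, the snippet below is meant to live inside that loop. The names string, idx, c, clen, and buf are the ones the snippet uses; the declarations here are just my sketch of the scaffolding it assumes (buf being whatever output buffer you're using), so adjust the types to taste:

    unsigned int c = 0;   /* the decoded code point */
    int clen = 0;         /* continuation bytes left to read */
    size_t idx;

    for (idx = 0; string[idx]; idx++) {
        /* ... the snippet below goes here ... */
    }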
if ((unsigned char)string[idx] >= 0x80) { // It's a wide UTF-8 character.
    if (   (unsigned char)string[idx] >= 0xC0
        && (unsigned char)string[idx] <= 0xF4) { // We know we're starting a UTF-8 sequence
        clen = 1;
        if (((unsigned char)string[idx] & 0xF0) == 0xF0) { // It's a 4-byte character,
            clen = 3;                                      // so 3 continuation bytes.
            c = (unsigned char)string[idx] ^ 0xF0;
        } else if (((unsigned char)string[idx] & 0xE0) == 0xE0) { // This means 3 bytes.
            clen = 2;
            c = (unsigned char)string[idx] ^ 0xE0;
        } else if (((unsigned char)string[idx] & 0xC0) == 0xC0) { // And that's 2.
            clen = 1;
            c = (unsigned char)string[idx] ^ 0xC0;
        }
        for (; clen; clen--) {
            idx++; // look at the next byte
            // continuation bytes must look like 10xxxxxx; this check also
            // catches a string that ends in the middle of a sequence
            if (((unsigned char)string[idx] & 0xC0) != 0x80)
                return NULL;
            // only the last 6 bits are used for data
            c = (c << 6) | ((unsigned char)string[idx] & 0x3F);
        }
        /* Use sprintf or the like to shove the
           hex value of 'c' into, well, something.
           We have a handy growing buffer thing
           with printf format support, so we do this: */
        if (c > 0xFFFF) {
            // JSON's \u escapes are UTF-16, so anything outside the
            // Basic Multilingual Plane needs a surrogate pair
            c -= 0x10000;
            buffer_fadd(buf, "\\u%04x\\u%04x",
                0xD800 | (c >> 10), 0xDC00 | (c & 0x3FF));
        } else {
            buffer_fadd(buf, "\\u%04x", c);
        }
    } else { // Arg! It doesn't start with a valid first byte.
        return NULL;
    }
} else {
    // It's not a wide character; treat it as ASCII ...
}
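buffer_fadd() is our growing-buffer utility, so you won't have it in your tree. In case it saves someone a step, here's the same logic rolled up into one self-contained function using plain sprintf into a malloc'd string; the name uni_escape() and its interface are just made up for this sketch:

    #include <stdio.h>
    #include <stdlib.h>
    #include <string.h>

    /* Escape every non-ASCII UTF-8 sequence in 'string' as \uXXXX
       (with surrogate pairs above U+FFFF) and return a malloc'd copy.
       Returns NULL on malformed UTF-8 or malloc failure; caller frees. */
    char *uni_escape(const char *string) {
        size_t idx, out = 0;
        /* worst case: every input byte becomes a 6-char \uXXXX escape */
        char *result = malloc(strlen(string) * 6 + 1);
        if (!result)
            return NULL;

        for (idx = 0; string[idx]; idx++) {
            unsigned char byte = (unsigned char)string[idx];
            unsigned int c = 0;
            int clen = 0;

            if (byte < 0x80) {                 /* plain ASCII: copy through */
                result[out++] = string[idx];
                continue;
            }

            if (byte < 0xC0 || byte > 0xF4) {  /* not a valid first byte */
                free(result);
                return NULL;
            }

            if ((byte & 0xF0) == 0xF0)      { clen = 3; c = byte ^ 0xF0; }
            else if ((byte & 0xE0) == 0xE0) { clen = 2; c = byte ^ 0xE0; }
            else                            { clen = 1; c = byte ^ 0xC0; }

            for (; clen; clen--) {
                idx++;
                /* continuation bytes must look like 10xxxxxx */
                if (((unsigned char)string[idx] & 0xC0) != 0x80) {
                    free(result);
                    return NULL;
                }
                c = (c << 6) | ((unsigned char)string[idx] & 0x3F);
            }

            if (c > 0xFFFF) {                  /* outside the BMP: pair up */
                c -= 0x10000;
                out += sprintf(result + out, "\\u%04x\\u%04x",
                               0xD800 | (c >> 10), 0xDC00 | (c & 0x3FF));
            } else {
                out += sprintf(result + out, "\\u%04x", c);
            }
        }

        result[out] = '\0';
        return result;
    }

Call it as escaped = uni_escape(input), free() the result when you're done, and check for NULL, which means the input wasn't valid UTF-8 (or malloc failed).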
Maybe, just maybe, someone else out there won’t waste 6 hours of their life that they’ll never get back trying to do this same thing that’s been done a thousand times but never documented simply …
