Unicode Best Practices

*
Accepted Session
Short Form
Intermediate
Scheduled: Wednesday, June 19, 2013 from 3:45 – 4:30pm in B301

Excerpt

Developing applications to handle the natural languages and written scripts of the world—or even a small handful of them—is an impressively large task. Fortunately, Unicode provides tools to do just that. It’s more than just a character set, it’s a collection of standards for working with the world’s textual data. The problem is: Unicode itself is complex!

Description

This talk will help make supporting Unicode easier by providing some of the best practices for your projects—whether open source libraries, RESTful services, or web applications. We’ll briefly review Unicode and then dive into best practices for handling Unicode text in the following areas:

  • User experience
  • Collation (comparison and sorting)
  • Input, output, and logging
  • Security considerations
  • Debugging
  • Testing (unit tests and QA)

This is a programming language–agnostic talk and will cover general best practices for developing with Unicode as well as some important tips related to various popular programming languages.

Speaking experience

I’ve presented and facilitated workshops at multiple events including YAPC::NA, Pittsburgh Perl Workshop, DC–Baltimore Perl Workshop, Boston Skillshare, and local user groups. Topics have ranged from development and testing to DIY and foraging for edible wild plants. I’ll also be presenting at the Internationalization and Unicode Conference (IUC 37) in Santa Clara this October.

Speaker

  • Patch avatar

    Nova Patch

    Shutterstock

    Biography

    Nova Patch is a software engineer on the International Search team at Shutterstock, specializing in internationalization, localization, and multilingual information retrieval; and focusing on developing a search and discovery experience that supports the world’s languages, writing systems, and cultures. They are an open source developer, contributor to the Unicode CLDR, and member of the Unicode Consortium.

    Sessions

      • Title: Unicode Best Practices
      • Track: Cooking
      • Room: B301
      • Time: 3:454:30pm
      • Excerpt:

        Developing applications to handle the natural languages and written scripts of the world—or even a small handful of them—is an impressively large task. Fortunately, Unicode provides tools to do just that. It’s more than just a character set, it’s a collection of standards for working with the world’s textual data. The problem is: Unicode itself is complex!

      • Speakers: Nova Patch