BLU Discuss list archive

[Discuss] "M-" notation

Subject: [Discuss] "M-" notation
From: gaf at blu.org (Jerry Feldman)
Date: Fri, 06 Jul 2012 14:17:25 -0400
In-reply-to: <20120706180315.GG11861@dragontoe.org>
References: <alpine.DEB.2.00.1207051957400.8100@sas1.nber.org> <4FF632ED.3090707@gmail.com> <Pine.GSO.4.64.1207060649410.16194@nber6> <4FF6EEC4.7030007@gmail.com> <4FF703E3.7050301@blu.org> <4FF721AF.7080002@gmail.com> <4FF7254C.4030709@blu.org> <20120706180315.GG11861@dragontoe.org>

On 07/06/2012 02:03 PM, Derek Martin wrote:
> On Fri, Jul 06, 2012 at 01:50:04PM -0400, Jerry Feldman wrote:
>> On 07/06/2012 01:34 PM, Richard Pieri wrote:
>>> On 7/6/2012 11:27 AM, Jerry Feldman wrote:
>>>> I've found that sed(1) tends to work well for me in my scripts. What I
>>>> do in the scripts is something like:
>>> sed works, too.  I find tr to be easier/quicker to use than sed for
>>> simple transformations.  I use sed for more significant edits.
>>>
>> Someone mentioned unicode. There are a number of unicode to ascii
>> converters.
> Can you elaborate?  I can't see how this would work unless the
> "Unicode" file contained only a subset of Unicode which corresponds to
> the 7-bit ASCII character set... in which case the Unicode version of
> the file will be identical to the ASCII version of the file, possibly
> save for a 3-byte encoding marker (which is optional and largely
> unnecessary) at the beginning of the file.
>
Normally, we think of Unicode as 16-bit (UTF-16), but it can also be
encoded as UTF-7 or UTF-8. A true ASCII file is 7-bit. It has been a
while since I have played with encodings, but you certainly can express
Unicode in ASCII by encoding the exceptions (the non-ASCII characters)
as escape sequences.
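
As a rough sketch of the escape idea, here is one way it could look in
Python. The function names are made up for illustration, and the
unicode_escape codec is just Python's own escaping scheme, not a claim
about what any particular converter does. The last helper illustrates
the point quoted above: pure-ASCII text has the same bytes in UTF-8,
apart from an optional 3-byte marker.

```python
def to_ascii_escaped(text: str) -> bytes:
    """Encode text as 7-bit ASCII bytes.

    Characters outside ASCII become \\xNN, \\uNNNN or \\UNNNNNNNN escapes;
    literal backslashes and control characters are escaped as well, so the
    result can be decoded back without ambiguity.
    """
    return text.encode("unicode_escape")


def from_ascii_escaped(data: bytes) -> str:
    """Reverse to_ascii_escaped by interpreting the backslash escapes."""
    return data.decode("unicode_escape")


def utf8_matches_ascii(text: str) -> bool:
    """Return True if the UTF-8 bytes of text equal its ASCII bytes.

    This holds exactly when text contains only 7-bit ASCII characters.
    """
    try:
        return text.encode("utf-8") == text.encode("ascii")
    except UnicodeEncodeError:
        # text contains a character that ASCII cannot represent
        return False


if __name__ == "__main__":
    sample = "caf\u00e9 costs \u20ac5"  # contains e-acute and the euro sign
    escaped = to_ascii_escaped(sample)
    print(escaped)                        # every byte is below 128
    print(from_ascii_escaped(escaped) == sample)
    print(utf8_matches_ascii("plain text"), utf8_matches_ascii(sample))
    # "utf-8-sig" prepends the optional 3-byte marker (EF BB BF)
    print("plain text".encode("utf-8-sig")[:3])
```

The escaped output is longer than the original and only readable by
something that knows the escaping convention, so it is a transport
trick rather than a real conversion to ASCII.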


In any case, sed or tr work fine when dealing with the normal ASCII text
files we see on Linux.

-- 
Jerry Feldman <gaf at blu.org>
Boston Linux and Unix
PGP key id:3BC1EB90 
PGP Key fingerprint: 49E2 C52A FC5A A31F 8D66  C0AF 7CEA 30FC 3BC1 EB90

BLU is a member of BostonUserGroups
We also thank MIT for the use of their facilities.

Boston Linux & Unix / webmaster@blu.org