BLU Discuss list archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Discuss] Seeking information on binaries called "entities" and "fixup"
- Subject: [Discuss] Seeking information on binaries called "entities" and "fixup"
- From: greg at freephile.com (Greg Rundlett (freephile))
- Date: Sat, 2 Aug 2014 09:15:07 -0400
- In-reply-to: <53DC7A46.6060106@gmail.com>
- References: <53D5BE9D.7030100@horne.net> <CA+h9Qs5thgns2LDziQcRmDkzeQjA_2MYj1kDwcsGQjcu0-rOxg@mail.gmail.com> <53D681EA.10008@horne.net> <53D6C1BB.2070301@gmail.com> <53D7F8B1.8050206@horne.net> <53D81ADA.5000600@gmail.com> <53DB0722.6040200@horne.net> <CANaytceVfwikNUbBVUUB5bO2FDLtujpJA3ECtevDO84Jx3Td4Q@mail.gmail.com> <20140802014145.3969C11FB@telecom.csail.mit.edu> <53DC7A46.6060106@gmail.com>
In the 'strings' output, I noticed a typo - perfect for searching code because you hope people don't make too many of the SAME typos across multiple projects. What I noticed is that one of the entities was &pount; (instead of £) When searching OHLOH (now Black Duck), it found 177 matches. Sadly their engine does not search literal strings even when they say they do... they ignore the & and the ; Still, it's a really good start at a very small haystack for finding the original source of this code. https://code.ohloh.net/search?s=%26pount Then I added the term brvbar in a compound search, but that turned up zero results - which means that this original source is not found in their database. On Sat, Aug 2, 2014 at 1:42 AM, Tom Metro <tmetro+blu at gmail.com> wrote: > What's the big picture that you are trying to accomplish with the > script? Presenting the Telecom Digest on a web site? Maybe an > off-the-shelf mail archiving tool is a better way to go? (Like MHonArc.) > Agreed, if you just want to put the Digest on the web, then I'd feed it to MHonArc, or similar. Out of curiosity, I looked at the MHonArc sources to see if there were such files (entities and fixup), but no. MHonArc is written in Perl. That said, there is quite a bit of code to deal with sanitizing or escaping content for conversion to HTML for display. To wit: http://www.mhonarc.org/release/MHonArc/latest/lib/mhtxtplain.pl http://www.mhonarc.org/release/MHonArc/latest/lib/mhtxtenrich.pl http://www.mhonarc.org/release/MHonArc/latest/lib/mhtxthtml.pl And, even with that, there is a big security warning on the homepage for yet another script injection vulnerability. That's because MHonArc doesn't fully parse the HTML, nor does it use a white-listing approach. Basically, with MHonArc, you should not enable HTML in your mail archives. Despite the obvious "weakness" of MHonArc, I'm simply pointing out the obvious that rolling your own solution that mimics MHonArc's feature set would be quite a project. > > What language is the script written in? 'fixup' remains a mystery, > Fixup is a mystery, but at least one thing it probably did was deal with angle brackets and keywords. > but > you probably know enough about 'entities' to replace it with some > in-line code. Greg gave you a PHP example. It could be done as a > one-liner in Perl, with the assistance of a module. > Greg Rundlett http://eQuality-Tech.com http://freephile.org
- References:
- [Discuss] Seeking information on binaries called "entities" and "fixup"
- From: greg at freephile.com (Greg Rundlett (freephile))
- [Discuss] Seeking information on binaries called "entities" and "fixup"
- From: bill at horne.net (Bill Horne)
- [Discuss] Seeking information on binaries called "entities" and "fixup"
- From: tmetro+blu at gmail.com (Tom Metro)
- [Discuss] Seeking information on binaries called "entities" and "fixup"
- Prev by Date: [Discuss] Seeking information on binaries called "entities" and "fixup"
- Next by Date: [Discuss] Seeking information on binaries called "entities" and "fixup"
- Previous by thread: [Discuss] Seeking information on binaries called "entities" and "fixup"
- Next by thread: [Discuss] Seeking information on binaries called "entities" and "fixup"
- Index(es):