Can I get a WOOT!?

Why, until next week, of course. Silly man.
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

Ben Bleything

2007-05-06 00:51:04 UTC

Permalink

Post by Tim Dysinger
RailsConf of course!!!

Oh, didn't realize that was next week. I'm out of touch with the Rails
community these days.

Have a good time!

Ben

Robby Russell

2007-05-06 03:37:51 UTC

Permalink

Post by Ben Bleything

Post by Tim Dysinger
RailsConf of course!!!

Oh, didn't realize that was next week. I'm out of touch with the Rails
community these days.
Have a good time!

Carl Lerche

2007-05-06 03:44:36 UTC

Permalink

Somebody is going to be first to show at rails conf.

Post by Robby Russell

Post by Ben Bleything

Post by Tim Dysinger
RailsConf of course!!!

Oh, didn't realize that was next week. I'm out of touch with the Rails
community these days.
Have a good time!

12 days to go... how is that next week? :-p
--
Robby Russell
Founder and Executive Director
PLANET ARGON, LLC
Ruby on Rails Development, Consulting & Hosting
www.planetargon.com
www.robbyonrails.com
+1 503 445 2457
+1 877 55 ARGON [toll free]
+1 815 642 4068 [fax]
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
EPA Rating: 3000 Lines of Code / Gallon (of coffee)

Ben Bleything

2007-05-06 05:15:31 UTC

Permalink

Post by Robby Russell
12 days to go... how is that next week? :-p

The original post said "one week to go" :) Like I said, I don't have
any idea when it is.

Ben

Tim Dysinger

2007-05-06 17:50:00 UTC

Permalink

Guess I jumped the gun. I am traveling to P-town in a week.

Post by Robby Russell

Post by Ben Bleything

Post by Tim Dysinger
RailsConf of course!!!

Oh, didn't realize that was next week. I'm out of touch with the Rails
community these days.
Have a good time!

Chris Anderson

2007-05-06 18:01:52 UTC

Permalink

WOOT for sure. I can't count anyway... weeks... days...

Anyone throwing any RailsConf afterparties? Unlike last year we'll be
in the middle of the city - our city!

Post by Tim Dysinger
Guess I jumped the gun. I am traveling to P-town in a week.

Post by Robby Russell

Post by Ben Bleything

Post by Tim Dysinger
RailsConf of course!!!

Oh, didn't realize that was next week. I'm out of touch with the Rails
community these days.
Have a good time!

_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
Chris Anderson
http://jchris.mfdz.com

Devin Ben-Hur

2007-05-08 00:37:46 UTC

Permalink

I got my membership early, but things changed and I'm no longer working
primarily with Rails and now have a scheduling conflict. I checked with
O'Reilly and they won't give me a refund, but are happy to transfer the
registration.

Anyone on the list looking for a conference registration at a discount?

--
Devin Ben-Hur 503/860-4114 mailto:***@ben-hur.net

"Startups are basically comedies, or at least seem so in retrospect."
-- Paul Graham

--
No virus found in this outgoing message.
Checked by AVG Free Edition.
Version: 7.5.467 / Virus Database: 269.6.5/793 - Release Date: 5/7/2007 2:55 PM

Grant Kruger

2007-05-08 00:58:19 UTC

Permalink

I might. Question is, is it worthwhile for a novice?

Grant

Post by Devin Ben-Hur
I got my membership early, but things changed and I'm no longer working
primarily with Rails and now have a scheduling conflict. I checked with
O'Reilly and they won't give me a refund, but are happy to transfer the
registration.
Anyone on the list looking for a conference registration at a discount?
--
"Startups are basically comedies, or at least seem so in retrospect."
-- Paul Graham
--
No virus found in this outgoing message.
Checked by AVG Free Edition.
Version: 7.5.467 / Virus Database: 269.6.5/793 - Release Date: 5/7/2007 2:55 PM
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
All the best,

Grant Kruger

Devin Ben-Hur

2007-05-08 19:59:12 UTC

Permalink

Post by Grant Kruger
I might. Question is, is it worthwhile for a novice?

I suspect it depends on the Novice :)

I go to tech conferences for three things: information, inspiration, and
making connections with people who share a common interest. The first
is the least important these days as it's relatively easy to find all
the information you need on most any topic sitting at your computer.

As a novice, you may be more interested in the Tutorial Day on May 17
the day before the core conference opens
<http://conferences.oreillynet.com/pub/w/51/tutorials.html>. This day is
a separate registration from the main conference sessions which is the
registration I'm trying to place.

--
Devin Ben-Hur 503/860-4114 mailto:***@ben-hur.net

"Startups are basically comedies, or at least seem so in retrospect."
-- Paul Graham

--
No virus found in this outgoing message.
Checked by AVG Free Edition.
Version: 7.5.467 / Virus Database: 269.6.6/794 - Release Date: 5/8/2007 2:23 PM

Grant Kruger

2007-05-08 22:02:00 UTC

Permalink

Thanks for the info. I guess I'll wait for next year, since they are 100%
sold out and have a waiting list. Truth is, since I'm still looking for
work, it is probably out of my price range right now too. Luck.

Grant

Post by Devin Ben-Hur

Post by Grant Kruger
I might. Question is, is it worthwhile for a novice?

I suspect it depends on the Novice :)
I go to tech conferences for three things: information, inspiration, and
making connections with people who share a common interest. The first
is the least important these days as it's relatively easy to find all
the information you need on most any topic sitting at your computer.
As a novice, you may be more interested in the Tutorial Day on May 17
the day before the core conference opens
<http://conferences.oreillynet.com/pub/w/51/tutorials.html>. This day is
a separate registration from the main conference sessions which is the
registration I'm trying to place.
--
"Startups are basically comedies, or at least seem so in retrospect."
-- Paul Graham
--
No virus found in this outgoing message.
Checked by AVG Free Edition.
Version: 7.5.467 / Virus Database: 269.6.6/794 - Release Date: 5/8/2007 2:23 PM
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
All the best,

Grant Kruger

Javan Makhmali

2007-05-08 22:22:53 UTC

Permalink

Hi all,

I'm using Ruby's stdlib RSS library to grab an rss feed and tuck some
information from it into a database -- essentially creating an
archive of a feed. The problem I'm having is that some html entities
(like & l s q u o ; (without the spaces) for example) in the title
and description are being mangled with strange multibyte characters
that I'll avoid pasting into this message. Does anyone know why this
happens and how I might fix / work around it?

Best,
Javan

Chris Anderson

2007-05-08 23:17:00 UTC

Permalink

Javan,

You might try using Hpricot to parse the RSS feed. It does a fine job
with all the strange characters I throw at it from HTML. Although
using Hpricot for XML is a (supported) corner case...

http://code.whytheluckystiff.net/hpricot/

At Grabb.it we're using REXML to parse RSS feeds. I haven't really put
it through the paces, but I haven't noticed problems either.

Good luck!

Chris

Post by Javan Makhmali
Hi all,
I'm using Ruby's stdlib RSS library to grab an rss feed and tuck some
information from it into a database -- essentially creating an
archive of a feed. The problem I'm having is that some html entities
(like & l s q u o ; (without the spaces) for example) in the title
and description are being mangled with strange multibyte characters
that I'll avoid pasting into this message. Does anyone know why this
happens and how I might fix / work around it?
Best,
Javan
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
Chris Anderson
http://jchris.mfdz.com

Eric Wilhelm

2007-05-09 01:22:24 UTC

Permalink

# from Javan Makhmali

like ‘ in the title
and description are being mangled with strange multibyte characters

That would be utf8.

Does anyone know why this happens

An xml parser such as expat will output utf8 instead of named character
entities for all characters which are not "<"=< and "&"=&. That
might be configurable, but it is often dictated by the xml input. I'm
not sure exactly what is under the hood of ruby's standard rss parser
but it might well be expat.

and how I might fix / work around it?

--
The opinions expressed in this e-mail were randomly generated by
the computer and do not necessarily reflect the views of its owner.
--Management
---------------------------------------------------
http://scratchcomputing.com
---------------------------------------------------

Erik Hollensbe

2007-05-09 05:41:37 UTC

Permalink

Apologies for the top-post.

Look into the NKF library that comes with ruby 1.8.5 (later
patchlevels). It has some awesome mixins into the string class that
can help you normalize your strings to utf-8. It will also set $KCODE
appropriately, which is the variable that controls your default
string encoding. This is vital. Ruby is also extremely tolerant of
malformed UTF-8, something that we can probably thank Tim Bray for.

Notes with Hpricot that may or may not still be relevant (I'm using
it for a rather large project involving utf-8 encoded HTML, and may
be using an old version - 0.5.x)

- Hpricot will preserve formatting that is not XML compliant. Be
aware of this and attempt to normalize ahead of time if necessary and
use the Hpricot::XML constructor. Libtidy does a decent job.
- Using Hpricot's built in (non-ruby) character set support is a
good way to get nothing back.
- Passing any arguments to Hpricot's constructor (other than the
content) is a good way to get malformed output back.

Really at this point, if you need something really robust and well-
tested, LibXML2 is probably a better choice and has a DOM-compliant
interface, but I don't believe the ruby support is that great. Worth
a look, if it had been an option when I started this project I'd had
been all over it. If you're an API connoisseur Hpricot is slightly
better.

Post by Eric Wilhelm
# from Javan Makhmali

like ‘ in the title
and description are being mangled with strange multibyte characters

That would be utf8.

Does anyone know why this happens

An xml parser such as expat will output utf8 instead of named
character
entities for all characters which are not "<"=< and "&"=&.
That
might be configurable, but it is often dictated by the xml input. I'm
not sure exactly what is under the hood of ruby's standard rss parser
but it might well be expat.

and how I might fix / work around it?

The best way to *properly* deal with it is to treat it as
characters and
not bytes, though that means your database layer, string objects, and
output layer all need to understand characters to some extent (of
course, low-byte ascii is a subset of utf8, so you could just flag
anything loaded from bag-o-bytes storage as characters and
generally be
on your merry way.) If you're outputting to a browser, the doctype
should be utf8, etc, etc.
The improper way to deal with it is to strip them, though that can be
difficult to do on the encoded end if all you have is bytes (you
basically have to implement utf8 yourself :-) Alternatively, you could
s/&[^;]+;/thbbt/g on the front-end or other similarly hackish
workarounds.
Have fun.
--Eric
--
The opinions expressed in this e-mail were randomly generated by
the computer and do not necessarily reflect the views of its owner.
--Management
---------------------------------------------------
http://scratchcomputing.com
---------------------------------------------------
_______________________________________________
PDXRuby mailing list
IRC: #pdx.rb on irc.freenode.net
http://lists.pdxruby.org/mailman/listinfo/pdxruby

--
Erik Hollensbe
***@hollensbe.org