FUDforum: comp.lang.php » preg_match() oddities and question

Home » Imported messages » comp.lang.php » preg_match() oddities and question

Show: Today's Messages :: Polls :: Message Navigator

Re: preg_match() oddities and question [message #176103 is a reply to message #176098]

Wed, 23 November 2011 18:58

Peter H. Coffin
Messages: 245
Registered: September 2010

Karma:

Senior Member

On Wed, 23 Nov 2011 19:01:14 +0100, Sandman wrote:
> In article <slrnjcpunb(dot)85q(dot)hellsop(at)nibelheim(dot)ninehells(dot)com>,
> "Peter H. Coffin" <hellsop(at)ninehells(dot)com> wrote:
>
>> On Wed, 23 Nov 2011 09:55:22 +0100, Sandman wrote:
>>
>>> Right, but your example is not a valid argument for that conclusion.
>>> My examples contained the variations of addresses that I wanted
>>> to match. Or are you saying that there is no way to use regular
>>> expressions to catch the examples I gave? Because I have a hard time
>>> believing that.
>>
>> Address-matching is a hard task. I did that for a decade professionally
>> (as part of a job, not the sole function), and it's not easy to do well
>> for even one postal system, and trying to write a generalized one is
>> basically impossible to manage in one lifetime. The best *simple* way
>> to manage it is to take a field, blow it out into individual words,
>> standardize all the words you can find without trying to sort out
>> what they are (which is the Very Hard part of that task), throw the
>> alphabetic ones into soundex or nysiis, make a loose match by a chunk of
>> postal code or city code or province, then pick the item(s) that have
>> the greatest number of matches between incoming and loose-match record
>> of the numeric and nysiis-encoded alphabetical elements. If you weight
>> things like "numeric match = 1, plaintext that's in a dictionary that
>> matches when nysiis = 2, nondictionary text that matches nysiis = 3",
>> and do that for NAME as well as ADDRESS, you get about as good as you
>> can get without buying someone else's work. And that's STILL a lot of
>> effort to write. Regexp alone for address matching is a snipe-hunt. It
>> looks obviously right and you can spend a lot of time playing with it,
>> but it ends up being a dead end.
>
> I thank you for your input, but I still maintain that my examples
> could be parsed by using a regular expression, and unless explicitly
> told so by using examples will I admit otherwise :-D

*grin* Any given (note: given) example set can be parsed with a
sufficiently complicated regexp. If your task is small enough and clean
enough, it might even not be THAT hard to accomplish. It's impossible to
provide advice about it, though, without having that complete example
set as well. The incoming data, however, is almost always going to
contain data that is not clean enough and will also probably end up
containing stuff that does not match your parsing rules, in a "because
fools are so ingenious" sense.

And, at that point, you'll want to be looking at how you handle those
exeptions: reject, pass, send for clerical review, and what those
categories mean for your process.

> No offense, though.

None to take.

--
58. If it becomes necessary to escape, I will never stop to pose
dramatically and toss off a one-liner.
--Peter Anspach's list of things to do as an Evil Overlord

Report message to a moderator

[Message index]

		preg_match() oddities and question By: Sandman on Tue, 22 November 2011 11:21
		Re: preg_match() oddities and question By: The Natural Philosoph on Tue, 22 November 2011 11:26
		Re: preg_match() oddities and question By: Sandman on Tue, 22 November 2011 11:36
		Re: preg_match() oddities and question By: Jerry Stuckle on Tue, 22 November 2011 12:22
		Re: preg_match() oddities and question By: tony on Tue, 22 November 2011 11:47
		Re: preg_match() oddities and question By: Sandman on Tue, 22 November 2011 12:12
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Tue, 22 November 2011 12:30
		Re: preg_match() oddities and question By: Sandman on Tue, 22 November 2011 12:55
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Tue, 22 November 2011 16:56
		Re: preg_match() oddities and question By: The Natural Philosoph on Tue, 22 November 2011 17:30
		Re: preg_match() oddities and question By: Peter H. Coffin on Tue, 22 November 2011 23:20
		Re: preg_match() oddities and question By: The Natural Philosoph on Tue, 22 November 2011 23:59
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Wed, 23 November 2011 00:59
		Re: preg_match() oddities and question By: Sandman on Wed, 23 November 2011 08:58
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Wed, 23 November 2011 21:02
		Re: preg_match() oddities and question By: Sandman on Thu, 24 November 2011 07:20
		Re: preg_match() oddities and question By: Denis McMahon on Thu, 24 November 2011 12:55
		Re: preg_match() oddities and question By: Sandman on Fri, 25 November 2011 08:36
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Thu, 24 November 2011 21:41
		Re: preg_match() oddities and question By: Sandman on Fri, 25 November 2011 08:26
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Fri, 25 November 2011 14:44
		Re: preg_match() oddities and question By: Sandman on Fri, 25 November 2011 15:34
		Re: preg_match() oddities and question By: Thomas 'PointedEars' on Fri, 25 November 2011 22:23
		Re: preg_match() oddities and question By: The Natural Philosoph on Wed, 23 November 2011 09:35
		Re: preg_match() oddities and question By: Sandman on Wed, 23 November 2011 08:55
		Re: preg_match() oddities and question By: Peter H. Coffin on Wed, 23 November 2011 13:53
		Re: preg_match() oddities and question By: Sandman on Wed, 23 November 2011 18:01
		Re: preg_match() oddities and question By: The Natural Philosoph on Wed, 23 November 2011 18:54
		Re: preg_match() oddities and question By: Sandman on Wed, 23 November 2011 19:23
		Re: preg_match() oddities and question By: Peter H. Coffin on Wed, 23 November 2011 18:58
		Re: preg_match() oddities and question By: Sandman on Thu, 24 November 2011 07:28
		SOLVED: Re: preg_match() oddities and question By: Sandman on Fri, 25 November 2011 09:32
		Re: SOLVED: Re: preg_match() oddities and question By: Jerry Stuckle on Fri, 25 November 2011 23:55
		Re: SOLVED: Re: preg_match() oddities and question By: Sandman on Sat, 26 November 2011 10:21

Previous Topic:	Amazing Website!!!
Next Topic:	session handler auto log out

Goto Forum:

-=] Back to Top [=-

[ Syndicate this forum (XML) ] [

]

Current Time: Fri Nov 22 08:46:48 GMT 2024

Total time taken to generate the page: 0.04961 seconds