Search Results

Search found 3804 results on 153 pages for 'regex'.


php regular expression to match specific url pattern

- by zen

I'd like to "grab" a few hundred urls from a few hundred html pages. Pattern: <h2><a href="http://www.the.url.might.be.long/urls.asp?urlid=1" target="_blank">The Website</a></h2>

what is the return value of BeautifulSoup.find ?

- by prosseek

I run the following to get a value as score: score = soup.find('div', attrs={'class' : 'summarycount'}) When I run 'print score' I get: <div class="summarycount">524</div> I need to extract the number part. I used the re module but it failed: m = re.search("[^\d]+(\d+)", score) TypeError: expected string or buffer function search in re.py at line 142 return _compile(pattern, flags).search(string) What is the return type of the find function? How do I get the number from the score variable? Is there an easy way to make BeautifulSoup return the value (in this case 524) itself?

What regular expression would strip out all attributes from a BR tag?

- by Edward Tanguay

What C# regular expression would replace all of these: <BR style=color:#93c47d> <BR style=color:#fefefe> <BR style="color:#93c47d"> <BR style="color:#93c47d ..."> <BR> <BR/> <br style=color:#93c47d> <br style=color:#fefefe> <br style="color:#93c47d"> <br style="color:#93c47d ..."> <br> <br/> with: <br/> basically "remove all attributes from any BR element and lowercase it".
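
The question asks for C#, but the idea is the same in most regex flavours, so here is a minimal sketch in Python. It assumes no attribute value contains a literal `>`; the names `BR_TAG` and `normalize_br` are made up for illustration.

```python
import re

# Match "<br", a word boundary (so "<break>" is left alone), any attributes, then ">".
# IGNORECASE makes the same pattern cover <BR ...> and <br ...>.
BR_TAG = re.compile(r"<br\b[^>]*>", re.IGNORECASE)


def normalize_br(html: str) -> str:
    """Replace every BR tag, with or without attributes, by a bare lowercase <br/>."""
    return BR_TAG.sub("<br/>", html)


sample = '<BR style=color:#93c47d> <BR style="color:#93c47d ..."> <BR> <br/> <br style=color:#fefefe>'
print(normalize_br(sample))
```

The same pattern and the ignore-case option should carry over to a .NET `Regex.Replace` call more or less unchanged.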

A "smart" (forgiving) date parser?

- by jdmuys

I have to migrate a very large dataset from one system to another. One of the "source" columns contains a date but is really a string with no constraint, while the destination system mandates a date in the format yyyy-mm-dd. Many, but not all, of the source dates are formatted as yyyymmdd. So to coerce them to the expected format, I do (in Perl): return "$1-$2-$3" if ($val =~ /(\d{4})[-\/]*(\d{2})[-\/]*(\d{2})/); The problem arises when the source dates move away from the "generic" yyyymmdd. The goal is to salvage as many dates as possible before giving up. Example source strings include: 21/3/1998, March 2004, 2001, 3/4/97. I can try to match as many of the examples as I can find with a succession of regular expressions such as the one above. But is there something smarter to do? Am I reinventing the wheel? Is there a library somewhere doing something similar? I couldn't find anything relevant googling "forgiving date parser". (Any language is OK.)
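
The "succession of regular expressions" idea could look roughly like the Python sketch below. It is not a real forgiving parser: it assumes slash-separated dates are day-first, does not range-check months or days, and gives up (returns `None`) on inputs such as "March 2004", "2001" or "3/4/97" that lack a day or a four-digit year. The function name `coerce_date` is hypothetical.

```python
import re
from typing import Optional


def coerce_date(val: str) -> Optional[str]:
    """Try a few known layouts in turn; return yyyy-mm-dd or None."""
    val = val.strip()

    # yyyymmdd, yyyy-mm-dd or yyyy/mm/dd
    m = re.fullmatch(r"(\d{4})[-/]?(\d{2})[-/]?(\d{2})", val)
    if m:
        return "-".join(m.groups())

    # d/m/yyyy (assumption: day comes first)
    m = re.fullmatch(r"(\d{1,2})/(\d{1,2})/(\d{4})", val)
    if m:
        day, month, year = m.groups()
        return f"{year}-{int(month):02d}-{int(day):02d}"

    # Anything else is left for manual review.
    return None


for raw in ["19980321", "21/3/1998", "March 2004", "2001", "3/4/97"]:
    print(raw, "->", coerce_date(raw))
```

Ordering the patterns from strictest to loosest keeps an early, confident match from being overridden by a looser guess.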

return empty string from preg_split

- by Gutzofter

Right now I'm trying to get this: Array ( [0] => hello [1] => [2] => goodbye ) where index 1 is the empty string. $toBeSplit = 'hello,,goodbye'; $textSplitted = preg_split('/[,]+/', $toBeSplit, -1); But $textSplitted looks like this: Array ( [0] => hello [1] => goodbye ) I'm using PHP 5.3.2
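
The behaviour comes from the `+` quantifier, which treats a run of commas as a single separator. A small Python sketch of the same effect (Python's `re.split` stands in for `preg_split` here; the variable names are illustrative):

```python
import re

to_be_split = "hello,,goodbye"

# "[,]+" swallows both commas as one separator, so no empty field appears.
collapsed = re.split(r"[,]+", to_be_split)

# A single comma as the separator keeps the empty string between the two commas.
kept = re.split(r",", to_be_split)

print(collapsed)
print(kept)
```

Dropping the `+` from the PHP pattern should have the corresponding effect, provided the `PREG_SPLIT_NO_EMPTY` flag is not set.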

Java split regular expression

- by Danny King

Hi all, If I have a string, e.g. setting=value How can I remove the '=' and turn that into two separate strings containing 'setting' and 'value' respectively? Thanks very much!
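
For a single fixed separator no regular expression is strictly needed. A sketch in Python rather than Java, with a hypothetical helper name `parse_setting`; in Java, `String.split` with a limit of 2 plays a similar role.

```python
from typing import Tuple


def parse_setting(line: str) -> Tuple[str, str]:
    """Split on the first '=' only, so values that contain '=' stay intact."""
    key, _sep, value = line.partition("=")
    return key, value


print(parse_setting("setting=value"))
```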

Repeated regular expression

- by javaguy

How can I parse strings like: name1="val1" name2="val2" name3="val3" I cannot use split(\s+) because an entry can be name = "val 1". I am using Java, but any language is okay.
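
One way to handle this is to match each pair repeatedly instead of splitting. A Python sketch, assuming values never contain an escaped double quote; `PAIR` and `parse_pairs` are illustrative names.

```python
import re
from typing import List, Tuple

# A name, optional spaces around "=", then a double-quoted value.
PAIR = re.compile(r'(\w+)\s*=\s*"([^"]*)"')


def parse_pairs(text: str) -> List[Tuple[str, str]]:
    """Return every (name, value) pair found in the text, in order."""
    return PAIR.findall(text)


print(parse_pairs('name1="val1" name2 = "val 2" name3="val3"'))
```

In Java the same pattern can be applied with `Matcher.find()` in a loop.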

Extract multiple values from one column in MySql

- by Neil

I've noticed that MySql has an extensive search capacity, allowing both wildcards and regular expressions. However, I'm somewhat in a bind since I'm trying to extract multiple values from a single string in my select query. For example, if I had the text "<span>Test</span> this <span>query</span>", perhaps using regular expressions I could find and extract values "Test" or "query", but in my case, I have potentially n such strings to extract. And since I can't define n columns in my select statement, that means I'm stuck. Is there any way I could have a list of values (ideally separated by commas) of any text contained within span tags? In other words, if I ran this query, I would get "Test,query" as the value of spanlist: select <insert logic here> as spanlist from HtmlPages ...

How can I replace a line which contains only -------- by |||

- by mimou

I have something like:

------------------------------------------------------------------------
r2 | username | 2011-01-16 16:52:23 +0100 (Sun, 16 Jan 2011) | 1 line
Changed paths:
   D /foo
Removed foo
------------------------------------------------------------------------
r1 | username | 2011-01-16 16:51:03 +0100 (Sun, 16 Jan 2011) | 1 line
Changed paths:
   A /foo
created foo
------------------------------------------------------------------------

My target is to identify the file added by "username" on a specific date. Thus, I need the combination (username, 16 Jan 2011, A) to ensure that it is the right file, and then print foo. My idea is to:

- delete the white spaces
- change the newlines into |
- get rid of the --------------- and replace them with newlines

but the problem is that I couldn't replace the ------- since they are mixed with other characters.

------------------------------------------------------------------------
|r2|username|2011-01-1616:52:23+0100(Sun,16Jan2011)|1line|Changedpaths:|D/foo|Removedfoo|
------------------------------------------------------------------------
|r1|username|2011-01-1616:51:03+0100(Sun,16Jan2011)|1line|Changedpaths:|A/foo|createdfoo|
------------------------------------------------------------------------

So I thought it would be a good idea to start by replacing the --------------- with a special character like ||| and then change this character into a newline using awk FS=||| OFS=\n. Can anyone help me? Thanks.
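
An alternative to rewriting the separators is to split the log on the dashed lines and inspect each entry. The Python sketch below assumes the line layout shown in `SAMPLE_LOG` (one header line, then path lines with an action letter) and takes the date in ISO form (2011-01-16) rather than "16 Jan 2011"; all names are hypothetical.

```python
import re
from typing import List

SAMPLE_LOG = """\
------------------------------------------------------------------------
r2 | username | 2011-01-16 16:52:23 +0100 (Sun, 16 Jan 2011) | 1 line
Changed paths:
   D /foo
Removed foo
------------------------------------------------------------------------
r1 | username | 2011-01-16 16:51:03 +0100 (Sun, 16 Jan 2011) | 1 line
Changed paths:
   A /foo
created foo
------------------------------------------------------------------------
"""

# A line made only of dashes separates two log entries.
SEPARATOR = re.compile(r"^-{10,}[ \t]*$", re.MULTILINE)
# Header line: revision | user | date time zone (...) | n line(s)
HEADER = re.compile(
    r"^r\d+ \| (?P<user>[^|]+?) \| (?P<date>\d{4}-\d{2}-\d{2}) ", re.MULTILINE
)
# Path line with action "A" (added); the path is assumed to start with "/".
ADDED = re.compile(r"^[ \t]*A (?P<path>/\S*)", re.MULTILINE)


def added_paths(log: str, user: str, date: str) -> List[str]:
    """Return the paths added by `user` on `date` (yyyy-mm-dd)."""
    found = []
    for entry in SEPARATOR.split(log):
        header = HEADER.search(entry)
        if header and header.group("user") == user and header.group("date") == date:
            found.extend(m.group("path") for m in ADDED.finditer(entry))
    return found


print(added_paths(SAMPLE_LOG, "username", "2011-01-16"))
```

Working entry by entry avoids the step where whitespace is deleted, which is what makes the dashed lines hard to tell apart afterwards.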

Use Regular expression with fileinput

- by chrissygormley

Hello, I am trying to replace a variable stored in another file using a regular expression. The code I have tried is: r = re.compile(r"self\.uid\s*=\s*('\w{12})'") for line in fileinput.input(['file.py'], inplace=True): print line.replace(r.match(line), sys.argv[1]), The format of the variable in the file is: self.uid = '027FC8EBC2D1' I am trying to pass in a parameter in this format, use a regular expression to verify that sys.argv[1] is in the correct format, find the variable stored in this file, and replace it with the new value. Can anyone help? Thanks.
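
Two things stand out in the attempt: `r.match(line)` returns a match object (or `None`), not a string, and `match` only matches at the start of the line, so an indented assignment is missed. A sketch of one way to do the per-line substitution, written for Python 3 with hypothetical names (`UID_LINE`, `VALID_UID`, `replace_uid`):

```python
import re

# Capture the text before and after the 12-character value so it can be kept.
UID_LINE = re.compile(r"(self\.uid\s*=\s*')\w{12}(')")
# Assumed format of a valid uid: exactly 12 word characters.
VALID_UID = re.compile(r"\w{12}")


def replace_uid(line: str, new_uid: str) -> str:
    """Return the line with the uid value replaced; other lines pass through unchanged."""
    if not VALID_UID.fullmatch(new_uid):
        raise ValueError("uid must be exactly 12 word characters")
    # A function as replacement avoids backreference clashes with digits in new_uid.
    return UID_LINE.sub(lambda m: m.group(1) + new_uid + m.group(2), line)


print(replace_uid("        self.uid = '027FC8EBC2D1'\n", "AAAABBBBCCCC"), end="")
```

Inside a `fileinput.input(['file.py'], inplace=True)` loop, the result would be written back with `sys.stdout.write(replace_uid(line, sys.argv[1]))`.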

Extract a regular expression match in R version 2.10

- by tovare

Hi, I'm trying to extract a number from a string, i.e. apply something like [0-9]+ to the string "aaaa12xxxx" and get "12". I thought it would be something like: > grep("[0-9]+","aaa12xxx", value=TRUE) [1] "aaa12xxx" And then I figured... > sub("[0-9]+", "\\1", "aaa12xxxx") [1] "aaa12xxx" But I got some form of response doing: > sub("[0-9]+", "ARGH!", "aaa12xxxx") [1] "aaaARGH!xxx" There's a small detail I'm missing. Please advise :-) I'm using R version 2.10.1 (2009-12-14). Thanks! Comments on the solution: The best solution is to ignore the standard functions and install Hadley Wickham's stringr package to get something that actually makes sense. Kudos to Marek for figuring out how the standard library worked.

PHP RegExpr error Unknown modifier '('

- by Anonymous

I have this regular expression: ([http://some.url.com/index.php?showtopic=\"]*)([0-9]+(?:\.[0-9]*)?) It's for extracting links to topics from a forum. Now when I use it in my script, $url = "([http://some.url.com/index.php?showtopic=\"]*)([0-9]+(?:\.[0-9]*)?)"; preg_match_all spits out: "Unknown modifier '('" Can anyone help me with this obviously stupid problem?

How do I match and return a zero with a regular expression in javascript?

- by Hobhouse

I am trying to extract a number from a string. The number might be zero. Numbers appear like this: '+123', '-8' or '0'. alert( '+123'.match(/[-?|+?]\d+/) ); alerts +123 alert( '-8'.match(/[-?|+?]\d+/) ); alerts -8 alert( '0'.match(/[-?|+?]\d+/) ); alerts null // why oh why? How do I get '0'.match(/[-?|+?]\d+/) to return 0 instead of null?
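
Inside square brackets, `?` and `|` are literal characters, so `[-?|+?]` demands exactly one of `-`, `?`, `|` or `+` before the digits. A bare '0' has no such character in front of it, hence `null`. Making the sign optional with a quantifier outside the class fixes this; the sketch below shows it in Python, and the pattern itself is the same in JavaScript (`/[-+]?\d+/`). The helper name is made up.

```python
import re
from typing import Optional

# Optional sign, then one or more digits.
SIGNED_INT = re.compile(r"[-+]?\d+")


def extract_number(text: str) -> Optional[str]:
    """Return the first optionally signed integer in the text, or None."""
    m = SIGNED_INT.search(text)
    return m.group() if m else None


for s in ["+123", "-8", "0"]:
    print(s, "->", extract_number(s))
```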

regular expression that ignores case of first character

- by Richard

I know JavaScript regular expressions can ignore case for the entire match, but what about just the first character? Then tuesday would match Tuesday but not TUESDAY.
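
A character class for the first letter is one simple way to get this. The sketch is in Python, but `/^[Tt]uesday$/` should behave the same way in JavaScript; `is_tuesday` is an illustrative name.

```python
import re

# Only the first character may be either case; the rest must be lowercase.
TUESDAY = re.compile(r"[Tt]uesday")


def is_tuesday(word: str) -> bool:
    """True for 'tuesday' and 'Tuesday', False for 'TUESDAY'."""
    return TUESDAY.fullmatch(word) is not None


for w in ["tuesday", "Tuesday", "TUESDAY"]:
    print(w, is_tuesday(w))
```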

How do I create a regular expression to match a word misspelling the original case sensitivity?

- by Patrick Allaert

I want to discover wrong spellings of "FooBar" in the sentence: "This is a 'FooBar' example where I should match different spelling of fooBar such as: foobar, FOOBAR or even fOoBaR but not foobarS!" In this sentence, I would like to match the words (in order): fooBar, foobar, FOOBAR, fOoBaR and not: FooBar (correct spelling), foobarS (not the same word). Is there an existing solution using a Perl regular expression? This is intended to be used with grep -P. Thanks
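
One approach is a case-insensitive match guarded by a case-sensitive negative lookahead that rejects the correct spelling. The sketch below uses Python (3.6 or later, for the scoped `(?i:...)` group) rather than `grep -P`, so the exact syntax may need adjusting for PCRE; `MISSPELLED` is an illustrative name.

```python
import re

# \b(?!FooBar\b)  : reject the exact, correctly cased word
# (?i:foobar)\b   : otherwise accept any casing of the whole word
MISSPELLED = re.compile(r"\b(?!FooBar\b)(?i:foobar)\b")

sentence = (
    "This is a 'FooBar' example where I should match different spelling of "
    "fooBar such as: foobar, FOOBAR or even fOoBaR but not foobarS!"
)

matches = MISSPELLED.findall(sentence)
print(matches)
```

The trailing `\b` is what excludes "foobarS", since no word boundary exists between "r" and "S".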

any excellent Python 're' tutorial

- by Tshepang

I read through the official regular expression howto which wasn't gentle enough for me. Is there anything better/easier out there?

How to remove the first & last slash from a url if it exists?

- by David

Hi, I use parse_url() to get the path of a url, but I want to remove the first slash and the last one if they exist. Example: "/posts/funny-dog/" -> "posts/funny-dog" "/" -> "" // Because the path of http://www.google.com/ is a slash "/categories/politics" -> "categories/politics" Thanks
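
A sketch of the trimming step in Python (the question is about PHP, where `trim($path, '/')` is the usual shortcut, although it strips repeated slashes too). The function name `trim_slashes` is hypothetical.

```python
import re


def trim_slashes(path: str) -> str:
    """Remove at most one leading and one trailing slash."""
    return re.sub(r"^/|/$", "", path)


for p in ["/posts/funny-dog/", "/", "/categories/politics"]:
    print(repr(p), "->", repr(trim_slashes(p)))
```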

Modify a reference numbered group to match against

- by StuperUser

I want to match YYYY-YY for sequential years. At the moment I'm trying to match cases where the second YY is the 3rd and 4th characters of YYYY with 1 added to it. So far I've got {19|20}(\d{2})-(\d{2}), but I'm not sure how to use ? with reference to (1), or whether I'm going about this the right way, or which inevitable "unknown unknowns" (like YY99) I will run into with this approach. Edit: Matches: 2010-11, 2011-12, 2029-30. Not matches: 2010-12, 2010-09, 2011-2, 2011-2012
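
A regular expression cannot add 1 to a captured group, so a common workaround is to let the regex check the shape and do the arithmetic in code. A Python sketch under one loud assumption: a year ending in 99 is allowed to roll over to 00 (e.g. 1999-00). The names are illustrative.

```python
import re

# (19|20) in parentheses is an alternation; the braces in the attempt are not.
YEAR_RANGE = re.compile(r"(19|20)(\d{2})-(\d{2})")


def is_sequential(text: str) -> bool:
    """True when text is YYYY-YY and the second part is the following year."""
    m = YEAR_RANGE.fullmatch(text)
    if not m:
        return False
    start, end = int(m.group(2)), int(m.group(3))
    # Assumption: 99 wraps around to 00.
    return (start + 1) % 100 == end


for s in ["2010-11", "2011-12", "2029-30", "2010-12", "2010-09", "2011-2", "2011-2012"]:
    print(s, is_sequential(s))
```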

With PHP preg_match_all, get value of href

- by Jeremy Dicaire

Hi, I don't really understand how regular expressions work, even after reading this tutorial: http://www.webcheatsheet.com/php/regular_expressions.php Here is what I need to find: And it should return: http://link Here is what I tried: $find = preg_match_all('/<link type="text/html" rel="alternate" href=".*',$file,$patterns2); You can laugh :) Thanks in advance for your help and your time :)

Regular Expression Question

- by zyq524

I'm trying to use regular expression to extract the comments in the heading of a file. For example, the source code may look like: //This is an example file. //Please help me. #include "test.h" int main() //main function { ... } What I want to extract from the code are the first two lines, i.e. //This is an example file. //Please help me. Any idea?
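
One possible pattern anchors at the start of the file and consumes consecutive `//` lines, stopping at the first line that is not a comment. A Python sketch, assuming the header uses only `//` comments (no `/* ... */` blocks and no blank lines in between); `leading_comments` is a made-up name.

```python
import re

# One or more lines that start (after optional indentation) with "//".
HEADER_COMMENTS = re.compile(r"(?:[ \t]*//[^\n]*\n?)+")

source = (
    "//This is an example file.\n"
    "//Please help me.\n"
    '#include "test.h"\n'
    "int main() //main function\n"
    "{\n"
    "}\n"
)


def leading_comments(text: str) -> str:
    """Return the block of // comment lines at the very top of the text."""
    m = HEADER_COMMENTS.match(text)  # match() anchors at the beginning
    return m.group() if m else ""


print(leading_comments(source), end="")
```

Because `match` only tries position 0, the trailing `//main function` comment further down is never picked up.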

python: multiline regular expression

- by facha

Hi, everyone I have a piece of text and I've got to parse usernames and hashes out of it. Right now I'm doing it with two regular expressions. Could I do it with just one multiline regular expression? #!/usr/bin/env python import re test_str = """ Hello, UserName. Please read this looooooooooooooooong text. hash Now, write down this hash: fdaf9399jef9qw0j. Then keep reading this loooooooooong text. Hello, UserName2. Please read this looooooooooooooooong text. hash Now, write down this hash: gtwnhton340gjr2g. Then keep reading this loooooooooong text. """ logins = re.findall('Hello, (?P<login>.+).',test_str) hashes = re.findall('hash: (?P<hash>.+).',test_str)
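
A single pattern can capture both values if `.` is allowed to cross line breaks (`re.DOTALL`) and the gap between them is matched lazily. In the sketch below the line breaks inside `test_str` are an assumption, since the excerpt above is flattened; `PAIRS` is an illustrative name.

```python
import re

test_str = """
Hello, UserName. Please read this looooooooooooooooong text. hash
Now, write down this hash: fdaf9399jef9qw0j. Then keep reading this loooooooooong text.
Hello, UserName2. Please read this looooooooooooooooong text. hash
Now, write down this hash: gtwnhton340gjr2g. Then keep reading this loooooooooong text.
"""

# \w+ stops at the full stop; ".*?" lazily skips ahead to the next "hash: ".
PAIRS = re.compile(r"Hello, (?P<login>\w+)\..*?hash: (?P<hash>\w+)\.", re.DOTALL)

pairs = PAIRS.findall(test_str)
print(pairs)
```

Escaping the final dot (`\.`) matters as well: in the original patterns the trailing `.` matches any character, and the greedy `.+` runs to the end of the line.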

preg_replace pass match through function before replacing

- by Martin

This is what I want to do: $line = 'blabla translate("test") blabla'; $line = preg_replace("/(.*?)translate\((.*?)\)(.*?)/","$1".translate("$2")."$3",$line); So the result should be that translate("test") is replaced with the translation of "test". The problem is that translate("$2") passes the string "$2" to the translate function. So translate() tries to translate "$2" instead of "test". Is there some way to pass the value of the match to a function before replacing?
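
The usual answer is a callback-based replace: the function receives each match and returns the replacement text. PHP provides `preg_replace_callback` for this; the sketch below shows the same idea in Python, with a stand-in `translate` function and a made-up dictionary.

```python
import re

# Hypothetical translation table, only for the demonstration.
TRANSLATIONS = {"test": "prueba"}


def translate(text: str) -> str:
    """Stand-in for the real translate() function."""
    return TRANSLATIONS.get(text, text)


# Capture what sits between the quotes of translate("...").
CALL = re.compile(r'translate\("(.*?)"\)')


def expand(line: str) -> str:
    """Replace each translate("x") call with the translation of x."""
    return CALL.sub(lambda m: translate(m.group(1)), line)


print(expand('blabla translate("test") blabla'))
```

Matching only the `translate(...)` call, rather than the text around it, also removes the need for the `$1` and `$3` groups.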

Why regular expression's "non-capturing" group is not working

- by Jim Horng

In my snippet below, the non-capturing group "(?:aaa)" should be ignored in matching result, so the result should be "_bbb" only. However, I get "aaa_bbb" in matching result; only when I specify group(2) does it show "_bbb". import re string1 = "aaa_bbb" print(re.match(r"(?:aaa)(_bbb)", string1).group()) >>> aaa_bbb
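
A non-capturing group still takes part in the match; it only declines to create a numbered group. `group()` with no argument is `group(0)`, the whole match, so "aaa" is included. A small check of what each call returns for the pattern above, where the single capturing group is number 1:

```python
import re

m = re.match(r"(?:aaa)(_bbb)", "aaa_bbb")

whole = m.group()      # same as m.group(0): everything the pattern consumed
first = m.group(1)     # the only capturing group
captured = m.groups()  # all capturing groups as a tuple

print(whole, first, captured)
```

To leave "aaa" out of the match itself, a lookbehind such as `(?<=aaa)_bbb` used with `re.search` is one option.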

Redirect visitor with .htaccess

- by Aaron

Hi all, I've got an e-shop on a virtual server that's been used as a subdirectory for the last few years, but now I'm finally giving the VS its own domain name. What I really need is for visitors to the old URL to be transparently (and with a 301) redirected to the new URL, with everything after /eshop/ maintained and appended to the new host. I.e. http://www.example.com/eshop/page.php -> http://www.newdomain.com/page.php Any help would be greatly appreciated.

ignoring folders in mercurial

- by damian

Caveat: I tried all the possibilities listed here: http://stackoverflow.com/questions/254002/how-can-i-ignore-everything-under-a-folder-in-mercurial. None works as I hoped. I want to ignore everything under the folder test, but not ignore srcProject\test\TestManager. I tried: syntax: glob test/** And it ignores both test and srcProject\test\TestManager. With: syntax: regexp ^/test/ it's the same thing. Also with: syntax: regexp test\\* I have installed TortoiseHG 0.4rc2 with Mercurial-626cb86a6523+tortoisehg, Python-2.5.1, PyGTK-2.10.6, GTK-2.10.11 on Windows
