Daily Archives

Articles indexed Thursday May 13 2010

Page 78/119 | < Previous Page | 74 75 76 77 78 79 80 81 82 83 84 85 | Next Page >

Naive Bayesian classification (spam filtering) - Doubt in one calculation? Which one is right? Plz c

- by Microkernel

Hi guys, I am implementing a Naive Bayesian classifier for spam filtering and I have a doubt about one calculation. Please clarify what I should do. Here is my question.

In this method, you have to calculate:
P(S|W) - probability that a message is spam given that word W occurs in it.
P(W|S) - probability that word W occurs in a spam message.
P(W|H) - probability that word W occurs in a ham message.

So, to calculate P(W|S), should I do
(1) (number of times W occurs in spam) / (total number of times W occurs in all the messages), OR
(2) (number of times W occurs in spam) / (total number of words in the spam message)?

(I thought it to be (2), but I am not sure, so please clarify.) I am referring to http://en.wikipedia.org/wiki/Bayesian_spam_filtering for the info, by the way. I have to complete the implementation by this weekend :(

Thanks and regards, MicroKernel :)

@sth: Hmm... Shouldn't repeated occurrence of word W increase a message's spam score? In your approach it wouldn't, right? Let's take a scenario and discuss.

Let's say we have 100 training messages, of which 50 are spam and 50 are ham, and say the word count of each message is 100. And let's say word W occurs 5 times in each spam message and 1 time in each ham message. So the total number of times W occurs in all the spam messages = 5*50 = 250, and the total number of times W occurs in all the ham messages = 1*50 = 50. Total occurrences of W in all of the training messages = (250+50) = 300.

So, in this scenario, how do you calculate P(W|S) and P(W|H)? Naturally we should expect P(W|S) > P(W|H), right? Please share your thoughts...
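The arithmetic of the scenario can be sketched in a few lines of Python. This is only an illustration of the two candidate formulas, not a full classifier. It assumes that the denominator of option (2) means the total word count over all spam (or ham) training messages; the function names are made up for this sketch.

```python
# Figures taken from the scenario in the post.
n_spam, n_ham = 50, 50           # training messages per class
words_per_message = 100          # every message has 100 words
w_per_spam, w_per_ham = 5, 1     # occurrences of W in each spam / ham message

w_in_spam = w_per_spam * n_spam  # 250 occurrences of W in spam
w_in_ham = w_per_ham * n_ham     # 50 occurrences of W in ham
w_total = w_in_spam + w_in_ham   # 300 occurrences of W overall


def option_1(count_in_class, count_everywhere):
    """Option (1): share of all occurrences of W that fall in this class."""
    return count_in_class / count_everywhere


def option_2(count_in_class, total_words_in_class):
    """Option (2): share of this class's words that are W (assumed reading)."""
    return count_in_class / total_words_in_class


p1_spam = option_1(w_in_spam, w_total)                     # 250 / 300
p1_ham = option_1(w_in_ham, w_total)                       # 50 / 300
p2_spam = option_2(w_in_spam, n_spam * words_per_message)  # 250 / 5000
p2_ham = option_2(w_in_ham, n_ham * words_per_message)     # 50 / 5000

print(f"option (1): spam={p1_spam:.4f} ham={p1_ham:.4f}")
print(f"option (2): spam={p2_spam:.4f} ham={p2_ham:.4f}")
```

Both options give a larger value for spam than for ham in this scenario. Note that the two option (1) values sum to 1 across the classes, so that ratio behaves like "which class does an occurrence of W come from" rather than a per-class word probability, whereas option (2) is conditioned on the class.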

Read the article
How to completely wipe rubygems along with rails etc

- by Earlz

Ok, so I decided I'd be cool and try to use Rails3 that's in beta. Then, things were getting hard to manage so I got rvm. I installed ruby 1.9.2-head in rvm and things were working, and then a computer restart later rails wouldn't start up. So I figured I'd just try running the system ruby and start rails in it. same error. Then, I uninstalled rails3 and got rails: no such file or directory type errors.. So now I'm royally screwed because rails2 is still installed but will not uninstall because of invisible dependencies, along with a lot of other random gems. How do I completely clear out all ruby gems and such so I can start anew?

Read the article
Posting data using AJAX

- by ognjenb

    public ActionResult Create()
    {
        return View();
    }

    //
    // POST: /Partial/Create
    [HttpPost]
    public ActionResult Create(last_name ln)
    {
        try
        {
            // TODO: Add insert logic here
            proba.AddTolast_name(ln);
            proba.SaveChanges();
            return View();
        }
        catch
        {
            return View();
        }
    }

Part of the Index view:

    <div id="Div1">
    <% using (Ajax.BeginForm("Create", new AjaxOptions { UpdateTargetId = "Div1" })) { %>
        <%= Html.Action("Create")%>
    <% } %>
    </div>

With this version, the data from the Create view is not posted to the database. With the following version, the data is posted to the database:

    <div id="Div1">
        <%= Html.Action("Create")%>
    </div>

The Create view is a strongly typed view. Why is this happening, i.e. why does the first version not work?

Read the article
Commercial web application--scalable database design

- by Rob Campbell

I'm designing a set of web apps to track scientific laboratory data. Each laboratory has several members, each of whom will access both their own data and that of their laboratory as a whole. Many typical queries will thus be expected to return records of multiple members (e.g. my mouse, joe's mouse and sally's mouse). I think I have the database fairly well normalized. I'm now wondering how to ensure that users can efficiently access both their own data and their lab's data set when it is mixed among (hopefully) a whole ton of records from other labs. What I've come up with so far is that most tables will end with two fields: user_id and labgroup_id. The WHERE clause of any SELECT statement will include the appropriate reference to one of the id fields ("...WHERE labgroup_id=n..." or "...WHERE user_id=n..."). My questions are: Is this an approach that will scale to 10^6 or more records? If so, what's the best way to use these fields in a query so that it most efficiently searches the relevant subset of the database? E.g. should the first step in querying be to create a temporary table containing just the labgroup's data? Or will indexing using some combination of the id, user_id, and labgroup_id fields be sufficient at that scale? I thank any responders very much in advance.
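The indexing option mentioned at the end of the post can be tried out on a small scale. The sketch below uses Python's built-in sqlite3 module purely as a stand-in for whatever database the application uses; the table name mouse, the index name, and the sample data are all invented for illustration. It creates a composite index on (labgroup_id, user_id) and asks the engine for its query plan.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Hypothetical table following the "every table ends with user_id and
# labgroup_id" convention described in the post.
conn.execute("""
    CREATE TABLE mouse (
        id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        user_id INTEGER NOT NULL,
        labgroup_id INTEGER NOT NULL
    )
""")

# Composite index: serves lookups by labgroup_id alone and by
# (labgroup_id, user_id) together.
conn.execute("CREATE INDEX idx_mouse_lab_user ON mouse (labgroup_id, user_id)")

# 1000 sample rows: 50 users spread over 10 lab groups.
rows = [(f"mouse-{i}", i % 50, i % 10) for i in range(1000)]
conn.executemany(
    "INSERT INTO mouse (name, user_id, labgroup_id) VALUES (?, ?, ?)", rows
)

LAB_QUERY = "SELECT id, name FROM mouse WHERE labgroup_id = ?"


def lab_mice(labgroup_id):
    """Return all rows belonging to one lab group."""
    return conn.execute(LAB_QUERY, (labgroup_id,)).fetchall()


def query_plan(sql, params):
    """Return the human-readable detail column of EXPLAIN QUERY PLAN."""
    return [row[-1] for row in conn.execute("EXPLAIN QUERY PLAN " + sql, params)]


print(len(lab_mice(3)))
print(query_plan(LAB_QUERY, (3,)))
```

If the plan mentions idx_mouse_lab_user, the engine is searching the index rather than scanning the whole table, which is the behaviour the question hopes for. Other engines expose the same information through their own EXPLAIN output, so the check carries over even though the syntax differs.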

Read the article
Current / Context property in class

- by richard-heesbeen

Hi, I would like to implement a Current property in my class which returns the currently active context of the class (much like HttpContext.Current etc.), like so: using (classA x = new classA(..)) { classB.Dosomething(); } where the method Dosomething() gets the current context of classA to perform some operation. How would I go about creating such functionality? Greetz, Richard
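The pattern being asked about is often called an ambient context. A minimal sketch of the idea follows, written in Python rather than C#, so the with statement plays the role of using and __enter__/__exit__ play the role of the constructor and Dispose. The class and method names mirror the post but are otherwise hypothetical, and a contextvars.ContextVar is just one possible way to store the current instance.

```python
import contextvars


class ClassA:
    """Context object that registers itself as 'current' while it is active."""

    _current = contextvars.ContextVar("ClassA.current", default=None)

    def __init__(self, name):
        self.name = name
        self._token = None

    @classmethod
    def current(cls):
        """Return the innermost active ClassA, or None outside any context."""
        return cls._current.get()

    def __enter__(self):
        # Remember the token so the previous context is restored on exit.
        self._token = ClassA._current.set(self)
        return self

    def __exit__(self, exc_type, exc, tb):
        ClassA._current.reset(self._token)
        return False  # do not swallow exceptions


class ClassB:
    @staticmethod
    def do_something():
        """Use whichever ClassA context is active, without receiving it as an argument."""
        ctx = ClassA.current()
        if ctx is None:
            raise RuntimeError("no active ClassA context")
        return f"working in {ctx.name}"


with ClassA("outer"):
    print(ClassB.do_something())
    with ClassA("inner"):
        print(ClassB.do_something())
    print(ClassB.do_something())
```

Restoring the previous value on exit is what makes nesting work: the inner context hides the outer one only for the duration of its block.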

Read the article
how to post text on facebook from iphone

- by navjot

how to post text on facebook from iphone

Read the article
jQuery: Animated header plugin

- by Fverswijver

I'm looking for a jQuery plugin that can help me with the following: I have a list of images I want to use for my header, but they are pretty big (height especially) and I don't want to resize them to fit my small header div. What I'd want is a plugin that allows the images to start at the bottom of the div (or rather the top of the image at the top of the div) and move upwards so the entire image can be seen, and once they have been shown entirely (bottom of image at bottom of div) they should "blend" (opacity toggle or something alike) with the next image and thus create a continuous loop with all the images. I've looked through several plugins but have never found one that can achieve what I'm looking for (maybe I'm asking for a tad too much), and my JS is not good enough to build it myself. Thanks!

Read the article
What is the Null Character literal in TSQL?

- by David in Dakota

I am wondering what the literal for a null character (e.g. '\0') is in TSQL. Note: not a NULL field value, but the null character (see link). I have a column with a mix of typical characters and a null character. I'm trying to replace the null character with a different value. I would have thought that the following would work, but it is unsuccessful: select REPLACE(field_with_nullchar, char(0), ',') from FOO where BAR = 20

Read the article
Is it possible to use return statement in try block?. how?

- by Ayyappan.Anbalagan

Is it possible to use a return statement in a try block? If so, how, and what is the use of the statement?

Read the article
cron stops running when processing multiple items

- by James

Cron stops running (no visible error) after a few hours when I add more than 5 jobs to my crontab. Each job runs every minute (it polls a webpage for information, which takes 1 second). I tried putting all of my PHP jobs in a shell script and calling this shell script instead, but the same problem occurs. Cron stops running, with no error in the log file and no error email sent out either. Has anyone encountered this before? Where/how can I debug this?

Read the article
MySQL - how to retrieve columns in same row as the values returned by min/max

- by Gala101

I couldn't frame the question's title properly. Suppose a table of weekly movie earnings with the columns below:

    MovieName  MovieGross  WeekofYear  Year

How do I get the names of the top grossers for each week of this year? If I do

    select MovieName, Max(MovieGross), WeekofYear from earnings where year = 2010 group by WeekofYear;

then obviously the query won't run, and

    select Max(MovieName), Max(MovieGross), WeekofYear from earnings where year = 2010 group by WeekofYear;

would just give the movie name that sorts last alphabetically, not the top grosser. Is using group_concat and then substring_index the only option here?

    select substring_index(group_concat(MovieName order by MovieGross desc), ',', 1), Max(MovieGross), WeekofYear from earnings where year = 2010 group by WeekofYear;

Seems clumsy. Is there any better way of achieving this?
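One commonly used alternative to the group_concat trick is to compute the maximum gross per week in a subquery and join it back to the table. The sketch below runs that query against an in-memory SQLite database from Python only so that it is self-contained; the sample rows are invented, and the SQL uses nothing SQLite-specific, so it should carry over to MySQL, though that is an assumption to verify.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE earnings ("
    "MovieName TEXT, MovieGross INTEGER, WeekofYear INTEGER, Year INTEGER)"
)

# Invented sample data for illustration only.
conn.executemany(
    "INSERT INTO earnings VALUES (?, ?, ?, ?)",
    [
        ("Alpha", 100, 1, 2010),
        ("Beta", 300, 1, 2010),
        ("Gamma", 250, 2, 2010),
        ("Alpha", 50, 2, 2010),
        ("Beta", 900, 1, 2009),  # other year, must be ignored
    ],
)

# Per-week maximum in a derived table, joined back to fetch the matching row.
TOP_PER_WEEK = """
SELECT e.WeekofYear, e.MovieName, e.MovieGross
FROM earnings AS e
JOIN (
    SELECT WeekofYear, MAX(MovieGross) AS TopGross
    FROM earnings
    WHERE Year = 2010
    GROUP BY WeekofYear
) AS m
  ON m.WeekofYear = e.WeekofYear AND m.TopGross = e.MovieGross
WHERE e.Year = 2010
ORDER BY e.WeekofYear
"""

top = conn.execute(TOP_PER_WEEK).fetchall()
for week, name, gross in top:
    print(week, name, gross)
```

Unlike the substring_index version, this returns one row per tied movie when two films share the top gross in a week, which may or may not be what is wanted.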

Read the article
Which is more efficient/faster when calling a cached image?

- by andufo

Hi, I made an image resizer in PHP. When an image is resized, it caches a new jpg file with the new dimensions. Next time you call the exact img.php?file=hello.jpg&size=400, it checks if the new jpg has already been created. If it has NOT been created yet, it creates the file and then prints the output (cool). If it ALREADY exists, no new file needs to be generated and instead it just calls the already cached file. My question is regarding the second scenario. Which of these is faster?

Redirecting:

    header('Location: cache/hello_400.jpg');
    die();

Grabbing the data and printing the cached file:

    $data = file_get_contents('cache/hello_400.jpg');
    header('Content-type: '.$mime);
    header('Content-Length: '.strlen($data));
    echo $data;

Any other ways to improve this?

Read the article
How to share code with continuous integration

- by alchemical

I've just started working in a continuous integration environment (TeamCity). I understand the basic idea of not getting so abstracted out in your code that you are never able to build it to test functionality, etc. However, when there is deep coding going on, occasionally it will take me several days to get buildable code--but in the interim other team members may need to see my code. If I check the code in, it breaks the build. However, if I don't check it in, my team members are unable to see the most recent work. I'm wondering how this situation is best dealt with.

Read the article
.Net equivalent of JSP?

- by MCS

What is the .Net equivalent of a JSP page?

Read the article
ServerIdentity memory leak with IHttpAsyncHandler

- by Anton

I have a .NET web application that consists of a single HTTP handler class that implements IHttpAsyncHandler. All requests to this handler are handled asynchronously, though some requests are short-lived and some are long-lived (nothing over a few seconds). The problem is that memory consumption grows over time as requests are handled. All profiling results point to an unbounded growth of String objects held by instances of System.Runtime.Remoting.ServerIdentity. Every String value is different, but they all look similar to: /dd41c00e_1566_4702_b660_c81cdea18a43/vigefresi5pfv8n0ekddg57z_1154.rem There is nothing in my application that uses ServerIdentity directly, and unless I am mistaken, the ServerIdentity instances are proportional to the number of incoming requests. If this is an internal .NET structure, it looks like the CLR is not cleaning up after itself. What could be causing the leak? UPDATE A little less than half of the String objects are being held by System.Runtime.Remoting. The remaining String objects are being held by System.Runtime.Serialization and look similar to: +1sgess5rjcrgbmp3kqr6bmv_3474.rem Also, the problem only seems to occur when lots of simultaneous HTTP web requests arrive.

Read the article
SO_LINGER and closing sockets(WINSOCK)

- by Johnny Walked

Hey. I'm writing a multithreaded Winsock application and I'm having some issues with closing the sockets. First of all, is there a limit on the number of simultaneously open sockets? Let's say 32 sockets all at once. I establish a connection on one of the sockets and pass information, and it all goes right. The problem is that when I disconnect the socket and then reconnect to the same destination, I get a RST from the server after my SYN. I don't have the code for the server app, so I can't debug it. When I used SO_LINGER and it sent a RST flag at the end of each session, it worked, but I don't want to end my connections this way. When not using SO_LINGER, a FIN flag was sent, but it seems the connection was not really closed. Any help? Thanks

Read the article
repository server and client setup

- by regmaster

explain to me in brief how to setup repo server and point client to that server? where do you manipulate the settings in server and also client. Give me the most common setup example?

Read the article
Using "Active Directory Users and Computers" for a different domain

- by Jaxidian

How can I manage a domain with the "Active Directory Users and Computers" from a computer that is not on that domain? I realize I'll need some domain admin (or less) credentials, but that's fine. I have 2 scenarios where I'd like to do this:

1. From a machine on Domain 1, but I'd also like to manage Domain 2 (the 2 domains are in no way related).
2. From a laptop that is not a member of any domain.

If we can figure out #2, that will be "good enough", but #1 would be nice too.

Read the article
Customizing Flowcharts in Oracle Tutor

- by [email protected]

Today we're going to look at how you can customize the flowcharts within Oracle Tutor procedures, and how you can share those changes with other authors within your company. Here is an image of a flowchart within a Tutor procedure with the default size and color scheme. You may want to change the size of your flowcharts as your end-users might have larger screens or need larger fonts. To change the size and number of columns, navigate to Tutor > Author > Author Options > Flowcharts. The default is to have 4 columns appear in each flowchart, but, if I change it to six, my end-users will see a denser flowchart. This might be too dense for my end-users, so I will change it to 5 columns, and I will also deselect the option to have separate task boxes. Now let's look at how to customize the colors. Within the Flowchart options dialog, there is a button labeled "Colors." This brings up a dialog box of every object on a Tutor flowchart, and I can modify the color of each object, as well as the text within the object. If I click on the background, the "page" object appears in the Item field, and now I can customize the color and the title text by selecting Select Fill Color and/or Select Text Color. A dialog box with color choices appears. If I select Define Custom Colors, I can make my selections even more precise. Each time I change the color of an object, it appears in the selection screen. When the flowchart customization is finished, I can save my changes by naming the scheme. Although the color scheme I have chosen is rather silly looking, perhaps I want others to give me their feedback and make changes as they wish. I can share the color scheme with them by copying the FCP.INI file in the Tutor\Author directory into the same directory on their systems. If the other users have color schemes that they do not want to lose, they can copy the relevant lines from the FCP.INI file into their file.
If I flowchart my document with the new scheme, I can see how it looks within the document. Sometimes just one or two changes to the default scheme are enough to customize the flowchart to your company's color palette. I have seen customers who have only changed the Start object to green and the End object to red, and I've seen another customer who changed every object to some variant of black and orange. Experiment! And let us know how you have customized your flowcharts. Mary R. Keane Senior Development Director, Oracle Tutor

Read the article
A Checklist of Current SEO Techniques

Search engine optimization entails fine-tuning a website's code alongside a thorough linking architecture, and, just like mechanics, practitioners use a checklist to track what has already been accomplished and to measure the effect of each step on all other variables. The following is a basic overview of the steps you should take to ensure you leave nothing to chance: anything that can have a positive effect on your website's ranking and thus improve the amount of traffic.

Read the article
Two Factors of Top SEO Rankings

There are two things required to rank at the top of the search engines. With both of these elements in place, your site should see high placements. But if you are weak in either, ranking will be almost impossible. They are: relevant content and links.

Read the article
Directory Submission Services - Learn to Make Most of Them

Who Offers Directory Submission Services? Most business owners would know the meaning of directory submission but not all are aware of the companies or entities that are offering these services. Directory submissions are a part of the overall search engine optimization or SEO plan.

Read the article
Top 5 White Hat Techniques in SEO

SEO techniques that help your site to get better page rank in search engines can be both White Hat and Black Hat. White hat techniques are accepted as ethical and useful long term, while black hat methods forcibly manipulate page rankings which are either discarded or not preferred because they flout the ethical norms. You should know the difference between these two and select a firm that utilizes only white hat techniques.

Read the article
How is Your Business Doing?

Times are changing and marketing methods are evolving; the question is, are you? A well-structured marketing plan can help turn your business around.

Read the article
Why You Should Always Use SEO

Web design is something you do once. Filling your site with content is something you do once. Setting up ads is something you do once. But marketing/SEO is something you do every single week. If you want your site to climb to the top and stay there, you must be diligent in your marketing efforts.

Read the article
