Paul Meagher shows how the getConditionalProbability function, which draws from a database, relates to literature about Bayesian reasoning.

Frequency versus probability format


The getConditionalProbability function you've developed operates on counts and frequencies rather than on probabilities. In reading the literature on Bayesian reasoning, you will notice that the enumeration method for computing P(A | B) is only briefly discussed. Most authors quickly move on to describing how P(A | B) can be formulated using terms that denote probability values rather than frequency counts. For example, you can recast the formula for computing P(A | B) in probability terms as:

P(A | B) = P(A & B) / P(B)

The advantage of recasting the formula in terms of probabilities instead of frequency counts is that, in practice, you often don't have access to a data set from which you can derive conditional probability estimates by enumerating cases. Instead, you often have access to higher-level summary information from past studies in the form of percentages and probabilities. The challenge then becomes finding a way to use these probability estimates to compute the conditional probabilities you are interested in. Recasting the conditional probability formula in terms of probabilities allows you to make inferences based on related probability information that is more readily accessible.
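As a rough illustration of why the two formats are interchangeable, the following Python sketch computes P(A | B) once from raw counts and once from probabilities derived from those counts. The function names and the counts are hypothetical and chosen only for this example; they are not part of the getConditionalProbability implementation.

```python
from fractions import Fraction


def conditional_from_counts(n_a_and_b, n_b):
    """Enumeration form: {A & B} / {B}, using raw case counts."""
    if n_b == 0:
        raise ValueError("no cases of B observed")
    return Fraction(n_a_and_b, n_b)


def conditional_from_probabilities(p_a_and_b, p_b):
    """Probability form: P(A | B) = P(A & B) / P(B)."""
    if p_b == 0:
        raise ValueError("P(B) is zero, so P(A | B) is undefined")
    return Fraction(p_a_and_b) / Fraction(p_b)


# Hypothetical data: 200 cases in total, 50 where B holds,
# and 20 where both A and B hold.
n_total, n_b, n_a_and_b = 200, 50, 20

from_counts = conditional_from_counts(n_a_and_b, n_b)

# Summary information of the kind a published study might report.
p_a_and_b = Fraction(n_a_and_b, n_total)  # P(A & B) = 1/10
p_b = Fraction(n_b, n_total)              # P(B) = 1/4
from_probs = conditional_from_probabilities(p_a_and_b, p_b)

print(from_counts, from_probs)
```

Both calls yield the same value because the total number of cases cancels out of the ratio. Exact fractions are used here only to keep the comparison free of floating-point rounding.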

The enumeration method might still be regarded as the most basic and intuitive method for computing a conditional probability. In "An Essay towards solving a Problem in the Doctrine of Chances," Thomas Bayes uses enumeration to arrive at the conclusion that P(2nd Event = b | 1st Event = a) is equal to [P / N] / [a / N], which simplifies to P / a and can also be denoted as {a & b} / {a}:

Figure 1. Graphical representation of relations
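The cancellation behind this result can be written out explicitly. In the sketch below, N is the total number of cases, and the symbols n_a and n_ab are assumed notation introduced here for the number of cases in which the first event is a and the number in which both events occur. Reading the P in the expression [P / N] / [a / N] as the joint count n_ab is an interpretation rather than something the text states outright.

```latex
\[
P(b \mid a)
  = \frac{P(a \,\&\, b)}{P(a)}
  = \frac{n_{ab} / N}{n_{a} / N}
  = \frac{n_{ab}}{n_{a}}
  = \frac{\{a \,\&\, b\}}{\{a\}}
\]
```

Because N appears in both the numerator and the denominator, the probability form and the enumeration form always give the same value.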

Another reason to be aware of frequency versus probability format issues is that Gerd Gigerenzer (and others) have demonstrated that people reason more closely in accordance with prescriptive Bayesian rules of inference when background information is presented in terms of frequencies of cases (1 in 10 cases) rather than probabilities (10 percent probability). A practical application of this research is that medical students are now being taught to communicate risk information in terms of frequencies of cases instead of probabilities, which makes it easier for patients to make better-informed judgements about what actions are warranted given their test results.
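To make the contrast concrete, the sketch below converts probability-format information about a diagnostic test into frequencies of cases for an imagined group of 1,000 people. All of the numbers (a 1 percent base rate, an 80 percent hit rate, and a 10 percent false-alarm rate) and the function name are hypothetical and chosen for round arithmetic; they are not taken from Gigerenzer's studies.

```python
from fractions import Fraction


def natural_frequencies(population, base_rate, hit_rate, false_alarm_rate):
    """Restate probability-format test information as counts of cases."""
    with_condition = population * base_rate
    without_condition = population - with_condition
    return {
        "with_condition": with_condition,
        "without_condition": without_condition,
        # People who have the condition and test positive.
        "true_positives": with_condition * hit_rate,
        # People who do not have the condition but still test positive.
        "false_positives": without_condition * false_alarm_rate,
    }


# Hypothetical figures, expressed as exact fractions.
freq = natural_frequencies(
    population=1000,
    base_rate=Fraction(1, 100),
    hit_rate=Fraction(4, 5),
    false_alarm_rate=Fraction(1, 10),
)

# Enumeration: of everyone who tests positive, how many have the condition?
all_positives = freq["true_positives"] + freq["false_positives"]
posterior = freq["true_positives"] / all_positives

print(f"{freq['true_positives']} of {all_positives} positive tests "
      f"are people with the condition")
```

In frequency format the statement becomes "8 of the 107 people who test positive have the condition," which is the same quantity as P(condition | positive test) but is arrived at by counting cases rather than by manipulating probabilities.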




The content shown in this page was first published by IBM developerWorks and is reprinted with permission from Paul Meagher (www.datavore.com)

