| Research Area: |
Machine Learning |
| My Home School: | University of Pennsylvania |
| My email: | kadavids AT sas.upenn.edu |
| or | kadavids AT dimax.rutgers.edu |
What does it mean for a computer to "learn"? For example, if a computer is given the medical data of a large number of both sick and healthy patients, could it learn to produce a formula for diagnosing future patients as either healthy or sick? Could it tell us what factors are the most important in making that distinction?
Previously, doctors diagnosed patients based on a relatively small amount of data: a few test results, experience with former patients, and whatever outside knowledge they had acquired. Now, large laboratory experiments and genetic testing provide us with data that is simply to large for the human brain to analyze. Instead we turn to computers to analyze the data for us.
Data such as medical test results are subject to plenty of error (perhaps by the humans that conducted the reading, or maybe mechanical estimates.) We have to allow for error in any formulas created by a computer using this data. I spent the summer investigating ways for the computer to allow the most error tolerance but create the simplest and most useful formulas for making diagnoses.