Baseball, P. (2000). During the P. Golf ball, H. F. Spirer, & L. Spirer (Eds.), Making the Instance: Investigating Large scale Person Liberties Violations Playing with Recommendations Assistance and you may Study Study. AAAS.
Belin, T. R., & Rubin, D. B. (1995). A technique to own calibrating incorrect-match cost for the list linkage. Diary of American Statistical Relationship, 90(430), 694–707.
Bilenko, M., & Mooney, Roentgen. J. (2003). Adaptive Content Recognition Using Learnable Sequence Resemblance Procedures. During the KDD ’03 (pp. 39–48). ACM.
Christen, P. (2008). Automatic Checklist Linkage Having fun with Seeded Nearby Neighbour and you will Service Vector Machine Group. During the KDD ’08 (pp. 151–159). ACM.
Christen, P. (2012). A survey regarding indexing suggestions for scalable number linkage and deduplication. IEEE Deals to the Studies and you may Data Technology, 24(9), 1537–1555.
Cohen, W., Raviku). A comparison out of sequence metrics for matching names and details. When you look at the KDD working area for the research cleanup and you will target combination (Vol. 3, pp. 73–78).
Copas, J., & Hilton, F. (1990). List linkage: Analytical designs to possess complimentary computers details. Record of your own Regal Statistical Neighborhood, Show An effective, 153(3), 287–320.
Dai, Good. M., & Storkey, A good. J. (2011). New grouped writer-thing model for unsupervised entity resolution. In the Phony neural companies and you may servers reading–icann 2011 (pp. 241–249). Springer.
Fortini, Meters., Liseo, B., Nuccitelli, A great., & Scanu, Yards. (2001). Into Bayesian Checklist Linkage. Search for the Authoritative Analytics, 4(1), 185–198.
Gutman, R., Afendulis, C., & Zaslavsky, Good. (2013). An excellent bayesian means of document connecting to research avoid- of-lifestyle scientific costs. Journal of your own American Statistical Connection, 108(501), 34–47.
Hsu, W., Lee, Meters. L., Liu, B., & Ling, T. W. (2000). Mining najljepЕЎe Tajvan Еѕene Exploration in the Diabetics Database: Findings and you may Findings. For the KDD ’00 (pp. 430–436). ACM.
A split-mix Markov chain Monte Carlo procedure of brand new Dirichlet process mix design
Jewell, Letter. P., Spagat, M., & Jewell, B. L. (2013). MSE and Casualty Matters: Presumptions, Interpretation, and you may Challenges. When you look at the T. B. Seybolt, J. D. Aronson, & B. Fischhoff (Eds.), Depending Civilian Casualties: An introduction to Tape and you may Estimating Nonmilitary Deaths in conflict. Oxford, UK: Oxford University Drive.
Larsen, Meters. D. (2002)ments for the Hierarchical Bayesian List Linkage. For the Procedures of one’s joint analytical meetings, part on questionnaire look methods (pp. 1995–2000). This new American Analytical Relationship.
Steorts, Roentgen
Larsen, M. D. (2005). Advances during the Listing Linkage Theory: Hierarchical Bayesian Checklist Linkage Concept. Inside Proceedings of shared analytical conferences, section into the questionnaire browse strategies (pp. 3277–3284). Brand new Western Statistical Organization.
Larsen, Meters. D., & Rubin, D. B. (2001). Iterative automated list linkage playing with mix habits. Journal of your own American Analytical Association, 96(453), 32–41.
Lum, K., Price, M. Age., & Finance companies, D. (2013). Software out-of Numerous Solutions Quote in the People Liberties Research. The fresh new Western Statistician, 67(4), 191–two hundred.
Marchant, Letter. Grams., C., Kaplan, A great., Rubinstein, B. We. P., & Elazar, D. Letter. (2019). D-blink: Marketed stop-to-avoid bayesian organization resolution.
McCallum, A great., & Wellner, B. (2004). Conditional Types of Identity Suspicion which have App to help you Noun Coreference. In Enhances in neural guidance processing expertise (nips ’04) (pp. 905–912). MIT Push.
Miller, P. L., Frawley, S. J., & Sayward, F. G. (2000). IMM/Scrub: A domain-Particular Tool on Deduplication of Inoculation History Facts during the Youth Immunization Registriesputers and you may Biomedical Search, 33(2), 126–143.
Murphy, J., Brackbill, Roentgen. Meters., Thalji, L., Dolan, Meters., Pulliam, P., & Walker, D. J. (2007). Measuring and Enhancing Visibility globally Change Cardiovascular system Wellness Registry. Analytics inside the Medicine, 26(8), 1688–1701.
Murray, J. S. (2016). Probabilistic record linkage and you may deduplication once indexing, clogging, and you may filtering. Record away from Privacy and Privacy, 7(1), 3–24.
Newcombe, H. B., Kennedy, J. Meters., Axford, S. J., & James, Good. P. (1959). Automated linkage away from vital records hosts are often used to extract” follow-up” analytics out of family members away from data away from regime suggestions. Technology, 130(3381), 954–959.
Sadinle, Yards. (2014). Discovering Duplicates in the a murder Registry Having fun with an effective Bayesian Partitioning Strategy. Annals out-of Applied Analytics, 8(4), 2404–2434.
Sariyar, Meters., Borg, An excellent., & Pommerening, K. (2012). Active Learning Tricks for the new Deduplication out-of Digital Patient Investigation Playing with Group Trees. Log out-of Biomedical Informatics, 45(5), 893–900.
C., Hallway, R., & Fienberg, S. E. (2016). A Bayesian Method to Graphical Checklist Linkage and you can Deduplication. Log of your own Western Statistical Organization, 111(516), 1660–1672.
Tancredi, A great., & Liseo, B. (2011). An effective hierarchical Bayesian method to listing linkage and you may population proportions problems. Annals regarding Applied Analytics, 5(2B), 1553–1585.
