Expert insights, analysis and smart data help you cut through the noise to spot trends, risks and opportunities. Join over 300,000 Finance professionals who already subscribe to the FT.
Similar News: You can also read news stories similar to this one that we have collected from other news sources.
French justice minister to stand trial over alleged abuse of position
Five ways Kwarteng can reduce UK debt
Tucuxi-BLAST: Enabling fast and accurate record linkage of large-scale health-related administrative databases through a DNA-encoded approach

Background: Public health research frequently requires the integration of information from different data sources. However, errors in the records and the high computational costs involved make linking large administrative databases using record linkage (RL) methodologies a major challenge.

Methods: We present Tucuxi-BLAST, a versatile tool for probabilistic RL that uses a DNA-encoded approach to encrypt, analyze and link massive administrative databases. Tucuxi-BLAST encodes the identification records into DNA. The BLASTn algorithm is then used to align the sequences between databases. We tested and benchmarked the tool on a simulated database containing records for 300 million individuals and on four large administrative databases containing real data on Brazilian patients.

Results: Our method was able to overcome misspellings and typographical errors in administrative databases. In processing the RL of the largest simulated dataset (200k records), the state-of-the-art method took 5 days and 7 h, while Tucuxi-BLAST took only 23 h. When compared with five existing RL tools applied to a gold-standard dataset from real health-related databases, Tucuxi-BLAST had the highest accuracy and speed. By repurposing genomic tools, Tucuxi-BLAST can improve data-driven medical research and provide a fast and accurate way to link individual information across several administrative databases.
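To make the idea of encoding identification records into DNA more concrete, here is a minimal Python sketch. It is not the Tucuxi-BLAST implementation: the abstract does not give the actual codebook, so the 2-bits-per-base mapping, the function names (`encode_to_dna`, `decode_from_dna`, `identity`) and the sample names are all assumptions made for illustration. The position-by-position identity score is only a crude stand-in for a BLASTn alignment, which also handles insertions and deletions.

```python
# Hypothetical mapping: each 2-bit value becomes one nucleotide.
# The real Tucuxi-BLAST codebook is not described in the abstract.
BASES = "ACGT"


def encode_to_dna(text: str) -> str:
    """Encode each UTF-8 byte of the upper-cased text as four nucleotides."""
    out = []
    for byte in text.upper().encode("utf-8"):
        # Walk the byte from its most significant bit pair to its least.
        for shift in (6, 4, 2, 0):
            out.append(BASES[(byte >> shift) & 0b11])
    return "".join(out)


def decode_from_dna(seq: str) -> str:
    """Invert encode_to_dna: every four nucleotides become one byte."""
    data = bytearray()
    for i in range(0, len(seq), 4):
        byte = 0
        for base in seq[i:i + 4]:
            byte = (byte << 2) | BASES.index(base)
        data.append(byte)
    return data.decode("utf-8")


def identity(a: str, b: str) -> float:
    """Fraction of matching positions (a simplified stand-in for alignment)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    matches = sum(x == y for x, y in zip(a, b))
    return matches / longest


if __name__ == "__main__":
    clean = encode_to_dna("MARIA SILVA")   # made-up record
    typo = encode_to_dna("MARIA SILBA")    # same record with one typo
    print(clean)
    print(decode_from_dna(clean))
    print(f"identity with typo: {identity(clean, typo):.3f}")
```

Under this assumed encoding, a single mistyped character changes at most four nucleotides, so the two sequences stay highly similar. This illustrates how a sequence aligner can tolerate misspellings in the underlying records.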