


Flood history Uganda for Forecast-Based Financing
By Eddie Wasswa Jjemba (RCRCCC) and Jurjen Wagemaker (FloodTags).
Background
The Red Cross Red Crescent Clim
Approach and Results
As source data we used the online repositories of two newspapers, Daily Monitor and New Vision, and searched them using a large number of flood-related keywords. From Daily Monitor we downloaded a total of 2,974 news articles published between 2004 and 2015. From New Vision we downloaded 752 news articles published between 2001 and 2015. Unfortunately, the New Vision database could not be used fully, because the repository has serious access limitations: each query returns only the top 200 articles, and there is no advanced search.
Within the combined database of 3,726 news articles, we clustered all sentences on the basis of similar features and then annotated them using four classes: 1. Current flood event, 2. Past event or flood warning, 3. Mixed and 4. Unrelated. After annotation we found that 1,721 news articles held relevant flood information (annotated as class 1 or 2). For these articles we looked for geographical references (mentions of district names and then of sub-county names) in sentences containing a selection of geography-related keywords.
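The filtering and lookup step described above can be sketched roughly as follows. This is a minimal illustration, not the implementation used in the project: the keyword list, the function name `find_district_mentions` and the input format (pairs of sentence text and class label) are all assumptions made for the example, and only district names are matched.

```python
import re

# Hypothetical inputs: the actual keyword and place-name lists are not given in this report.
GEO_KEYWORDS = ["district", "sub-county", "village", "parish"]
DISTRICTS = ["Abim", "Amuria", "Kotido", "Soroti", "Katakwi"]
RELEVANT_CLASSES = {1, 2}  # 1 = current flood event, 2 = past event or flood warning


def find_district_mentions(sentences):
    """Return {district: [sentence, ...]} for relevant, geo-keyword sentences.

    `sentences` is a list of (text, class_label) pairs (an assumed format).
    """
    mentions = {}
    for text, label in sentences:
        # Keep only sentences annotated as class 1 or 2.
        if label not in RELEVANT_CLASSES:
            continue
        # Keep only sentences containing at least one geography-related keyword.
        lowered = text.lower()
        if not any(keyword in lowered for keyword in GEO_KEYWORDS):
            continue
        # Match district names as whole words, ignoring case.
        for district in DISTRICTS:
            pattern = r"\b" + re.escape(district) + r"\b"
            if re.search(pattern, text, flags=re.IGNORECASE):
                mentions.setdefault(district, []).append(text)
    return mentions
```

Called on a list of annotated sentences, the function returns only those that pass both filters, grouped by district. Dropping the keyword check would correspond to a looser search across all sentences.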
As a result, for the districts of interest (Abim, Amuria, Kotido, Soroti and Katakwi) we found a total of 63 news articles with flood sentences and a geographical reference. Applying the same approach to all districts in Uganda, we found a total of 1,173 such articles (except that in this case we did not restrict the search to sentences containing geography-related keywords). Next, we manually validated the results for the districts of interest. In 85% of cases we had found an actual flood event (the event was automatically detected for the correct month and year in the correct location or locations), while 15% were false positives, which we corrected in the map. For the flood reports covering the whole of Uganda we may therefore expect a margin of error of around 15% (false positives). We presented the results on an internal website.
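The validation figures above amount to a precision estimate: the share of automatically detected events that manual checking confirmed. A minimal sketch of that calculation follows. The function name `precision` and the sample of 20 events are hypothetical, since the number of events checked is not stated here; the sample is chosen only so that it reproduces the 85% figure.

```python
def precision(validated):
    """Fraction of detected events confirmed as real floods.

    `validated` is a list of booleans, one per automatically detected
    event: True if manual checking confirmed it, False if it was a
    false positive.
    """
    if not validated:
        raise ValueError("no validated events")
    return sum(validated) / len(validated)


# Hypothetical sample: 17 confirmed events and 3 false positives.
sample = [True] * 17 + [False] * 3
share_confirmed = precision(sample)
share_false_positive = 1 - share_confirmed
```

Extending the false-positive share from the five validated districts to the whole country assumes that detection quality is similar elsewhere, which the looser nationwide search may not guarantee.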
The Red Cross Red Crescent Climate Centre and FloodTags are continuing their work on historic flood mapping to improve its accuracy and coverage. Two changes that could substantially improve the results are improving the geocoding database (we found many errors in the OpenStreetMap database for Uganda) and improving the clustering methods (among others, isolating different flood incidents, including blocked roads, and improving geocoding). A third improvement would be to negotiate access to a larger number of articles from New Vision or other local newspapers.