You are here: Learn > The Library > Daily News Desk > Ancestry Daily News

Ancestry Daily News
1/8/2004 - Archive

•  Ancestry Daily News, 8 January 2004
•  RootsWorks: Databasics—Using a Database for Genealogy

RootsWorks: Databasics—Using a Database for Genealogy

A database, according to the American Heritage Dictionary, is "a collection of data arranged for ease and speed of search and retrieval." If you were to replace the word "data" with "mixed vegetables" and "search and retrieval" with "eating" you might get a better idea of what a database is. It's a collection of carrots, beans, and peas that you can sort, summarize, and print out.

Databases are similar to warehouses in the sense that they are great places to store things that are similar. If you want to search for a match on a state, and some of the records say "TX" and some say "Tex" and some say "Texas" then you will have to do something to help the computer group them properly. Maybe you would add a field named "CODEDSTATE" and put "TX" in it for each of the three cases. That's called "scrubbing" the data—and it's really important in family history to do that in a way that doesn't involve destroying the original citation.

I want to point out that there is a difference in a "database" and a "database management system (DBMS)." A database could be any old list, such as is often found in Microsoft Excel. A DBMS is a special program for storing and retrieving data, such as Microsoft Access.

Today we will talk about when you, as a genealogy buff, might decide to use a database, and when that database might be in a DBMS (hint: probably never).

Do You Really Need a Database?
I talk to a large number of genealogists, and I use databases a lot. It's my impression that a DBMS is a solution in search of a problem. Most of the challenges to organizing information encountered by the genealogist can be dealt with using readily available tools, and some of them are free.

You might decide that you really want to use one. Let me say that using a database program requires more training than using a spreadsheet or word processor or genealogy program. You have to design your table structures, you have to figure out how to link data that is in different tables, and you have to figure out how to get the information into the tables. Most people come to their senses and quit before they get past this point. But there ARE times when you can make a case for using a DBMS.

You have a better chance of being elected governor of California. For example, you will generally find that a spreadsheet can only handle 65,000 rows. So if you get your hands on a really big list of marriage records and want to play with it, you might need to use a DBMS.

But the best uses for a DBMS are to link together lots of different kinds of vegetables, and that takes some special training that most of us don't have. And now that I've beaten that point to death, here's the good news. You can get lots of help with your data without learning it all.

Hey Buddy, You Wanna Buy a Spreadsheet?
One of the easiest databases you can work with is a spreadsheet. It's fairly easy to make a column for each person and a row for each record and then put the relevant information into each cell. If you're careful you can sort it and it's really easy to tell what you're missing. But the real power of a database is to relate information from different sources—say, Marriages and Census and Land Records and Wills. You would still mix them up in Excel if you wanted to.

Excel has a great set of lookup functions that will let you pull in data from different worksheets. I use HLOOKUP and VLOOKUP a lot, and I think you could use them to link and compare different kinds of information. For more information on HLOOKUP and VLOOKUP and how these functions work, check the help function of Excel. If you think this is a tough learning curve, I can assure you that a DBMS presents you with a lot more to learn than linking Excel tables.

Excel also has a feature called "AutoFilter" that will let you click on any column and limit the display to some specific value in that column, such as a birth state of Alabama.

Get a Clooz
If you have a lot of different kinds of records and want to enter them into a computer to record them, think about using a program called Clooz, from Clooz.com. Clooz was written by Liz Kerstens, the managing editor of Genealogical Computing. It's a great genealogy filing cabinet, with templates for entering the various kinds of records that you find in family history. It's not free, but it's not expensive, and you can start entering your information in just a few minutes. No programming required! Clooz stores evidence—it doesn't make family trees.

How About a Real Genealogy Program?
Free or Fee?

If it's family trees you want, you'll need to get a "real genealogy program." There are a number of them that are free - such as PAF, Legacy, and Ancestry Family Tree. If you want more features than these programs provide, you can buy Family Tree Maker or the Master Genealogist—and leave the programming to someone else.

If You Really Must, Think MS Access
I've been using MS Access since version 1. The current version is Access 2003—part of Office 2003. There was a big change between Access 97 and Access 2000—you have to convert your pre-2000 applications in order to use them with the newer versions. Since then, you can use Access 2000 formats for any of the newer versions. And here's something fun—you can look at your Legacy and TMG data in MS Access. I don't recommend that you do anything to change any of that information, though. You can break your program and the developer will not be happy with you—but sometimes people want to see what's behind the curtain.

More Information
For links and more information about databases, please see the RootsWorks site.
If you want to discuss your database challenges, please drop by the RootsWorks Forums.
Registration is free, and I'd be interested to know what kinds of issues you are facing.


Beau Sharbrough is a product manager at Ancestry.com. His articles contain his own views and opinions and do not reflect any corporate policy or statement by the company. The RootsWorks series of articles focuses on genealogical applications for generic technologies. Beau would like to hear from you at www.rootsworks.com and discuss this or any topic related to the use of technology in family history. Tell us about your experiences. Please note that he cannot assist you with your individual computer problems. Visit the RootsWorks website for links to previous articles and Beau's lecture schedule.

Copyright 2004, MyFamily.com. All rights reserved.


  Printer Friendly
 
E-mail to a friend

Search The Library



Weekly Journal

Sign up for the Ancestry Weekly Discovery and get free family history tips, news and updates in your inbox.