Here's my task:
I need to design a database that will allow me to describe a long text that I'm analyzing for my dissertation. I'll be analyzing the text from several angles and inputing all of this data into the database. Eventually, I will need to filter this data and print reports.
Some ground rules:
- The text itself will never be changed. (obviously)
- The reference numbers of the text (i.e. sentence #s) are already set and cannot be altered, though I need to add to them.
- So, currently, the text is set with chapters and verses. I'll need to add another label, clause #. Each "verse" will have at least one clause and usually more. Currently I'm working with a calculated field of Ch.V.Cl#. These are all unique.
- I will analyze several characteristics of each clause such as verb form, introducers, conjunctions, clause type, perspective of writer, topic, etc. Each characteristic will only describe each clause one time.
- At this point I'm only building my database, so I'm not sure what kinds of queries or reports I'll be doing in the future.
To complicate matters, I'm a noob to Access .
What I'm thinking so far:
Here's what I'm envisioning. Basically, my database will have two types of tables:
- TYPE 1: A single "CLAUSES" table with columns:
- ClauseID (PrimKey that I'll leave hidden),
- each clause in a separate field,
- Ch#,
- V#,
- and Clause#,
- then calculated column: Chapter + Verse + clause#.
- I'll use this table as a home base, from which I'll input all of the other data. Then other columns will be imported from:
- TYPE 2: Several tables of "CHARACTERISTICS."
- Each will will also have auto-number ID (as their primary key).
- "VERB TYPES"
- "INTRODUCERS"
- etc.
DESIGN: Each of the CHARACTERISTICS will have a "one to many" relationship with the CLAUSES table. (I.e. CHARACTERISTICS show up in the CLAUSES table many times. CLAUSES will never be reproduced twice.)
- This means my CLAUSES table will have lots of columns with lots of data. From what I've learned, this seems like poor database design: I have too much information stored in the CLAUSES table. However, I wonder if that matters because of the relative simplicity of my database. (i.e. because I only have lots of "one to many" relationships.)
- I have considered creating a separate table wherein I label each clause with only one characteristic––thus dividing my data. But I can't seem to import the basics into a new table: i.e. every clause ("long text") and my calculated column of Chapter+ Verse+Clause#. I need this info in each table because I have to look at it in order to analyze it.
- Ultimately, the reason I landed on this design is because I can't figure out how to get all of my clauses (or even the reference of Ch.V.Cl#) to show up in each CHARACTERSTICS table. But I can get the CHARACTERISTICS tables to show up in my CLAUSES table (mostly I'm using drop-downs).
Here are my questions from my noob perspective:
- Is my proposed design workable for a database of several thousand clauses and all of their characteristics? Or, Is my proposed database "bad database design"?
- If I should go another direction, how can I get my clauses––or at least my calculated column into each CHARACTERISTICS table?
- At this point I'm up for a completely different design structure. As I said, I'm having trouble envisioning how this will work out. So, does anyone have an idea for me?