Revision: 39576
Updated Code
at July 27, 2011 07:19 by TopcatSoftware
Updated Code
+SYNOPSIS This document describes the formal specifications of the TOPIC database format and serves as its canonical reference. Designed for humans first and machines second, the TOPIC format attempts to provide a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well [i]. Uses include: knowledge-bases, glossaries, apropos, notes... +KEY CONCEPTS - TOPIC databases are OS neutral. - TOPIC databases are self-indexing. - TOPIC databases provide associations linking blocks of data. - TOPIC databases are written and read as standard ASCII [ii], so virtually any plain text editor is suitable for editing chores. - TOPIC databases use fundamentally simple markup [iii] employing only the plus ('+') and comma (',') characters to delimit content. - TOPIC databases allow the end user to label data in a straight foreword, intuitive manner. +TAGS - A tag line always begins with a single + character [HEX:2B] [iv]. - Tags are always located above the block it describes, alone on a single line. - Tags only contain alpha/numeric characters A-Z, a-z, 0-9, and optionally spaces [HEX:20] [iv]. - A tag can be either a single word, or a group of words. - Multiple tags are comma delimited [HEX:2C] [iv]. +BLOCKS - A block is always located below the tags that describe it. - Block lines never begin with a + character [HEX:2B] [iv]. - A block may contain any number of lines. - Empty lines within a block are valid. +LINES - Lines are terminated with one of CR [HEX:0D], LF [HEX:0A], or a CR/LF pair [iv]. - No limits are imposed on the length of a given line [v]. +ASSOCIATIONS Using multiple tags establishes associations between otherwise unrelated blocks. In the example below, the first block has a tag named 'Apples', the second block has a tag named 'Oranges', and both blocks have a common tag named 'Fruit' as shown in the next two blocks: +Apples, Fruit Block line 1 Block line 2 Block line n... +Oranges, Fruit Block line 1 Block line 2 Block line n... This means you can stream the first block with the 'Apples' tag, stream the second block with the 'Oranges' tag, or stream both blocks via the 'Fruit' tag. The advantage gained is that your data can be filtered in an arbitrary manner. For instance, you could have twelve blocks, each with differing month tags, and a common year tag allowing you to scrutinize your data by month as well as year... +PARSING - TOPIC databases are parsed line-by-line sequentially from top to bottom, and left to right. - Parsing ignores blocks, seeking only tags matching the current query, and when a match is found, outputs the associated block. - Because a given tag can define multiple blocks, the data should be parsed in its entirety 'per query'. +MODIFICATIONS There are no formally sanctioned modifications to the TOPIC database specification. However, the user is free to extend and alter the format as best fits the need provided all legalese is observed. +UPDATES Updates, parsing examples, and other resources are located at: http://www.topcat.hypermart.net/index.html +NOTES i. Parse: To scan/analyze data looking for a desired pattern. ii. ASCII: American Standard Code for Information Interchange. iii. Markup: A system for annotating text. iv. See topic 'HEX TABLE' for ASCII/hexadecimal equivalents. v. Caveat: The user should recognize the constraints governing both the hardware and software rendering the data. +HEX TABLE ASCII/hexadecimal equivalents used in this document: 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 ^@ ^A ^B ^C ^D ^E ^F ^G ^H ^I ^J ^K ^L ^M ^N ^O 1 ^P ^Q ^R ^S ^T ^U ^V ^W ^X ^Y ^Z ^[ ^\ ^] ^^ ^_ 2 SPC ! " # $ % & ' ( ) * + , - . / 3 0 1 2 3 4 5 6 7 8 9 : ; < = > ? 4 @ A B C D E F G H I J K L M N O 5 P Q R S T U V W X Y Z [ \ ] ^ _ 6 ` a b c d e f g h i j k l m n o 7 p q r s t u v w x y z { | } ~ DEL +LEGALESE The TOPIC database specification is copyright Topcat Software LLC. and is absolutely free for anyone to use for any reason in perpetuity. A single line citation is requested in the form of: TOPIC database specification by Topcat Software LLC. eof
Revision: 39575
Updated Code
at February 22, 2011 00:34 by TopcatSoftware
Updated Code
SYNOPSIS This document describes the formal specifications of the TOPIC database format and serves as its canonical reference. Designed for humans first and machines second, the TOPIC format attempts to provide a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well [i]. Uses include: knowledge-bases, glossaries, logging, notes... KEY CONCEPTS - TOPIC databases are OS neutral. - TOPIC databases are self-indexing. - TOPIC databases provide associations linking blocks of data. - TOPIC databases are written and read as standard ASCII [ii], so virtually any plain text editor is suitable for editing chores. - TOPIC databases use fundamentally simple markup [iii] employing only the tab and comma characters to delimit content. - TOPIC databases allow the end user to label data in a straight foreword, intuitive manner. TAGS - Tags are always located above the block it describes, alone on a single line. - Tags only contain alpha/numeric characters A-Z, a-z, 0-9, and optionally spaces [HEX:20] [iv]. - A tag can be either a single word, or a group of words. - Multiple tags are comma delimited [HEX:2C] [iv]. BLOCKS - A block is always located below the tags that describe it. - A block may contain any number of lines, each beginning with a horizontal tab character [HEX:09] [iv]. - Empty lines within a block are valid. LINES - Lines are terminated with one of CR [HEX:0D], LF [HEX:0A], or a CR/LF pair [iv]. - No limits are imposed on the length of a given line [v]. ASSOCIATIONS Using multiple tags establishes associations between otherwise unrelated blocks. In the example below, the first block has a tag named 'Apples', the second block has a tag named 'Oranges', and both blocks have a common tag named 'Fruit' as shown in the next two blocks: Apples, Fruit Block line 1 Block line 2 Block line n... Oranges, Fruit Block line 1 Block line 2 Block line n... This means you can stream the first block with the 'Apples' tag, stream the second block with the 'Oranges' tag, or stream both blocks via the 'Fruit' tag. The advantage gained is that your data can be filtered in an arbitrary manner. For instance, you could have twelve blocks, each with differing month tags, and a common year tag allowing you to scrutinize your data by month as well as year... PARSING - TOPIC databases are parsed line-by-line sequentially from top to bottom, and left to right. - Parsing ignores blocks, seeking only tags matching the current query, and when a match is found, outputs the associated block. - Because a given tag can define multiple blocks, the data should be parsed in its entirety 'per query'. MODIFICATIONS There are no formally sanctioned modifications to the TOPIC database specification. However, the user is free to extend and alter the format as best fits the need provided all legalese is observed. HEX TABLE ASCII/hexadecimal equivalents used in this document: 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 ^@ ^A ^B ^C ^D ^E ^F ^G ^H ^I ^J ^K ^L ^M ^N ^O 1 ^P ^Q ^R ^S ^T ^U ^V ^W ^X ^Y ^Z ^[ ^\ ^] ^^ ^_ 2 SPC ! " # $ % & ' ( ) * + , - . / 3 0 1 2 3 4 5 6 7 8 9 : ; < = > ? 4 @ A B C D E F G H I J K L M N O 5 P Q R S T U V W X Y Z [ \ ] ^ _ 6 ` a b c d e f g h i j k l m n o 7 p q r s t u v w x y z { | } ~ DEL UPDATES Updates, parsing examples, and other resources are located at: http://www.topcat.hypermart.net/index.html NOTES i. Parse: To scan/analyze data looking for a desired pattern. ii. ASCII: American Standard Code for Information Interchange. iii. Markup: A system for annotating text. iv. See topic 'HEX TABLE' for ASCII/hexadecimal equivalents. v. Caveat: The user should recognize the constraints governing both the hardware and software rendering the data. LEGALESE The TOPIC database specification is copyright Topcat Software LLC. and is absolutely free for anyone to use for any reason in perpetuity. A single line citation is requested in the form of: TOPIC database specification by Topcat Software LLC. eof
Revision: 39574
Updated Code
at January 23, 2011 00:59 by TopcatSoftware
Updated Code
SYNOPSIS This document describes the formal specifications of the TOPIC database format and serves as its canonical reference. Designed for humans first and machines second, the TOPIC format attempts to provide a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well [i]. Uses include: knowledge-bases, glossaries, logging, notes... KEY CONCEPTS - TOPIC databases are OS neutral. - TOPIC databases are written (and readable) as standard ASCII [ii], so virtually any plain text editor such as Notepad, TextEdit, and Vim are all equally good choices for editing TOPIC databases. - TOPIC databases use fundamentally simple markup [iii]. - TOPIC databases provide a method of filtering whereby associations are established between otherwise unrelated data. TAGS - Tags are always located above the block it describes, alone on a single line. - Tags only contain alpha/numeric characters A-Z, a-z, 0-9, and optionally spaces [HEX:20] [iv]. - A tag can be either a single word, or a group of words each separated by the space character. - Multiple tags are comma delimited [HEX:2C] [iv]. BLOCKS - A block is always located below the tags that describe it. - A block may contain any number of lines, each beginning with a horizontal tab character [HEX:09] [iv]. - Empty lines within a block are valid. LINES - Lines are terminated with one of CR [HEX:0D], LF [HEX:0A], or a CR/LF pair [iv]. - No limits are imposed on the length of a given line [v]. PARSING - TOPIC databases are parsed line-by-line sequentially from top to bottom, and left to right. - Because a given tag can define multiple blocks, the data should be parsed in its entirety 'per query'. ASSOCIATIONS Using multiple tags establishes associations between otherwise unrelated blocks. In the example below, the first block has a tag named 'Apples', the second block has a tag named 'Oranges', and both blocks have a common tag named 'Fruit': Apples, Fruit Block line 1 Block line 2 Block line n... Oranges, Fruit Block line 1 Block line 2 Block line n... This means you can stream the first block with the 'Apples' tag, stream the second block with the 'Oranges' tag, or stream both blocks via the 'Fruit' tag. The advantage gained is that your data can be filtered in an arbitrary manner. For instance, you could have twelve blocks, each with differing month tags, and a common year tag allowing you to scrutinize your data by month as well as year... MODIFICATIONS There are no formally sanctioned modifications to the TOPIC database specification. However, the user is free to extend and alter the format as best fits the need provided all legalese is observed. HEX TABLE ASCII/hexadecimal equivalents used in this document: 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 ^@ ^A ^B ^C ^D ^E ^F ^G ^H ^I ^J ^K ^L ^M ^N ^O 1 ^P ^Q ^R ^S ^T ^U ^V ^W ^X ^Y ^Z ^[ ^\ ^] ^^ ^_ 2 SPC ! " # $ % & ' ( ) * + , - . / 3 0 1 2 3 4 5 6 7 8 9 : ; < = > ? 4 @ A B C D E F G H I J K L M N O 5 P Q R S T U V W X Y Z [ \ ] ^ _ 6 ` a b c d e f g h i j k l m n o 7 p q r s t u v w x y z { | } ~ DEL UPDATES Updates, parsing examples, and other resources are located at: http://www.topcat.hypermart.net/index.html NOTES i. Parse: To scan/analyze data looking for a desired pattern. ii. ASCII: American Standard Code for Information Interchange. iii. Markup: A system for annotating text. iv. See topic 'HEX TABLE' for ASCII/hexadecimal equivalents. v. Caveat: The user should recognize the constraints governing both the hardware and software rendering the data. LEGALESE The TOPIC database specification is copyright Topcat Software LLC. and is absolutely free for anyone to use for any reason in perpetuity. A single line citation is requested in the form of: TOPIC database specification by Topcat Software LLC. HISTORY - v1.01, revised 1/22/2011. - v1.00, initial release 1/17/2011.
Revision: 39573
Updated Code
at January 19, 2011 13:43 by TopcatSoftware
Updated Code
SYNOPSIS This document describes the formal specifications of the TOPIC database format and serves as its canonical reference. Designed for humans first and machines second, the TOPIC format attempts to provide a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well [i]. Uses include: knowledge-bases, glossaries, logging, notes... KEY CONCEPTS - TOPIC databases are written (and readable) as standard ASCII [ii], so virtually any plain text editor such as Notepad, TextEdit, or Vim are equally good choices for editing TOPIC databases. - TOPIC databases provide a method of filtering text whereby associations are established between otherwise unrelated data. - TOPIC databases use fundamentally simple markup [iii]. TAGS - Tags are always located above the block it describes, alone on a single line. - Tags never contain meta-characters ^$\/()|?+*[]{},. or the horizontal tab character [HEX:09] [iv]. - Multiple tags are comma delimited, thus tag fields never contain commas [HEX:2C] [iv]. BLOCKS - A block is always located below the tags that describe it. - A block may contain any number of lines, each beginning with a horizontal tab character. - Empty lines within a block are valid. LINES - Lines are terminated with one of CR [HEX:0D], LF [HEX:0A], or a CR/LF pair [iv]. - No limits are imposed on the length of a given line [v]. PARSING - TOPIC databases are parsed line-by-line sequentially from top to bottom, and left to right. - Because a given tag can define multiple blocks, the data should be parsed in its entirety 'per query'. ASSOCIATIONS Using multiple tags establishes associations between otherwise unrelated blocks. In the example below, the first block has a tag named 'Apples', the second block has a tag named 'Oranges', and both blocks have a common tag named 'Fruit': Apples, Fruit Block line 1 Block line 2 Block line n... Oranges, Fruit Block line 1 Block line 2 Block line n... This means you can stream the first block with the 'Apples' tag, stream the second block with the 'Oranges' tag, or stream both blocks via the 'Fruit' tag. The advantage gained is that your data can be filtered in an arbitrary manner. For instance, you could have twelve blocks, each with differing month tags, and a common year tag allowing you to scrutinize your data by month as well as year... MODIFICATIONS There are no formally sanctioned modifications to the TOPIC database specification. However, the user is free to extend and alter the format as best fits the need provided all legalese is observed. HEX TABLE ASCII/hexadecimal equivalents used in this document: 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 ^@ ^A ^B ^C ^D ^E ^F ^G ^H ^I ^J ^K ^L ^M ^N ^O 1 ^P ^Q ^R ^S ^T ^U ^V ^W ^X ^Y ^Z ^[ ^\ ^] ^^ ^_ 2 SPC ! " # $ % & ' ( ) * + , - . / 3 0 1 2 3 4 5 6 7 8 9 : ; < = > ? 4 @ A B C D E F G H I J K L M N O 5 P Q R S T U V W X Y Z [ \ ] ^ _ 6 ` a b c d e f g h i j k l m n o 7 p q r s t u v w x y z { | } ~ DEL UPDATES Updates, parsing examples, and other resources are located at: http://www.topcat.hypermart.net/index.html NOTES i. Parse: To scan/analyze data looking for a desired pattern. ii. ASCII: American Standard Code for Information Interchange. iii. Markup: A system for annotating text. iv. See topic 'HEX TABLE' for ASCII/hexadecimal equivalents. v. Caveat: The user should recognize the constraints governing both the hardware and software rendering the data. LEGALESE The TOPIC database specification is copyright Topcat Software LLC. and is absolutely free for anyone to use for any reason in perpetuity. A single line citation is requested in the form of: TOPIC database specification by Topcat Software LLC. HISTORY - TOPIC database specification v1.00, initial release 1/17/2011.
Revision: 39572
Initial Code
Initial URL
Initial Description
Initial Title
Initial Tags
Initial Language
at January 18, 2011 18:54 by TopcatSoftware
Initial Code
SYNOPSIS This document describes the formal specifications of the TOPIC database format and serves as its canonical reference. Designed for humans first and machines second, the TOPIC format attempts to provide a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well [i]. Uses include: knowledge-bases, glossaries, logging, notes... KEY CONCEPTS - TOPIC databases are written (and readable) as standard ASCII [ii], so virtually any plain text editor such as Notepad, TextEdit, or Vim are equally good choices for editing TOPIC databases. - TOPIC databases provide a method of filtering text whereby associations are established between otherwise unrelated data. - TOPIC databases use fundamentally simple markup [iii]. TAGS - Tags are always located above the block it describes, alone on a single line. - Tags never contain meta-characters ^$\/()|?+*[]{},. or the horizontal tab character [HEX:09] [iv]. - Multiple tags are comma delimited, thus tag fields never contain commas [HEX:2C] [iv]. BLOCKS - A block is always located below the tags that describe it. - A block may contain any number of lines, each beginning with a horizontal tab character. - Empty lines within a block are valid. LINES - Line are terminated with one of CR [HEX:0D], LF [HEX:0A], or a CR/LF pair [iv]. - No limits are imposed on the length of a given line [v]. PARSING - TOPIC databases are parsed line-by-line sequentially from top to bottom, and left to right. - Because a given tag can define multiple blocks, the data should be parsed in its entirety 'per query'. ASSOCIATIONS Using multiple tags establishes associations between otherwise unrelated blocks. In the example below, the first block has a tag named 'Apples', the second block has a tag named 'Oranges', and both blocks have a common tag named 'Fruit': Apples, Fruit Block line 1 Block line 2 Block line n... Oranges, Fruit Block line 1 Block line 2 Block line n... This means you can stream the first block with the 'Apples' tag, stream the second block with the 'Oranges' tag, or stream both blocks via the 'Fruit' tag. The advantage gained is that your data can be filtered in an arbitrary manner. For instance, you could have twelve blocks, each with differing month tags, and a common year tag allowing you to scrutinize your data by month as well as year... MODIFICATIONS There are no formally sanctioned modifications to the TOPIC database specification. However, the user is free to extend and alter the format as best fits the need provided all legalese is observed. HEX TABLE ASCII/hexadecimal equivalents used in this document: 0 1 2 3 4 5 6 7 8 9 A B C D E F 0 ^@ ^A ^B ^C ^D ^E ^F ^G ^H ^I ^J ^K ^L ^M ^N ^O 1 ^P ^Q ^R ^S ^T ^U ^V ^W ^X ^Y ^Z ^[ ^\ ^] ^^ ^_ 2 SPC ! " # $ % & ' ( ) * + , - . / 3 0 1 2 3 4 5 6 7 8 9 : ; < = > ? 4 @ A B C D E F G H I J K L M N O 5 P Q R S T U V W X Y Z [ \ ] ^ _ 6 ` a b c d e f g h i j k l m n o 7 p q r s t u v w x y z { | } ~ DEL UPDATES Updates, parsing examples, and other resources are located at: http://www.topcat.hypermart.net/index.html NOTES i. Parse: To scan/analyze data looking for a desired pattern. ii. ASCII: American Standard Code for Information Interchange. iii. Markup: A system for annotating text. iv. See topic 'HEX TABLE' for ASCII/hexadecimal equivalents. v. Caveat: The user should recognize the constraints governing both the hardware and software rendering the data. LEGALESE The TOPIC database specification is copyright Topcat Software LLC. and is absolutely free for anyone to use for any reason in perpetuity. A single line citation is requested in the form of: TOPIC database specification by Topcat Software LLC. HISTORY - TOPIC database specification v1.00, initial release 1/17/2011.
Initial URL
http://www.topcat.hypermart.net/index.html
Initial Description
Specifications of the TOPIC database format, a standardized structure for plain text databases that's easy to read and edit in most text editors, and easy to programmatically parse as well.
Initial Title
TOPIC database format
Initial Tags
database
Initial Language
Pseudocode