IEICE global.ieice.org Site

Keyword Search Result

[Keyword] contingency table(2hit)

1-2hit

Compression by Substring Enumeration Using Sorted Contingency Tables
Takahiro OTA Hiroyoshi MORITA Akiko MANADA

PAPER-Information Theory

Vol:
E103-A No:6
Page(s):
829-835
This paper proposes two variants of improved Compression by Substring Enumeration (CSE) with a finite alphabet. In previous studies on CSE, an encoder utilizes inequalities which evaluate the number of occurrences of a substring or a minimal forbidden word (MFW) to be encoded. The inequalities are derived from a contingency table including the number of occurrences of a substring or an MFW. Moreover, codeword length of a substring and an MFW grows with the difference between the upper and lower bounds deduced from the inequalities, however the lower bound is not tight. Therefore, we derive a new tight lower bound based on the contingency table and consequently propose a new CSE algorithm using the new inequality. We also propose a new encoding order of substrings and MFWs based on a sorted contingency table such that both its row and column marginal total are sorted in descending order instead of a lexicographical order used in previous studies. We then propose a new CSE algorithm which is the first proposed CSE algorithm using the new encoding order. Experimental results show that compression ratios of all files of the Calgary corpus in the proposed algorithms are better than those of a previous study on CSE with a finite alphabet. Moreover, compression ratios under the second proposed CSE get better than or equal to that under a well-known compressor for 11 files amongst 14 files in the corpus.
Approximate Counting Scheme for m n Contingency Tables
Shuji KIJIMA Tomomi MATSUI

PAPER

Vol:
E87-D No:2
Page(s):
308-314
In this paper, we propose a new counting scheme for m n contingency tables. Our scheme is a modification of Dyer and Greenhill's scheme for two rowed contingency tables. We can estimate not only the sizes of error, but also the sizes of the bias of the number of tables obtained by our scheme, on the assumption that we have an approximate sampler.

Keyword Search Result

[Keyword] contingency table(2hit)

Compression by Substring Enumeration Using Sorted Contingency Tables

Approximate Counting Scheme for m n Contingency Tables

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles