IEICE global.ieice.org Site

Keyword Search Result

[Keyword] OMP(3945hit)

3801-3820hit(3945hit)

Compaction of Test Sets for Combinational Circuits Based on Symbolic Fault Simulation
Hiroyuki HIGUCHI Nagisa ISHIURA Shuzo YAJIMA

PAPER-Test

Vol:
E76-D No:9
Page(s):
1121-1127
Since the time required for testing logic circuits is proportional to the number of test vectors, the size of test sets as well as test generation time is one of the most important factors to be considered in test generation. The size of test sets becomes an essential issue, especially for scan designed circuits, because of the need to shift a test vector serially into the scan path. In this paper, we propose new methods of generating compact test sets to detect al the irredundant single stuck-at faults in combinational circuits. The proposed algorithms calculate a test function for each fault which corresponds to the set of all test vectors for the fault and generate a compact test set by analyzing the test functions. The analysis is based on finding a test vector which detects the largest number of remaining faults. Since our methods select a test vector among all the test vectors, represented by a test function, for a target fault, smaller test sets can be generated, in general, than that by conventional test set compaction methods. The experimental results show that the size of test sets generated by our method is about one-third as large as that without compaction.
Optimization of Sequential Synchronous Digital Circuits Using Structural Models
Giovanni De MICHELI

INVITED PAPER-Logic Synthesis

Vol:
E76-D No:9
Page(s):
1018-1029
We present algorithms for the optimization of sequential synchronous digital circuits using structural model, i.e. interconnections of combinational logic gates and synchronous registers. This approach contrasts traditional methods using state diagrams or transition tables and leveraging state minimization and encoding techniques. In particular, we model circuits by synchronous logic networks, that are weighted multigraphs representing interconnections of gates implementing scalar combinational functions. With this modeling style, area and path delays are explicit and their variation is easy to compute when circuit transformations are applied. Sequential logic optimization may target cycle-time or area minimization, possibly under area or cycle-time constraints. Optimization is performed by a sequence of transformations, directed to the desired goal. This paper describes the fundamental mechansms for transformations applicable to sequential circuits. We review first retiming and peripheral retiming techniques. The former method optimizes the position of the registers, while the latter repositions the registers to enlarge maximally the combinational region where combinational restructuring algorithms can be applied. We consider then synchronous algebraic and Boolean transformations, that blend combinational transformations with local retiming. Both classes of transformations require the representation of circuits by means of logic expressions with labeled variables, the labels representing discrete time-points. Algebraic transformations entail manipulation of time-labeled expressions with algebraic techniques. Boolean transformations exploit the properties of Boolean algebra and benefit from the knowledge of don't care conditions in the search for the best implementation of local functions. Expressing don't care conditions for sequential circuits is harder than for combinational circuits, because of the interaction of variables with different time labels. In addition, the feasibility of replacing a local function with another one may not always be verified by checking for the inclusion of the induced perturbation in local explicit don't care set. Indeed, the behavior of sequential circuits, that can be described appropriately by the relation between input and output traces, may require relational models to express don't care conditions. We describe a general formalism for sequential optimization by Boolean transformations, where the don't care conditions are expressed implicitly by synchronous recurrence equations. We present then an optimization method for this model, that can exploit degrees of freedom in optimization not possible for other methods, and hence providing solutions of possible superior quality. We conclude by summarizing the major features and limitations of optimization methods using structural models.
An Automated Approach to Generating Leaf Cells for a Macro Cell Configuration
Ritsu KUSABA Hiroshi MIYASHITA Takumi WATANABE

PAPER-Computer Aided Design (CAD)

Vol:
E76-A No:8
Page(s):
1334-1342
This paper describes a new automated approach to generating the patterns of CMOS leaf cells from transistor-level connectivity data. This method can generate CMOS leaf cells that are configurable to a macro cell satisfying user-specified constraints. The user-specified constraints include the aspect ratio and port positions of the macro cell. We propose a top-down method for converting the macro cell level constratints to leaf cell level ones. Using this method, a variety of customized macro cells can be designed in a short turn-around time. The method consists of four processes--diffusion sharing, initial placement, placement improvement and routing--which culminate in the automatic generation of symbolic representations. Using a compactor, those symbolic representations can be converted to physical patterns which are gathered into a macro cell by a macro generator. We define various objective functions to improve unit pair placement. We also introduce five ways to optimize leaf cell area: 1) multi-row division, 2) gate division 3) rotation, 4) power line and diffusion overlapping and 5) reconstruction of hierarchical structure. The proposed approach has been applied to various kinds of CMOS leaf cells. Experimental results show that the generated cells have almost the same areas as those generated by conventional bottom-up approaches in leaf and macro cell layouts. This approach offers a further advantage in that the various-sized macro cells required by layout disigners can also be generated.
Design of Josephson Ternary Delta-Gate (δ-Gate)
Ali Massoud HAIDAR Fu-Qiang LI Mititada MORISUE

PAPER-Computer Hardware and Design

Vol:
E76-D No:8
Page(s):
853-862
A new circuit design of Josephson ternary δ-gate composed of Josephson junction devices is presented. Mathematical theory for synthesizing, analyzing, and realizing any given function in ternary system using Josephson ternary δ-gate is introduced. The Josephson ternary δ-gate is realized using SQUID technique. Circuit simulation results using J-SPICE demonstrated the feasibility and the reliability operations of Josephson ternary δ-gate with very high performances for both speed and power consumption (max. propagation delay time44 ps and max. power consumption2.6µW). The Josephson ternary δ-gate forms a complete set (completeness) with the ternary constants (1, 0, 1). The number of SQUIDs that are needed to perform the operation of δ-gate is 6. Different design with less than 6 SQUIDs is not possible because it can not perform the operation of δ-gate. The advantages of Josephson ternary δ-gate compared with different Josephson logic circuits are as follows: The δ-gate has the property that a simple realization to any given ternary logic function as the building blocks can be achieved. The δ-gate has simple construction with small number of SQUIDs. The δ-gate can realize a large number of ternary functions with small number of input/output pins. The performances of δ-gate is very high, very low power consumption and ultra high speed switching operation.
Microwave Characteristics of High-T_c Superconductors by Improved Three-Fluid Model
Tadashi IMAI Takaaki SAKAKIBARA Yoshio KOBAYASHI

PAPER

Vol:
E76-C No:8
Page(s):
1275-1279
In order to explain the temperature and frequency characteristics of high-Tc superconductors, a new model is proposed, which will be called the improved three-fluid model, where the momentum relaxation time τ is assumed to depend on temperature in the superconducting and normal states, respectively, although τ has been assumed to be independent of temperature for the conventional three-fluid model. According to this model, the complex conductivity σ1jσ2 and the surface impedance ZsRsjXs, where Rs is the surface resistance and Xs is the surface reactance, are expressed as a function of temperature. The temperature dependences of Zs and for a YBCO bulk estimated using this model agree very well with ones measured by the dielectric-loaded cavity method in room to cryogenic temperature. In particular, a peak of σ1 observed just below the critical temperature Tc in experiments, appeared in the calculated results based on this model. This phenomenon has been already known in the BCS theory. Thus, it is verified that this model is useful to explain the microwave characteristics of high-Tc superconductors in room to cryogenic temperature. On the other hand, the residual normal electron density nres4.2541023 m-3 and the total electron density nt7.3081024 m-3 are obtained by calculation. The ratio nres/nt0.058 can be used as figure of merit to evaluate material quality of high-Tc superconductors; thus it means that there is 5.8% nonpairing electron in this YBCO bulk.
Definition of Attributed Random Graph and Proposal of Its Applications
Dong Su SEONG Ho Sung KIM Kyu Ho PARK

PAPER-Image Processing, Computer Graphics and Pattern Recognition

Vol:
E76-D No:8
Page(s):
919-925
In this paper, we define an attributed random graph, which can be considered as a generalization of conventional ones, to include multiple attributes as well as numeric attribute instead of a single nominal attribute in random vertices and edges. Then we derive the probability equations for an attributed graph to be an outcome graph of the attributed random graph, and the equations for the entropy calculation of the attributed random graph. Finally, we propose the application areas to computer vision and machine learning using these concepts.
A Modular Inversion Hardware Algorithm with a Redundant Binary Representation
Naofumi TAKAGI

PAPER-Computer Hardware and Design

Vol:
E76-D No:8
Page(s):
863-869
A hardware algorithm for modular inversion is proposed. It is based on the extended Euclidean algorithm. All intermediate results are represented in a redundant binary representation with a digit set {0, 1,1}. All addition/subtractions are performed without carry propagation. A modular inversion is carried out in O (n) clock cycles where n is the word length of the modulus. The length of each clock cycle is constant independent of n. A modular inverter based on the algorithm has a regular cellular array structure with a bit slice feature and is very suitable for VLSI implementation. Its amount of hardware is proportional to n.
A Network-Topology-Independent Static Task Allocation Strategy for Massively Parallel Computers
Takanobu BABA Akehito GUNJI Yoshifumi IWAMOTO

PAPER-Computer Networks

Vol:
E76-D No:8
Page(s):
870-881
A network-topology-independent static task allocation strategy has been designed and implemented for massively parallel computers. For mapping a task graph to a processor graph, this strategy evaluates several functions that represent some intuitively feasible properties or the graphs. They include the connectivity with the allocated nodes, distance from the median of a graph, connectivity with candidate nodes, and the number of candidate nodes within a distance. Several greedy strategies are defined to guide the mapping process, utilizing the indicated function values. An allocation system has been designed and implemented based on the allocation strategy. In experiments we have defined about 1000 nodes in task graphs with regular and irregular topologies, and the same order of processors with mesh, tree, and hypercube topologies. The results are summarized as follows. 1) The system can yield 4.0 times better total communication costs than an arbitrary allocation. 2) It is difficult to select a single strategy capable of providing the best solutions for a wide range of task-processor combinations. 3) Comparison with hypercube-topology-dependent research indicates that our topology-independent allocator produces better results than the dependent ones. 4) The order of computaion time of the allocator is experimentally proved to be O (n2) where n represents the number of tasks.
Interpolation of CT Slices for Laser Stereolithography
Takanori NAGAE Takeshi AGUI Hiroshi NAGAHASHI

PAPER-Image Processing, Computer Graphics and Pattern Recognition

Vol:
E76-D No:8
Page(s):
905-911
An algorithm interpolating parallel cross-sections between CT slices is described. Contours of equiscalar or constant-density surfaces on cross-sections are directly obtained as non-intersecting loops from grayscale slice images. This algorithm is based on a general algorithm that the authors have proposed earlier, constructing triangulated orientable closed surfaces from grayscale volumes and is particularly suited for a new technique, called laser stereolithography, which creates real 3D plastic objects using UV laser to scan and harden liquid polymer. The process of laser stereolithography is executed slice by slice, and this technique really requires some interpolation of intermediate cross-sections between slices. For visualizing, surfaces are only expected to be shaded almost continuously. The local defects are invisible and not cared about if the picture resolution is rather poor. On the contrary, topological faults are fatal to construct solid models by laser stereolithography, i.e., every contour line on cross-sections must be closed with no intersection. Not a single break of a contour line is tolerated. We already have many algorithms available for equiscalar surface construction, and it seems that if we cut the surfaces, then contour lines could be obtained. However, few of them are directly applicable to solid modeling. Marching cubes algorithm, for example, does not ensure the consistency of surface topology. Our algorithm guarantee an adequate topology of contour lines.
An Architecture for Parallelism of OPS5 Production Systems
Tsuyoshi KAWAGUCHI Etsuro HONDA

PAPER-Artificial Intelligence and Cognitive Science

Vol:
E76-D No:8
Page(s):
935-946
In this paper we propose an architecture and an algorithm for the parallel execution of OPS5 production systems. It is known that current OPS5 production system interpreters spend almost 90% of their execution time in the match step. Thus, in this paper we focus on the speedup of the match step. The match algorithm used in OPS5 is called Rete and the algorithm uses a special kind of a date-flow network compiled from the left hand sides of rules. To achieve the maximum degree of parallelism of a given OPS5 program by as few processors as possible, the proposed parallel machine uses loosely coupled multiprocessors. Parallel machines designed for fine-grain parallelism, such as DADO, also use loosely coupled multiprocessors. However, the proposed machine differs from such machines at the following points: use of powerful processors which have large amounts of memories and small cycle times; use of a shared Rete network (parallel machines designed for fine-grain parallelism use an unshared Rete network); high hardware utilization. Basic ideas of the proposed parallel machine are as follows. (1) Use of a modified Rete network in which node sharing is used only for constant-test nodes and each memory node is lumped with the child two-input node. (2) Static allocation of the nodes of the modified Rete network onto processors. (3) Partition of the set of processors into three subsets: constant-test node processors, two-input node processors and conflict-set processors. (4) Use of a ring network for the interconnection network among two-input node processors. In addition to an architecture for parallel execution of OPS5 production systems, we propose a scheme for mapping the modified Rete network into the proposed architecture. The results of simulation experiments showed that the proposed architecture is promising for parallel execution of OPS5 production systems.
Neural Network Approach to Characterization of Cirrhotic Parenchymal Echo Patterns
Shin-ya YOSHINO Akira KOBAYASHI Takashi YAHAGI Hiroyuki FUKUDA Masaaki EBARA Masao OHTO

PAPER-Biomedical Signal Processing

Vol:
E76-A No:8
Page(s):
1316-1322
We have calssified parenchymal echo patterns of cirrhotic liver into four types, according to the size of hypoechoic nodular lesions. Neural network technique has been applied to the characterization of hepatic parenchymal diseases in ultrasonic B-scan texture. We employed a multi-layer feedforward neural network utilizing the back-propagation algorithm. We carried out four kinds of pre-processings for liver parenchymal pattern in the images. We describe the examination of each performance by these pre-processing techniques. We show four results using (1) only magnitudes of FFT pre-processing, (2) both magnitudes and phase angles, (3) data normalized by the maximum value in the dataset, and (4) data normalized by variance of the dataset. Among the 4 pre-processing data treatments studied, the process combining FFT phase angles and magnitudes of FFT is found to be the most efficient.
Multiple-Valued Code Assignment Algorithm for VLSI-Oriented Highly Parallel k-Ary Operation Circuits
Saneaki TAMAKI Michitaka KAMEYAMA

PAPER-Multiple-Valued Architectures and Systems

Vol:
E76-C No:7
Page(s):
1112-1118
Design of high-speed digital circuits such as adders and multipliers is one of the most important issues to implement high performance VLSI systems. This paper proposes a new multiple-valued code assignment algorithm to implement locally computable combinational circuits for k-ary operations. By the decomposition of a given k-ary operation into unary operations, a code assignment algorithm for k-ary operations is developed. Partition theory usually used in the design of sequential circuits is effectively employed for optimal code assignment. Some examples are shown to demonstrate the usefulness of the proposed algorithm.
Hardware Architecture for Kohonen Network
Hidetoshi ONODERA Kiyoshi TAKESHITA Keikichi TAMARU

PAPER-Neural Networks and Chips

Vol:
E76-C No:7
Page(s):
1159-1166
We propose a fully digital architecture for Kohonen network suitable for VLSI implementation. The proposed architecture adopts a functional memory type parallel processor (FMPP) architecture which has a structure similar to a content addressable memory (CAM). One word of CAM is regarded as a processing element and a group of elements forms a neuron. All processing elements execute the same operation in bit-serial but in processor-parallel. Thus the number of instructions for realizing the network algorithm is independent of the number of neurons in the network. With reference to a previously reported CAM, we estimate a network with 96 neurons for speech recognition could be integrated on three chips using a 1.2 µm process, and it operates 50 times faster than a sequential hardware. Owing to its highly regular structure of memories, the proposed hardware architecture is well compatible with current VLSI technology.
Design of Highly Parallel Linear Digital System for ULSI Processors
Masami NAKAJIMA Michitaka KAMEYAMA

PAPER-Multiple-Valued Architectures and Systems

Vol:
E76-C No:7
Page(s):
1119-1125
To realize next-generation high performance ULSI processors, it is a very important issue to reduce the critical delay path which is determined by a cascade chain of basic gates. To design highly parallel digital operation circuits such as an adder and a multiplier, it is difficult to find the optimal code assignment in the non-linear digital system. On the other hand, the use of the linear concept in the digital system seems to be very attractive because analytical methods can be utilized. To meet the requirement, we propose a new design method of highly parallel linear digital circuits for unary operations using the concept of a cycle and a tree. In the linear digital circuit design, the analytical method can be developed using a representation matrix, so that the search procedure for optimal locally computable circuits becomes very simple. The evaluations demonstrate the usefulness of the circuit design algorithm.
A Universal Coding Scheme Based on Minimizing Minimax Redundancy for Sources with an Unknown Model
Joe SUZUKI

PAPER-Information Theory and Coding Theory

Vol:
E76-A No:7
Page(s):
1234-1239
This paper's main objective is to clearly describe the construction of a universal code for minimizing Davisson's minimax redundancy in a range where the true model and stochastic parameters are unknown. Minimax redundancy is defined as the maximum difference between the expected persymbol code length and the per-symbol source entropy in the source range. A universal coding scheme is here formulated in terms of the weight function, i.e., a method is presented for determining a weight function which minimizes the minimax redundancy even when the true model is unknown. It is subsequently shown that the minimax redundancy achieved through the presented coding method is upper-bounded by the minimax redundancy of Rissanen's semi-predictive coding method.
Evaluations for Estimation of an Information Source Based on State Decomposition
Joe SUZUKI

PAPER-Information Theory and Coding Theory

Vol:
E76-A No:7
Page(s):
1240-1251
This paper's main objective is to analyze several procedures which select the model g among a set G of stochastic models to minimize the value of an information criterion in the form of L(g)H[g](zn)+(k(g)/2)c(n), where zn is the n observed data emitted by an information source θ which consists of the model gθ∈G and k(gθ) mutually independent stochastic parameters in the model gθ∈G, H[g](zn) is (-1) (the maximum log likelihood value of the data zn with respect to a model g∈G), and c(n) is a predetermined function (penalty function) of n which controls the amount of penalty for increasing the model size. The result is focused on specific performances when the information criteria are applied to the framework of so-called state decomposition. Especially, upper bounds are derived of the following two performance measures for each penalty function c(n): the error probability of the model selection, and the average Kullback-Leibler information between the true information source and the estimated information source.
Constant Round Perfect ZKIP of Computational Ability
Toshiya ITOH Kouichi SAKURAI

PAPER-Information Security and Cryptography

Vol:
E76-A No:7
Page(s):
1225-1233
In this paper, we show that without any unproven assumption, there exists a "four" move blackbox simulation perfect zero-knowledge interactive proof system of computational ability for any random self-reducible relation R whose domain is in BPP, and that without any unproven assumption, there exists a "four" move blackbox simulation perfect zero-knowledge interactive proof system of knowledge on the prime factorization. These results are optimal in the light of the round complexity, because it is shown that if a relation R has a three move blackbox simulation (perfect) zero-knowledge interactive proof system of computational ability (or of knowledge), then there exists a probabilistic polynomial time algorithm that on input x ∈ {0, 1}*, outputs y such that (x, y)∈R with overwhelming probability if x ∈dom R, and outputs "⊥" with probability 1 if x dom R.
A Programmable Parallel Digital Neurocomputer
Yoshiyuki SHIMOKAWA Yutaka FUWA Naruhiko ARAMAKI

PAPER-Neural Networks and Chips

Vol:
E76-C No:7
Page(s):
1197-1205
We developed programmable high-performance and high-speed neurocomputer for a large neural network using ASIC neurocomputing chips made by CMOS VLSI technology. The neurocomputer consists of one master node and multiple slave nodes which are connected by two data paths, a broadcast bus and a ring bus. The nodes are made by ASIC chips and each chip has plural nodes in it. The node has four types of computation hardware that can be cascaded in series forming a pipeline. Processing speed is proportional to the number of nodes. The neurocomputer is built on one printed circuit board having 65 VLSI chips that offers 1.5 billion connections/sec. The neurocomputer uses SIMD for easy programming and simple hardware. It can execute complicated computations, memory access and memory address control, and data paths control in a single instruction and in a single time step using the pipeline. The neurocomputer processes forward and backward calculations of multilayer perceptron type neural networks, LVQ, feedback type neural networks such as Hopfield model, and any other types by programming. To compute neural computation effectively and simply in a SIMD type neurocomputer, new processing methods are proposed for parallel computation such as delayed instruction execution, and reconfiguration.
Non von Neumann Chip Architecture--Present and Future--
Tadashi AE Reiji AIBARA

INVITED PAPER

Vol:
E76-C No:7
Page(s):
1034-1044
The recent non von Neumann chip architectures are mainly classified into the AI architecture and the neural architecture. We focus on these two categories, and introduce the representatives each with a brief history. The AI chip architecture is difficult to escape essentially from the von Neumann architecture as far as it is language-oriented. The neural architecture, however, may yield an essentially new computer architecture, when the new device technologies will support it. In particular, the optoelectronics and the quantum electronics will provide a lot of powerful technologies.
A Discussion on the Feedback Strategies in Computerized Testing
Takako AKAKURA Keizo NAGAOKA

LETTER

Vol:
E76-A No:7
Page(s):
1199-1203
The authors examined the effect of feedbacking information on learners of their test results obtained through computerized tests. The learner's acceptability of computerized tests was revealed to be improved by distribution and explanation of newly devised feedback charts including data on one's response history and response latency during computerized testing that was carried out in formative evaluation. The feedback chart composed of graphic representation of relationship between degree of difficulty of each question and its response latency got a particularly high evaluation among learners. It was revealed that types of feedback chart that stood highest in learner's estimate varied with the learner traits. This observation will serve to develop educational systems that incorporate computerized tests into school lessons.

3801-3820hit(3945hit)

Keyword Search Result

[Keyword] OMP(3945hit)

Compaction of Test Sets for Combinational Circuits Based on Symbolic Fault Simulation

Optimization of Sequential Synchronous Digital Circuits Using Structural Models

An Automated Approach to Generating Leaf Cells for a Macro Cell Configuration

Design of Josephson Ternary Delta-Gate (δ-Gate)

Microwave Characteristics of High-T_c Superconductors by Improved Three-Fluid Model

Definition of Attributed Random Graph and Proposal of Its Applications

A Modular Inversion Hardware Algorithm with a Redundant Binary Representation

A Network-Topology-Independent Static Task Allocation Strategy for Massively Parallel Computers

Interpolation of CT Slices for Laser Stereolithography

An Architecture for Parallelism of OPS5 Production Systems

Neural Network Approach to Characterization of Cirrhotic Parenchymal Echo Patterns

Multiple-Valued Code Assignment Algorithm for VLSI-Oriented Highly Parallel k-Ary Operation Circuits

Hardware Architecture for Kohonen Network

Design of Highly Parallel Linear Digital System for ULSI Processors

A Universal Coding Scheme Based on Minimizing Minimax Redundancy for Sources with an Unknown Model

Evaluations for Estimation of an Information Source Based on State Decomposition

Constant Round Perfect ZKIP of Computational Ability

A Programmable Parallel Digital Neurocomputer

Non von Neumann Chip Architecture--Present and Future--

A Discussion on the Feedback Strategies in Computerized Testing

Latest Issue

Links

Call for Papers

Submit to IEICE Trans.

Transactions NEWS

Popular articles