Author Search Result

[Author] Brendan FLANAGAN (3 hits)

  • A Web Page Segmentation Approach Using Visual Semantics

    Jun ZENG  Brendan FLANAGAN  Sachio HIROKAWA  Eisuke ITO  

     
    PAPER-Data Engineering, Web Information Systems

    Vol: E97-D  No: 2
    Page(s): 223-230

    Web page segmentation has a variety of benefits and potential web applications. Early web page segmentation techniques are mainly based on machine learning algorithms and rule-based heuristics, which cannot be used for large-scale page segmentation. In this paper, we propose a formulated page segmentation method using visual semantics. Instead of analyzing the visual cues of web pages, this method uses three measures to formulate the visual semantics: the layout tree is used to recognize visually similar blocks; the seam degree describes how neatly the blocks are arranged; and content similarity describes the degree of content coherence between blocks. A comparison experiment was conducted using the VIPS algorithm as a baseline. Experimental results show that the proposed method can divide a web page into appropriate semantic segments.
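
    The abstract does not say how content similarity is computed. As a rough illustration only, the sketch below assumes it can be approximated by cosine similarity over word counts of two blocks' text. The function names `tokenize` and `content_similarity` are hypothetical and do not come from the paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def content_similarity(text_a, text_b):
    """Cosine similarity between the word-count vectors of two blocks.

    Assumption: this stands in for the paper's content similarity
    measure, which the abstract does not define.
    """
    counts_a = Counter(tokenize(text_a))
    counts_b = Counter(tokenize(text_b))
    # Dot product over the tokens the two blocks share.
    dot = sum(counts_a[t] * counts_b[t] for t in counts_a.keys() & counts_b.keys())
    norm_a = math.sqrt(sum(v * v for v in counts_a.values()))
    norm_b = math.sqrt(sum(v * v for v in counts_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty block is treated as unrelated to everything
    return dot / (norm_a * norm_b)
```

    Under this assumption, two blocks with the same words score close to 1 and blocks with no shared words score 0, so a segmenter could merge adjacent blocks whose score exceeds a chosen threshold.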

  • LTDE: A Layout Tree Based Approach for Deep Page Data Extraction

    Jun ZENG  Feng LI  Brendan FLANAGAN  Sachio HIROKAWA  

     
    PAPER-Artificial Intelligence, Data Mining

    Publicized: 2017/02/21
    Vol: E100-D  No: 5
    Page(s): 1067-1078

    Content extraction from deep Web pages has received great attention in recent years. However, the increasingly complicated HTML structure of Web documents makes it more difficult to recognize data records by analyzing only the HTML source code. In this paper, we propose a method named LTDE to extract data records from a deep Web page. Instead of analyzing the HTML source code, LTDE uses the visual features of data records in deep Web pages. A Web page is treated as a finite set of visual blocks, and the data records are the visual blocks that have similar layouts. We also propose a pattern recognition method named layout tree to cluster visual blocks with similar layouts. The weight of each cluster is calculated, and the visual blocks in the cluster with the highest weight are chosen as the data records to be extracted. The experimental results show that LTDE is more effective and more robust for Web data extraction than previous work.
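
    The cluster-then-select step described in this abstract can be sketched as follows. This is not the authors' implementation: the `VisualBlock` type, the use of a tuple of child tag names as the layout signature, and the choice of total rendered area as the cluster weight are all assumptions made for illustration, since the abstract specifies neither the layout tree construction nor the weight formula.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class VisualBlock:
    """Hypothetical visual block: a layout signature plus rendered size."""
    layout: tuple  # assumed signature, e.g. the tags of the child elements
    width: int
    height: int
    text: str = ""


def cluster_by_layout(blocks):
    """Group blocks that share the same layout signature."""
    clusters = defaultdict(list)
    for block in blocks:
        clusters[block.layout].append(block)
    return dict(clusters)


def cluster_weight(cluster):
    """Assumed weight: total rendered area of the blocks in the cluster."""
    return sum(block.width * block.height for block in cluster)


def extract_data_records(blocks):
    """Return the blocks of the highest-weight cluster, or [] if none."""
    clusters = cluster_by_layout(blocks)
    if not clusters:
        return []
    return max(clusters.values(), key=cluster_weight)


# Usage: three similar result entries outweigh a single navigation bar.
page = [
    VisualBlock(("a",), 100, 20, "Home"),
    VisualBlock(("img", "h3", "span"), 300, 100, "Item 1"),
    VisualBlock(("img", "h3", "span"), 300, 100, "Item 2"),
    VisualBlock(("img", "h3", "span"), 300, 100, "Item 3"),
]
records = extract_data_records(page)
```

    Exact matching on the signature is the simplest possible clustering rule; a real system would likely need a tolerance for small layout differences between records.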

  • Learning in the Digital Age: Power of Shared Learning Logs to Support Sustainable Educational Practices

    Hiroaki OGATA  Rwitajit MAJUMDAR  Brendan FLANAGAN  

     
    INVITED PAPER

    Publicized: 2022/10/19
    Vol: E106-D  No: 2
    Page(s): 101-109

    During the COVID-19 pandemic there was a rapid shift to emergency remote teaching practices, and online tools for education gained further attention. While eLearning initiatives are being developed and their implementation at scale is widely discussed, this research focuses on the utilization of data that can be logged in such eLearning systems. We demonstrate the need for and potential of utilizing learning logs to create services that support sustainable quality improvement in education. The Learning and Evidence Analytics Framework (LEAF) is the overarching technology framework, with affordances for adopting evidence-based practices in education. It aims to promote learning for all by introducing data-driven services for personalized approaches.