Skip to main content

Showing 1–17 of 17 results for author: Fujita, S

  1. arXiv:2407.03631  [pdf, other

    cs.CE cs.LG physics.data-an physics.geo-ph

    On the performance of sequential Bayesian update for database of diverse tsunami scenarios

    Authors: Reika Nomura, Louise A. Hirao Vermare, Saneiki Fujita, Donsub Rim, Shuji Moriguchi, Randall J. LeVeque, Kenjiro Terada

    Abstract: Although the sequential tsunami scenario detection framework was validated in our previous work, several tasks remain to be resolved from a practical point of view. This study aims to evaluate the performance of the previous tsunami scenario detection framework using a diverse database consisting of complex fault rupture patterns with heterogeneous slip distributions. Specifically, we compare the… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 15 pages, 12 figures

  2. arXiv:2308.02926  [pdf, other

    cs.IR cs.CL cs.LG cs.NI

    Towards Consistency Filtering-Free Unsupervised Learning for Dense Retrieval

    Authors: Haoxiang Shi, Sumio Fujita, Tetsuya Sakai

    Abstract: Domain transfer is a prevalent challenge in modern neural Information Retrieval (IR). To overcome this problem, previous research has utilized domain-specific manual annotations and synthetic data produced by consistency filtering to finetune a general ranker and produce a domain-specific ranker. However, training such consistency filters are computationally expensive, which significantly reduces… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

  3. arXiv:2208.14210  [pdf, other

    cs.DB cs.AI cs.LG

    Learned k-NN Distance Estimation

    Authors: Daichi Amagata, Yusuke Arai, Sumio Fujita, Takahiro Hara

    Abstract: Big data mining is well known to be an important task for data science, because it can provide useful observations and new knowledge hidden in given large datasets. Proximity-based data analysis is particularly utilized in many real-life applications. In such analysis, the distances to k nearest neighbors are usually employed, thus its main bottleneck is derived from data retrieval. Much efforts h… ▽ More

    Submitted 27 November, 2022; v1 submitted 29 August, 2022; originally announced August 2022.

    Comments: Accepted to SIGSPATIAL2022 (as short paper)

  4. arXiv:2104.06646  [pdf

    cs.CY

    Influenza Surveillance using Search Engine, SNS, On-line Shopping, Q&A Service and Past Flu Patients

    Authors: Taichi Murayama, Nobuyuki Shimizu, Sumio Fujita, Shoko Wakamiya, Eiji Aramaki

    Abstract: Influenza, an infectious disease, causes many deaths worldwide. Predicting influenza victims during epidemics is an important task for clinical, hospital, and community outbreak preparation. On-line user-generated contents (UGC), primarily in the form of social media posts or search query logs, are generally used for prediction for reaction to sudden and unusual outbreaks. However, most studies re… ▽ More

    Submitted 14 April, 2021; originally announced April 2021.

    Comments: 18pages, 3 figures

  5. arXiv:2011.09140  [pdf, other

    cs.CL

    Diverse and Non-redundant Answer Set Extraction on Community QA based on DPPs

    Authors: Shogo Fujita, Tomohide Shibata, Manabu Okumura

    Abstract: In community-based question answering (CQA) platforms, it takes time for a user to get useful information from among many answers. Although one solution is an answer ranking method, the user still needs to read through the top-ranked answers carefully. This paper proposes a new task of selecting a diverse and non-redundant answer set rather than ranking the answers. Our method is based on determin… ▽ More

    Submitted 18 November, 2020; originally announced November 2020.

    Comments: COLING2020, 12 pages

  6. arXiv:2011.04241  [pdf, other

    cs.CL

    Pointing to Subwords for Generating Function Names in Source Code

    Authors: Shogo Fujita, Hidetaka Kamigaito, Hiroya Takamura, Manabu Okumura

    Abstract: We tackle the task of automatically generating a function name from source code. Existing generators face difficulties in generating low-frequency or out-of-vocabulary subwords. In this paper, we propose two strategies for copying low-frequency or out-of-vocabulary subwords in inputs. Our best performing model showed an improvement over the conventional method in terms of our modified F1 and accur… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 12 pages, accepted to COLING2020

  7. Syndromic surveillance using search query logs and user location information from smartphones against COVID-19 clusters in Japan

    Authors: Shohei Hisada, Taichi Murayama, Kota Tsubouchi, Sumio Fujita, Shuntaro Yada, Shoko Wakamiya, Eiji Aramaki

    Abstract: [Background] Two clusters of coronavirus disease 2019 (COVID-19) were confirmed in Hokkaido, Japan in February 2020. To capture the clusters, this study employs Web search query logs and user location information from smartphones. [Material and Methods] First, we anonymously identified smartphone users who used a Web search engine (Yahoo! JAPAN Search) for the COVID-19 or its symptoms via its comp… ▽ More

    Submitted 21 April, 2020; originally announced April 2020.

  8. arXiv:1910.10410  [pdf, other

    cs.IR cs.LG

    BanditRank: Learning to Rank Using Contextual Bandits

    Authors: Phanideep Gampa, Sumio Fujita

    Abstract: We propose an extensible deep learning method that uses reinforcement learning to train neural networks for offline ranking in information retrieval (IR). We call our method BanditRank as it treats ranking as a contextual bandit problem. In the domain of learning to rank for IR, current deep learning models are trained on objective functions different from the measures they are evaluated on. Since… ▽ More

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: 9 pages

  9. arXiv:1908.06664  [pdf, ps, other

    cs.CC cs.DM math.CO

    Safe sets in digraphs

    Authors: Yandong Bai, Jørgen Bang-Jensen, Shinya Fujita, Anders Yeo

    Abstract: A non-empty subset $S$ of the vertices of a digraph $D$ is called a {\it safe set} if \begin{itemize} \item[(i)] for every strongly connected component $M$ of $D-S$, there exists a strongly connected component $N$ of $D[S]$ such that there exists an arc from $M$ to $N$; and \item[(ii)] for every strongly connected component $M$ of $D-S$ and every strongly connected component $N$ of $D[S]$, we ha… ▽ More

    Submitted 19 August, 2019; originally announced August 2019.

  10. arXiv:1902.10895  [pdf

    cs.CV

    What you get is not always what you see: pitfalls in solar array assessment using overhead imagery

    Authors: Wei Hu, Kyle Bradbury, Jordan M. Malof, Boning Li, Bohao Huang, Artem Streltsov, K. Sydny Fujita, Ben Hoen

    Abstract: Effective integration planning for small, distributed solar photovoltaic (PV) arrays into electric power grids requires access to high quality data: the location and power capacity of individual solar PV arrays. Unfortunately, national databases of small-scale solar PV do not exist; those that do are limited in their spatial resolution, typically aggregated up to state or national levels. While se… ▽ More

    Submitted 25 July, 2022; v1 submitted 28 February, 2019; originally announced February 2019.

    Comments: 25 pages

  11. arXiv:1808.09648  [pdf, other

    cs.CL cs.AI cs.CV

    Adapting Visual Question Answering Models for Enhancing Multimodal Community Q&A Platforms

    Authors: Avikalp Srivastava, Hsin Wen Liu, Sumio Fujita

    Abstract: Question categorization and expert retrieval methods have been crucial for information organization and accessibility in community question & answering (CQA) platforms. Research in this area, however, has dealt with only the text modality. With the increasing multimodal nature of web content, we focus on extending these methods for CQA questions accompanied by images. Specifically, we leverage the… ▽ More

    Submitted 25 May, 2019; v1 submitted 29 August, 2018; originally announced August 2018.

    Comments: Submitted for review at CIKM 2019

  12. arXiv:1709.03260  [pdf, ps, other

    cs.IR

    A Short Note on Proximity-based Scoring of Documents with Multiple Fields

    Authors: Tomohiro Manabe, Sumio Fujita

    Abstract: The BM25 ranking function is one of the most well known query relevance document scoring functions and many variations of it are proposed. The BM25F function is one of its adaptations designed for modeling documents with multiple fields. The Expanded Span method extends a BM25-like function by taking into considerations of the proximity between term occurrences. In this note, we combine these two… ▽ More

    Submitted 11 September, 2017; originally announced September 2017.

    Comments: 2 pages

  13. Physically unclonable function using initial waveform of ring oscillators on 65 nm CMOS technology

    Authors: Tetsufumi Tanamoto, Satoshi Takaya, Nobuaki Sakamoto, Hirotsugu Kasho, Shinichi Yasuda, Takao Marukame, Shinobu Fujita, Yuichiro Mitani

    Abstract: A silicon physically unclonable function (PUF) using ring oscillators (ROs) has the advantage of easy application in both an application specific integrated circuit (ASIC) and a field-programmable gate array (FPGA). Here, we provide a RO-PUF using the initial waveform of the ROs based on 65 nm CMOS technology. Compared with the conventional RO-PUF, the number of ROs is greatly reduced and the time… ▽ More

    Submitted 10 February, 2017; originally announced March 2017.

    Comments: 5 pages, 9 figures

    Journal ref: Jpn. J. Appl. Phys. 56, 04CF13 (2017)

  14. arXiv:1606.03147  [pdf, ps, other

    cs.CR

    High-Speed Magnetoresistive Random-Access Memory Random Number Generator Using Error-Correcting Code

    Authors: Tetsufumi Tanamoto, Naoharu Shimomura, Sumio Ikegawa, Mari Matsumoto, Shinobu Fujita, Hiroaki Yoda

    Abstract: A high-speed random number generator (RNG) circuit based on magnetoresistive random-access memory (MRAM) using an error-correcting code (ECC) post processing circuit is presented. ECC post processing increases the quality of randomness by increasing the entropy of random number. { We experimentally show that a small error-correcting capability circuit is sufficient for this post processing. It is… ▽ More

    Submitted 9 June, 2016; originally announced June 2016.

    Comments: 5 pages, 11 figures

    Journal ref: Jpn. J. Appl. Phys. 50, 04DM01 (2011)

  15. Physically Unclonable Function using Initial Waveform of Ring Oscillators

    Authors: Tetsufumi Tanamoto, Shinich Yasuda, Satoshi Takaya, Shinobu Fujita

    Abstract: A silicon physically unclonable function (PUF) is considered to be one of the key security system solutions for local devices in an era in which the internet is pervasive. Among many proposals, a PUF using ring oscillators (RO-PUF) has the advantage of easy application to FPGA. In the conventional RO-PUF, frequency difference between two ROs is used as one bit of ID. Thus, in order to obtain an ID… ▽ More

    Submitted 11 May, 2016; originally announced May 2016.

    Comments: 11 pages, 10 figures

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs Vol. 64, pp827 - 831 (2017)

  16. A Scheme for Maximal Resource Utilization in Peer-to-Peer Live Streaming

    Authors: Bahaa Aldeen Alghazawy, Satoshi Fujita

    Abstract: Peer-to-Peer streaming technology has become one of the major Internet applications as it offers the opportunity of broadcasting high quality video content to a large number of peers with low costs. It is widely accepted that with the efficient utilization of peers and server's upload capacities, peers can enjoy watching a high bit rate video with minimal end-to-end delay. In this paper, we presen… ▽ More

    Submitted 7 October, 2015; originally announced October 2015.

    Comments: 16 pages in International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.5, September 2015

    Journal ref: International Journal of Computer Networks & Communications (IJCNC) Vol.7, No.5, September 2015

  17. arXiv:1204.2712  [pdf, ps, other

    cs.AI cs.HC cs.IR

    Learning to Rank Query Recommendations by Semantic Similarities

    Authors: Sumio Fujita, Georges Dupret, Ricardo Baeza-Yates

    Abstract: Logs of the interactions with a search engine show that users often reformulate their queries. Examining these reformulations shows that recommendations that precise the focus of a query are helpful, like those based on expansions of the original queries. But it also shows that queries that express some topical shift with respect to the original query can help user access more rapidly the informat… ▽ More

    Submitted 12 April, 2012; originally announced April 2012.

    Comments: 2nd International Workshop on Usage Analysis and the Web of Data (USEWOD2012) in the 21st International World Wide Web Conference (WWW2012), Lyon, France, April 17th, 2012

    Report number: WWW2012USEWOD/2012/fuduba ACM Class: H.3.3; H.3.5