Real-Time Computation of Local Neighborhood Functions in Application-Specific Instruction-Set Processors

This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood functions and intra-field deinterlacing. The intended application is real-time processing of high definition video. The approach aims at an effi...

Celý popis

Uložené v:
Podrobná bibliografia
Vydané v:IEEE transactions on very large scale integration (VLSI) systems Ročník 20; číslo 11; s. 2031 - 2043
Hlavní autori: Aubertin, P., Langlois, J. M. P., Savaria, Y.
Médium: Journal Article
Jazyk:English
Vydavateľské údaje: New York, NY IEEE 01.11.2012
Institute of Electrical and Electronics Engineers
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Predmet:
ISSN:1063-8210, 1557-9999
On-line prístup:Získať plný text
Tagy: Pridať tag
Žiadne tagy, Buďte prvý, kto otaguje tento záznam!
Abstract This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood functions and intra-field deinterlacing. The intended application is real-time processing of high definition video. The approach aims at an efficient utilization of the available memory bandwidth by fully exploiting the data parallelism inherent to the target algorithm class. An appropriate choice of custom instructions and application-specific registers is used together with a very long instruction word architecture in order to mimic a pipelined systolic array. This leads to a processing speed close to the limit imposed by memory bandwidth constraints. For three intra-field deinterlacing algorithms and 2-D convolution with four kernel sizes, the design approach yields speedup factors between 36 and 1330, Area-Time (AT) product improvements between 12× and 243×, and energy consumption reduction factors between 13 and 262.
AbstractList This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood functions and intra-field deinterlacing. The intended application is real-time processing of high definition video. The approach aims at an efficient utilization of the available memory bandwidth by fully exploiting the data parallelism inherent to the target algorithm class. An appropriate choice of custom instructions and application-specific registers is used together with a very long instruction word architecture in order to mimic a pipelined systolic array. This leads to a processing speed close to the limit imposed by memory bandwidth constraints. For three intra-field deinterlacing algorithms and 2-D convolution with four kernel sizes, the design approach yields speedup factors between 36 and 1330, Area-Time (AT) product improvements between 12× and 243×, and energy consumption reduction factors between 13 and 262.
This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood functions and intra-field deinterlacing. The intended application is real-time processing of high definition video. The approach aims at an efficient utilization of the available memory bandwidth by fully exploiting the data parallelism inherent to the target algorithm class. An appropriate choice of custom instructions and application-specific registers is used together with a very long instruction word architecture in order to mimic a pipelined systolic array. This leads to a processing speed close to the limit imposed by memory bandwidth constraints. For three intra-field deinterlacing algorithms and 2-D convolution with four kernel sizes, the design approach yields speedup factors between 36 and 1330, Area-Time (AT) product improvements between 12 and 243 , and energy consumption reduction factors between 13 and 262.
This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood functions and intra-field deinterlacing. The intended application is real-time processing of high definition video. The approach aims at an efficient utilization of the available memory bandwidth by fully exploiting the data parallelism inherent to the target algorithm class. An appropriate choice of custom instructions and application-specific registers is used together with a very long instruction word architecture in order to mimic a pipelined systolic array. This leads to a processing speed close to the limit imposed by memory bandwidth constraints. For three intra-field deinterlacing algorithms and 2-D convolution with four kernel sizes, the design approach yields speedup factors between 36 and 1330, Area-Time (AT) product improvements between 12[Formula Omitted] and 243[Formula Omitted], and energy consumption reduction factors between 13 and 262.
Author Aubertin, P.
Langlois, J. M. P.
Savaria, Y.
Author_xml – sequence: 1
  givenname: P.
  surname: Aubertin
  fullname: Aubertin, P.
  email: aubertin@grm.polymtl.ca
  organization: Groupe de Rech. en Microelectron. et Microsystemes, Ecole Polytech. de Montreal, Montreal, QC, Canada
– sequence: 2
  givenname: J. M. P.
  surname: Langlois
  fullname: Langlois, J. M. P.
  email: pierre.langlois@polymtl.ca
  organization: Groupe de Rech. en Microelectron. et Microsystemes, Ecole Polytech. de Montreal, Montreal, QC, Canada
– sequence: 3
  givenname: Y.
  surname: Savaria
  fullname: Savaria, Y.
  email: yvon.savaria@polymtl.ca
  organization: Groupe de Rech. en Microelectron. et Microsystemes, Ecole Polytech. de Montreal, Montreal, QC, Canada
BackLink http://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=26570262$$DView record in Pascal Francis
BookMark eNp9kc1q3DAUhUVJoPnpC7QbQShk4-mVZMn2MgxJOzCkpZl2KzTydUfBIzmSvcjbVxNPs8iiQqCL9J2DOOecnPjgkZCPDBaMQfNl83v9sFpwYGzBWQUcynfkjElZFU1eJ3kGJYqaM3hPzlN6BGBl2cAZ2f1E0xcbt0e6DPthGs3ogqeho-tgTU_v0f3ZbUPchdDSu8nbw3OiztObYeidfcGLhwGt65ylK5_GONn5Ekf6IwaLKYWYLslpZ_qEH47nBfl1d7tZfivW37-uljfrwgpZj4Wpy7JqRSdqxXC7bXjeNTJsmBCVsqxpoUXDODey5SUI22wryYRtuW0bJTtxQa5n3yGGpwnTqPcuWex74zFMSTMQNZegoMro1Rv0MUzR598dKChrmXPM1OcjZVJOpIvGW5f0EN3exGfNlcx5K565euZsDClF7LR1c5pjNK7PlvpQlX6pSh-q0seqspS_kf5z_6_o0yxyiPgqUKC4UqX4C7dGoU0
CODEN IEVSE9
CitedBy_id crossref_primary_10_1016_j_asoc_2013_01_015
crossref_primary_10_1007_s11042_017_4641_x
Cites_doi 10.1109/ASAP.2007.4429996
10.1109/TENCON.2008.4766488
10.1109/DATE.2006.244089
10.1109/TVLSI.2009.2025171
10.1109/5.705528
10.1109/40.848473
10.1109/DSD.2006.16
10.1109/TCSII.2007.896778
10.1109/ICVD.2001.902643
10.1145/581199.581251
10.1109/TURBOCODING.2008.4658677
10.1109/30.920453
10.1109/NEWCAS.2009.5290458
10.1109/FPL.2008.4630032
10.1145/998300.997199
10.1109/TCE.2003.1261227
10.1145/1118299.1118474
10.1023/A:1007996916499
10.1109/92.784091
10.1109/TCSII.2005.857091
10.1109/SFCS.1981.34
10.1109/ICECS.2007.4511157
10.1109/TVLSI.2011.2105512
10.1109/TVLSI.2008.2003164
10.1109/MAHC.2006.45
ContentType Journal Article
Copyright 2015 INIST-CNRS
Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Nov 2012
Copyright_xml – notice: 2015 INIST-CNRS
– notice: Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) Nov 2012
DBID 97E
RIA
RIE
AAYXX
CITATION
IQODW
7SP
8FD
L7M
F28
FR3
DOI 10.1109/TVLSI.2011.2170204
DatabaseName IEEE All-Society Periodicals Package (ASPP) 2005-present
IEEE All-Society Periodicals Package (ASPP) 1998-Present
IEEE Electronic Library (IEL)
CrossRef
Pascal-Francis
Electronics & Communications Abstracts
Technology Research Database
Advanced Technologies Database with Aerospace
ANTE: Abstracts in New Technology & Engineering
Engineering Research Database
DatabaseTitle CrossRef
Technology Research Database
Advanced Technologies Database with Aerospace
Electronics & Communications Abstracts
Engineering Research Database
ANTE: Abstracts in New Technology & Engineering
DatabaseTitleList
Engineering Research Database
Technology Research Database
Database_xml – sequence: 1
  dbid: RIE
  name: IEEE Electronic Library (IEL)
  url: https://ieeexplore.ieee.org/
  sourceTypes: Publisher
DeliveryMethod fulltext_linktorsrc
Discipline Engineering
Applied Sciences
EISSN 1557-9999
EndPage 2043
ExternalDocumentID 2726207101
26570262
10_1109_TVLSI_2011_2170204
6062664
Genre orig-research
GroupedDBID -~X
.DC
0R~
29I
3EH
4.4
5GY
5VS
6IK
97E
AAJGR
AARMG
AASAJ
AAWTH
ABAZT
ABFSI
ABQJQ
ABVLG
ACGFS
ACIWK
AENEX
AETIX
AGQYO
AGSQL
AHBIQ
AI.
AIBXA
AKJIK
AKQYR
ALLEH
ALMA_UNASSIGNED_HOLDINGS
ATWAV
BEFXN
BFFAM
BGNUA
BKEBE
BPEOZ
CS3
DU5
E.L
EBS
EJD
HZ~
H~9
ICLAB
IEDLZ
IFIPE
IFJZH
IPLJI
JAVBF
LAI
M43
O9-
OCL
P2P
RIA
RIE
RNS
TN5
VH1
AAYXX
CITATION
IQODW
RIG
7SP
8FD
L7M
F28
FR3
ID FETCH-LOGICAL-c358t-a8447d3f3861ebb92b928e1e913376c19d0dea122a5d2403c9b7513cd2cd965f3
IEDL.DBID RIE
ISICitedReferencesCount 6
ISICitedReferencesURI http://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=Summon&SrcAuth=ProQuest&DestLinkType=CitingArticles&DestApp=WOS_CPL&KeyUT=000307125500010&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
ISSN 1063-8210
IngestDate Thu Oct 02 10:26:42 EDT 2025
Mon Jun 30 10:16:47 EDT 2025
Mon Jul 21 09:14:42 EDT 2025
Tue Nov 18 22:20:45 EST 2025
Sat Nov 29 03:36:04 EST 2025
Tue Aug 26 17:03:50 EDT 2025
IsPeerReviewed true
IsScholarly true
Issue 11
Keywords Application-specific instruction-set processors (ASIPs)
High resolution
Processor
Instruction sets
local neighborhood functions
Systolic network
Algorithm
Video signal processing
video processing
Kernel method
Very long instruction word
Pipeline network
Integrated circuit
Electric power consumption
Parallelism
Millimeter wave integrated circuits
Real time processing
Monolithic integrated circuit
Resource management
Custom circuit
deinterlacing
Language English
License https://ieeexplore.ieee.org/Xplorehelp/downloads/license-information/IEEE.html
CC BY 4.0
LinkModel DirectLink
MergedId FETCHMERGED-LOGICAL-c358t-a8447d3f3861ebb92b928e1e913376c19d0dea122a5d2403c9b7513cd2cd965f3
Notes ObjectType-Article-2
SourceType-Scholarly Journals-1
ObjectType-Feature-1
content type line 14
content type line 23
PQID 1030485204
PQPubID 85424
PageCount 13
ParticipantIDs ieee_primary_6062664
proquest_journals_1030485204
proquest_miscellaneous_1038250607
pascalfrancis_primary_26570262
crossref_citationtrail_10_1109_TVLSI_2011_2170204
crossref_primary_10_1109_TVLSI_2011_2170204
PublicationCentury 2000
PublicationDate 2012-11-01
PublicationDateYYYYMMDD 2012-11-01
PublicationDate_xml – month: 11
  year: 2012
  text: 2012-11-01
  day: 01
PublicationDecade 2010
PublicationPlace New York, NY
PublicationPlace_xml – name: New York, NY
– name: New York
PublicationTitle IEEE transactions on very large scale integration (VLSI) systems
PublicationTitleAbbrev TVLSI
PublicationYear 2012
Publisher IEEE
Institute of Electrical and Electronics Engineers
The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
Publisher_xml – name: IEEE
– name: Institute of Electrical and Electronics Engineers
– name: The Institute of Electrical and Electronics Engineers, Inc. (IEEE)
References beucher (ref33) 2007
ref13
ref12
ref15
ref14
ref30
ref11
ref32
ref10
guo (ref18) 2005
ref2
ref1
doyle (ref24) 1988
ref16
ref19
gupta (ref17) 2004
lee (ref26) 2000; 46
porter (ref22) 2006
ref23
jain (ref3) 2001
ref25
ref20
ref21
ref28
mohammadi (ref27) 2009
(ref31) 2007
ref8
(ref29) 0
ref7
ref9
ref4
ref6
ref5
References_xml – ident: ref10
  doi: 10.1109/ASAP.2007.4429996
– start-page: 112
  year: 2005
  ident: ref18
  article-title: Optimized generation of data-path from c codes for FPGAs
  publication-title: Proc Design Automation and Test Eur Conf (DATE)
– ident: ref5
  doi: 10.1109/TENCON.2008.4766488
– ident: ref14
  doi: 10.1109/DATE.2006.244089
– year: 0
  ident: ref29
  publication-title: Tensilica Inc
– start-page: 412
  year: 1988
  ident: ref24
  article-title: Interlaced to sequential conversion for EDTV applications
  publication-title: Proc 2nd Int Workshop Signal Process HDTV
– ident: ref13
  doi: 10.1109/TVLSI.2009.2025171
– ident: ref23
  doi: 10.1109/5.705528
– ident: ref30
  doi: 10.1109/40.848473
– year: 2007
  ident: ref33
  publication-title: Conception et mise en oeuvre de processeurs configurables pour la conversion de taux de trames vidos avec compensation de mouvement
– ident: ref8
  doi: 10.1109/DSD.2006.16
– ident: ref16
  doi: 10.1109/TCSII.2007.896778
– year: 2004
  ident: ref17
  publication-title: Spark A Parallelizing Approach to the High-Level Synthesis of Digital Circuits
– start-page: 76
  year: 2001
  ident: ref3
  article-title: ASIP design methodologies: Survey and issues
  publication-title: Proc 14th Int Conf VLSI Design
  doi: 10.1109/ICVD.2001.902643
– ident: ref2
  doi: 10.1145/581199.581251
– ident: ref11
  doi: 10.1109/TURBOCODING.2008.4658677
– year: 2009
  ident: ref27
  publication-title: An effective Hybrid Video Deinterlacing Algorithm
– volume: 46
  start-page: 999
  year: 2000
  ident: ref26
  article-title: Adaptive scan rate up-conversion system based on human visual characteristics
  publication-title: IEEE Trans Consumer Electron
  doi: 10.1109/30.920453
– ident: ref21
  doi: 10.1109/NEWCAS.2009.5290458
– ident: ref20
  doi: 10.1109/FPL.2008.4630032
– ident: ref19
  doi: 10.1145/998300.997199
– ident: ref25
  doi: 10.1109/TCE.2003.1261227
– ident: ref15
  doi: 10.1145/1118299.1118474
– ident: ref7
  doi: 10.1023/A:1007996916499
– ident: ref6
  doi: 10.1109/92.784091
– start-page: 119
  year: 2006
  ident: ref22
  publication-title: Reconfigurable Computing-Accelerating Computation With Field-Programmable Gate Arrays
– ident: ref4
  doi: 10.1109/TCSII.2005.857091
– ident: ref28
  doi: 10.1109/SFCS.1981.34
– year: 2007
  ident: ref31
  publication-title: Xtensa Energy Estimator (Xenergy) User's Guide for Xtensa Tools Version 7 1 0
– ident: ref32
  doi: 10.1109/ICECS.2007.4511157
– ident: ref9
  doi: 10.1109/TVLSI.2011.2105512
– ident: ref12
  doi: 10.1109/TVLSI.2008.2003164
– ident: ref1
  doi: 10.1109/MAHC.2006.45
SSID ssj0014490
Score 2.0290198
Snippet This paper presents a systematic approach to the design of application-specific instruction-set processors for high speed computation of local neighborhood...
SourceID proquest
pascalfrancis
crossref
ieee
SourceType Aggregation Database
Index Database
Enrichment Source
Publisher
StartPage 2031
SubjectTerms Algorithms
Application specific processors
Application-specific instruction-set processors (ASIPs)
Applied sciences
Circuit properties
Computation
Computer architecture
deinterlacing
Design engineering
Design. Technologies. Operation analysis. Testing
Electric, optical and optoelectronic circuits
Electronics
Exact sciences and technology
High definition video
Image edge detection
Integrated circuits
Integrated circuits by function (including memories and processors)
local neighborhood functions
Mathematical analysis
Mathematical models
Microwave circuits, microwave integrated circuits, microwave transmission lines, submillimeter wave circuits
Processors
Program processors
Real time
Real time systems
Semiconductor electronics. Microelectronics. Optoelectronics. Solid state devices
Studies
Very large scale integration
video processing
Title Real-Time Computation of Local Neighborhood Functions in Application-Specific Instruction-Set Processors
URI https://ieeexplore.ieee.org/document/6062664
https://www.proquest.com/docview/1030485204
https://www.proquest.com/docview/1038250607
Volume 20
WOSCitedRecordID wos000307125500010&url=https%3A%2F%2Fcvtisr.summon.serialssolutions.com%2F%23%21%2Fsearch%3Fho%3Df%26include.ft.matches%3Dt%26l%3Dnull%26q%3D
hasFullText 1
inHoldings 1
isFullTextHit
isPrint
journalDatabaseRights – providerCode: PRVIEE
  databaseName: IEEE Electronic Library (IEL)
  customDbUrl:
  eissn: 1557-9999
  dateEnd: 99991231
  omitProxy: false
  ssIdentifier: ssj0014490
  issn: 1063-8210
  databaseCode: RIE
  dateStart: 19930101
  isFulltext: true
  titleUrlDefault: https://ieeexplore.ieee.org/
  providerName: IEEE
link http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV1La9wwEB6S0EN6aF4t3bxQIbdEiS3LknUMJUsWlqU0myU3I-tBF4Jdspv-_oxkr7chpRCjg0GyLfRJnhmN5huAM44VnheapkwKyvHfSLXQmlYmTxKPl468BbOxnEyKhwf1YwMu-lgY51w8fOYuw2305dvGPIetsitUtlGe8E3YlFK2sVq9x4Bz1TIPiIwWaMesAmQSdTWdje9GLVsnKuAhGvSVEIpZVcKZSL3AYfFtPos3v-Yob4Y77-vpLnzq9Epy3U6EPdhw9T58_Itt8AB-_USlkIaYD9LmcoigkMaTcRBoZBJ2SXFKBKJjMkR5F6ckmdfkeu3lpjFhvZ8bMlpzz9I7tyRdyEHztPgM98Ob6fdb2iVaoCbLiyXVBefSZj4rROqqSjEshUudQgNWCpMqm1inU8Z0bgN_n1GVzNPMWGasErnPvsBW3dTuK5DKK-uMxuIrrrSthGEGdbRUoWUkdTWAdDXypelYyEMyjMcyWiOJKiNaZUCr7NAawHn_zO-Wg-O_rQ8CHn3LDooBnL4CuK9n4fAPE2wAxyvEy24dL8qQhI0XeXztt74aV2Bwq-jaNc-xDZrZiUjk4b8_fQTb2EHWRjAewxaC407gg_mznC-eTuM0fgH2MfGP
linkProvider IEEE
linkToHtml http://cvtisr.summon.serialssolutions.com/2.0.0/link/0/eLvHCXMwlV3fS90wFD44HUwfdJuTXXUug71tmU2aps2jyC5e1l3GvBPfSpofeEHa4b3u799J2luVycCSh0LSNuRLes7JyfkOwEeBFV4UmjKeSyrw30i11JrWJksSj5eOvAUXZT6dFpeX6scafB5iYZxz8fCZ-xJuoy_ftuY2bJUdo7KN8kQ8g41MCM66aK3BZyCE6rgHZEoLtGRWITKJOp5dlOeTjq8TVfAQD_pADMW8KuFUpF7gwPguo8U_P-coccY7T-vrS9juNUty0k2FV7DmmtewdY9vcBeufqJaSEPUB-myOURYSOtJGUQamYZ9UpwUgeqYjFHixUlJ5g05ufNz05iy3s8Nmdyxz9JztyR90EF7s3gDv8ZfZ6dntE-1QE2aFUuqCyFym_q0kMzVteJYCsecQhM2l4Ypm1inGec6s4HBz6g6z1hqLDdWycyne7DetI17C6T2yjqjsfhaKG1rabhBLY0ptI1yXY-ArUa-Mj0PeUiHcV1FeyRRVUSrCmhVPVoj-DQ887tj4fhv692Ax9Cyh2IERw8AHup5OP7DJR_B4Qrxql_JiyqkYRNFFl_7YajGNRgcK7px7W1sg4Z2IpN8__FPv4cXZ7PvZVVOpt8OYBM7y7t4xkNYR6DcO3hu_izni5ujOKX_Aqdv9NY
openUrl ctx_ver=Z39.88-2004&ctx_enc=info%3Aofi%2Fenc%3AUTF-8&rfr_id=info%3Asid%2Fsummon.serialssolutions.com&rft_val_fmt=info%3Aofi%2Ffmt%3Akev%3Amtx%3Ajournal&rft.genre=article&rft.atitle=Real-Time+Computation+of+Local+Neighborhood+Functions+in+Application-Specific+Instruction-Set+Processors&rft.jtitle=IEEE+transactions+on+very+large+scale+integration+%28VLSI%29+systems&rft.au=Aubertin%2C+Philippe&rft.au=Langlois%2C+J.+M.+Pierre&rft.au=Savaria%2C+Yvon&rft.date=2012-11-01&rft.issn=1063-8210&rft.eissn=1557-9999&rft.volume=20&rft.issue=11&rft.spage=2031&rft.epage=2043&rft_id=info:doi/10.1109%2FTVLSI.2011.2170204&rft.externalDBID=n%2Fa&rft.externalDocID=10_1109_TVLSI_2011_2170204
thumbnail_l http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/lc.gif&issn=1063-8210&client=summon
thumbnail_m http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/mc.gif&issn=1063-8210&client=summon
thumbnail_s http://covers-cdn.summon.serialssolutions.com/index.aspx?isbn=/sc.gif&issn=1063-8210&client=summon