SlideShare ist ein Scribd-Unternehmen logo
1 von 4
Downloaden Sie, um offline zu lesen
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3565
Generation of HTML Code using Machine Learning Techniques from
Mock-Up Images
Shweta Patil1, Rutuja Pawar2, Shraddha Punder3, Jacob John4
1,2,3,4Departmentof Computer Engineering, Pillai HOC College of Engineering and Technology, Rasayan, India
---------------------------------------------------------------------***----------------------------------------------------------------------
Abstract - The designing of a website starts with building
mock-ups for particular site web pages by manually or using
visual computerization and specified mock-up development
tools. Then mock-ups are transferred into structuredHTML or
comparable labeled code by programming engineers. It’s
common practice for developers to transform a web page
mock-up into code. This method is normally repeated on
various more occasions until the perfect design is made. Sadly,
this procedure is complicated and tedious. In this paper, we
present an approach to automate the code generation from
mock-ups which will reduce time and it is cost-effective. Some
profound learning strategies are used to actualize the
proposed framework.
Key Words: HTML, Mock-ups, Machine Learning, Dataset,
Convolution Neural Network.
1. INTRODUCTION
The importance of the Internet websites has increased
considerably due to the progress madeintoday’stechnology.
These days’ sites mirror the essence of the states,
organizations, networks, individuals, etc. The Information
Technology field is ever-evolving. Truly, the devices utilized
by engineers are reliably getting all the more impressive,
supportive and the results quality is additionally expanding.
On the other hand, official organizations expect to offer
progressively productive types of assistance. Most present-
day user-facing software programming applications are
Graphical User Interface (GUI) driven,anddependonalluring
User Interface (UI). The front-end of each site is a “web page”
which is the piece of the site that grabs the attention of the
end user. It is essential to serve a page that grabs the
attention of the end-user, it is simple, but difficult to utilize
and has enough effective nature. In any case, creating pages
that react effectively to these necessities includes a difficult
approach. In the development of web pages, graphic
designers, softwarespecialist,end-users,corporatespecialist
and individuals utilized in a wide range of territories are
required to cooperate. Normally, the procedure begins with
the counterfeit up the structure of the UI by the graphic
designers, according to the necessities of the organization.
Programming specialists composecodeforthewebsitepages
dependent on these drafts. Sites made by organizations go
forward for monetary explanations behind item showcasing
or promoting purposes. The subsequent pages may change
dependent on response got by the end users. This process
includes lots of repetitive work. We rewrite the code for
segments has the same functionsandcontinuousevolutionin
page structure makes is a very boring process. Many startup
companiescreatesoftwareprototypestoshowcasetheirideas
and ensure investor support would likewise significantly
profit by quickly application prototyping. Instead of wasting
time and money on repeatedly designing and coding UI, a
precise computerized approach would almost certainly be
liked. This would permit small scale organizations to put
more concentrate on components and esteem and less on
making an interpretationofstructuresintousefulapplication
code. Avoidthedisappointmentsthatfront-endengineersand
originators face with building precise GUIs, this exposes that
the need for more effective resolution in web pagedesigning.
Designing of web page by automatic code generation is
getting importance as a research topic. Automatic creationof
web pages decreases programming time, process cost and
source utilization. Because of the quicker dynamic plan
arranges, the final site is delivered in a shorter time.
In this study, an approach has been created to automatically
produce the HTML code for the mockup of a website page. It
is planned to perceive the parts made in themock-upimages
and to encode them as indicated by the website page
hierarchy. We can use the dataset from pix2code paper for
generating bootstrap websites. By utilizing Twitter’s
bootstrap, we can merge HTML and CSS and diminishing the
size of the vocabulary. The deep neural network model
including Convolution Neural Networks (CNN) is utilized to
train the images present on the data sets.
2. LITERATURE REVIVEW
An algorithm named Reverse Engineering Mobile
Application User Interfaces (REMAUI) finds the elements of
the UI a mobile application for example buttons, text-boxes
and pictures, it makes the code for them from the
screenshots of an application window [2]. It transforms to
the code from the screen pictures or drawings for mobile
platforms, PC vision and optical character acknowledgment
techniques are utilized. Despite the fact that the REMAUI [2]
strategy works effectively, it doesn’t assist cross-page
change and animations inside the page. Creators builtupthe
P2A [3] algorithm to cure the lacks of the REMAUI
calculation.
Creators built up the pix2code algorithm which expects to
change over the graphical interface for a website page to
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3566
structured code using deep learning with convolution and
repetitive neural networks [4].
The algorithm Redraw takes mock-ups ofmobileapplication
screens and makes an organized XML code for it [5]. In the
first phase of their implementation PC visionproceduresare
utilized to distinguish singular GUI components. The second
phase includes the order of the recognized components as
indicated by their function, for example, toggle button, text-
area, etc. Right now, convolution neural networks are
utilized. In the last phase, the XML code is produced by
joining the K-Nearest Neighbour’s (KNN) algorithm as
indicated by the web programming hierarchy.
These days open source code libraries, for example, GitHub
[1] are utilized very common to share code and applications.
It is a typical practice to explore this repository and reuse
code when beginning or improving programming ventures.
The common codes in these libraries lessen a similar code
being composed again and again by various individuals.
The creators utilize an inquiry program called SUISE in
which the clients characterize a graphical interface with
straightforward drawingsandkeywords [6].Thisinterfaceis
then finding in existing libraries to get comparative
interfaces. These interfaces are transformed into operable
codes and came back to the end client to choose the most
reasonable interface.
3. PROPOSED SYSTEM
Fig.1 System Architecture
Producing computer code written in a given programming
language from a mock-up image can be differ with the
undertaking of creating English printed sketch from a scene
photography. In the two situations, we need to deliver a
variable-length series of tokens from pixel value. We would
thus be able to partition our concern into threesub-issues.
Initial, a computer vision issue of understanding the given
scene (i.e. HTML picture) and inducing the item present,
their personalities, positions, and poses (for example
buttons,labels, component). Second, a language displaying
issue of getting content (for example code) and producing
grammaticallyandsemanticallyrightexamples.Thelasttest
is to utilize the answers for both past sub-issues by exploit
the inactive factors derived from scene comprehension to
produce relating textualdescription(forexamplePCcodeas
opposed to English) of the articles characterized by these
factors.
3.1 Vision Model
CNNs are presently thestrategyfordecisiontounderstanda
wide scope of vision issues thanks to their topology
permitting them to take in rich inactive presentations from
the pictures theyare trained on [4]. Formappingamock-up
to a learned fixed-length vector we used the CNN model to
perform unsupervised learning.
The input pictures are at first re-sized to 256 × 256 pixels
and the pixel values are standardized before to be fed inthe
CNN [4]. No more pre-processing is carried out. For
encrypting every input picture into a fixed-size output
vector [4], We just used a minimal 3 × 3 responsive field.
These functions are applied twice before to down-sample
along max-pooling,the range of thefirstconvolutionallayer
is 32, trailed bya layer of width 64, lastly width128[4].Two
completely associated layers of size 1024 applying the
amended direct unit initiation complete the vision model.
3.2 Language Model
Westructureda basiclightweight Domain-specific language
(DSL) to represent HTML. Right now, are just inspired by
the HTML format, the various graphical parts, and their
connections; in this way the real printed estimation of the
value is neglected. Also, to reduce size of the search space,
the DSL integrity additionally diminishesthecapacityofthe
vocabulary. As a result,alanguagemodelcanachievetoken-
level language modeling with a discrete input by using one-
hot encoded vectors; eliminating the need for word
embedding techniquessuchasword2vec[10]thatcanbring
in expensive computing. In maximum programming
languages, a component is announced with an opening
token; if child components are consisting inside a block, an
end token is generally required the compiler. In such a
situation where the number of children components
contained in a parent component is variable, it is necessary
to show long term conditions to have the option to close a
block that has been opened. Hochreiter and Schmidhuber
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3567
proposed the Long Short-Term Memory (LSTM) [7] neural
engineering so as to address this very issue.
3.3 Decoder
This model is trained in a supervised learning way by taking
care of a picture I and a logical arrangement X of T tokens xt
and the token xT as the target label. A CNN-based vision
model encodes the input image I into a vectoral picture p.
The information token xt is encoded by a LSTM-based
language model into intermediary representation qt
permitting the model to concentrate more on specific tokens
and lesson others [8]. This first language model is carried
out as a stack of two LSTM layers with 128 cells each. The
vision- encoded vector p and the language encoded vector qt
[4] are linked into isolated component vector rt which is
thenfed into a second LSTM-based model extract the picture
learned by both the vision model and the language model.
The decoder in this manner figures out how to show the
connection among objects present in the information HTML
picture andthe related tokens present in the DSL code. We
execute decoder as two LSTM layers having512cells,eachas
a stack. The discrete idea of the output permits to lessen the
task to a characterization issue. Specially, our model
consists output model having an equivalent count of cells
from vocabulary size; along these lines producing a
probability appropriation of the competitor tokens at each
time step permitting the utilization of a softmax layer to
perform multi-class characterization.
3.4 Training
Length T of the order utilized for training is main aim to
model long term dependencies; for instance, as farashavean
option to the closing of an opened block. The DSL input
documents utilized for preparing were divided with asliding
window of size 48.For each token in theinputDSLdocument,
the model is along these lines taken care of with both an
information picture and a relevant order of T = 48 tokens.
Despite the fact that the context utilized for preparing is
refreshed at each time step (for example every token) by
sliding the window,exactlytheidenticalinformationpictureI
is reused for tests related with a similar GUI. The special
tokens <START >and <END > are utilized to separately prefix
and addition the DSL records comparably to the strategy
utilized by Karpathy and Fei-Fei [8].
3.5 Sampling
For producing DSL code, we provide the HTML picture I and
a logical grouping X of T = 48 tokens where tokens xt. . . xt-1
are at first set vacant and the past token of the arrangement
xt is set to the unique <START > token. The anticipated token
yt is then used to refresh the following arrangement of
relevant tokens. Simply put, xt . . . xT −1 as xt+1 . . .xT with xt as
yt. The procedure is rehashed until the token <END > is
produced by the model. The produced DSL token
arrangement would then compile to generate desired
output.
4. CONCLUSION
Transformingwebsitesmock-upsintomark-upcodewithless
time along with developmentcost has been a crucial point.In
this paper, we developed an approach which accepts web
pagemock-ups, process them and generatestructuredHTML
code.Adatasetcomprisingpictures,includingdifferentmock-
ups of web page structures were utilized. This dataset is used
to train the CNN model. Although our work exhibits the
capability of such a framework to automate the procedure
of executing GUIs, we just started to expose what is feasible.
Our model comprises of generally not many limitations and
prepared on a comparable little dataset. The nature of the
created code could be definitely enhanced via preparing a
greater model on altogether more information for an
increasing count of time- span.
REFERENCES
[1] Sketch2code. Microsoft AI Labs. [Online]. Available:
https://github.com/Microsoft/ailab/tree/master/Sketch2Co
de/model/images
[2] T. A. Nguyen and C. Csallner, “Reverse Engineering
Mobile Application User Interfaces with REMAUI (T),” in
2015 30th IEEE/ACM International Conference on
Automated Software Engi- neering (ASE). IEEE, nov 2015,
pp.248–259.[Online].Available:
http://ieeexplore.ieee.org/document/7372013/
[3] S. Natarajan and C. Csallner, “P2A:AToolforConverting
Pixels to Animated Mobile Application User Interfaces,”
Proceedings of the5th International Conference onMobile
Software Engineering and Systems -MOBILESoft ’18, pp.
224–235, 2018. [Online]. Available:
http://dl.acm.org/citation.cfm?doid=3197231.3197249
[4] T. Beltramelli, “pix2code: Generating code from a
graphical user inter- face screenshot,” CoRR, vol.
abs/1705.07962,2017. [Online].
Available: http://arxiv.org/abs/1705.07962
[5] K. P. Moran, C. Bernal-Cardenas, M. Curcio, R. Bonett,
and D. Poshy-´vanyk,“Machinelearning-based prototyping
of graphical user interfaces for mobile apps,” IEEE
Transactions on Software Engineering, pp. 1–1, 2018.
[6] [Online]. Available: https://github.com/ [7] S. P. Reiss, Y.
Miao, and Q. Xin, “Seeking the user interface,” Automated
Software Engineer- ing, vol. 25, no. 1, pp. 157–193, mar
2018. [Online].
[7] S. Hochreiter and J. Schmidhuber. Long short-term
memory. Neural computation, 9(8):1735– 1780, 1997.
[8] A. Karpathy and L. Fei-Fei. Deep visual-semantic
alignments for generating image descriptions. In
Proceedings of the IEEE Conference on Computer Vision
and Pattern Recognition, pages 3128–3137, 2015.
International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056
Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072
© 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3568
[9] A. Graves. Generating sequences with recurrent neural
networks. arXiv preprint arXiv:1308.0850, 2013.
[10] T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J.
Dean. Distributed representations of words and phrases
and their compositionality. In Advances in neural
information processing systems, pages 3111–3119, 2013.

Weitere ähnliche Inhalte

Ähnlich wie IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up Images

Appalanaidu_4.4 Years Exp in DotNet Technology
Appalanaidu_4.4 Years Exp in DotNet TechnologyAppalanaidu_4.4 Years Exp in DotNet Technology
Appalanaidu_4.4 Years Exp in DotNet Technology
APPALANAIDU KONDALA
 
Rachit_HMI_Development_resume
Rachit_HMI_Development_resumeRachit_HMI_Development_resume
Rachit_HMI_Development_resume
Rachit Kushwaha
 
William Impey CV
William Impey CVWilliam Impey CV
William Impey CV
Will Impey
 
Anand_Honeywell_11yrs
Anand_Honeywell_11yrsAnand_Honeywell_11yrs
Anand_Honeywell_11yrs
Anand Patil
 

Ähnlich wie IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up Images (20)

Analyzing Optimal Practises for Web Frameworks
Analyzing Optimal Practises for Web FrameworksAnalyzing Optimal Practises for Web Frameworks
Analyzing Optimal Practises for Web Frameworks
 
IRJET- An Sla-Aware Cloud Coalition Formation Approach for Virtualized Networks.
IRJET- An Sla-Aware Cloud Coalition Formation Approach for Virtualized Networks.IRJET- An Sla-Aware Cloud Coalition Formation Approach for Virtualized Networks.
IRJET- An Sla-Aware Cloud Coalition Formation Approach for Virtualized Networks.
 
Appalanaidu_4.4 Years Exp in DotNet Technology
Appalanaidu_4.4 Years Exp in DotNet TechnologyAppalanaidu_4.4 Years Exp in DotNet Technology
Appalanaidu_4.4 Years Exp in DotNet Technology
 
Project copy
Project   copyProject   copy
Project copy
 
Web-Based Lighting Automation System
Web-Based Lighting Automation SystemWeb-Based Lighting Automation System
Web-Based Lighting Automation System
 
Automatic Graphical Design Generator
Automatic Graphical Design GeneratorAutomatic Graphical Design Generator
Automatic Graphical Design Generator
 
MAGE PROCESSING BASED BILLING STRUCTURE USING EDGE COMPUTING AND REACTJS
MAGE PROCESSING BASED BILLING STRUCTURE USING EDGE COMPUTING AND REACTJSMAGE PROCESSING BASED BILLING STRUCTURE USING EDGE COMPUTING AND REACTJS
MAGE PROCESSING BASED BILLING STRUCTURE USING EDGE COMPUTING AND REACTJS
 
Automated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU ArchitectureAutomated Image Captioning – Model Based on CNN – GRU Architecture
Automated Image Captioning – Model Based on CNN – GRU Architecture
 
IRJET- IoT based Vending Machine with Cashless Payment
IRJET- IoT based Vending Machine with Cashless PaymentIRJET- IoT based Vending Machine with Cashless Payment
IRJET- IoT based Vending Machine with Cashless Payment
 
Andiappan_Profile
Andiappan_ProfileAndiappan_Profile
Andiappan_Profile
 
IRJET- Design Automation of Cam Lobe Modeling in Creo using C#
IRJET- Design Automation of Cam Lobe Modeling in Creo using C#IRJET- Design Automation of Cam Lobe Modeling in Creo using C#
IRJET- Design Automation of Cam Lobe Modeling in Creo using C#
 
Deepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine LearningDeepcoder to Self-Code with Machine Learning
Deepcoder to Self-Code with Machine Learning
 
Rachit_HMI_Development_resume
Rachit_HMI_Development_resumeRachit_HMI_Development_resume
Rachit_HMI_Development_resume
 
IRJET- Build a Secure Web based Code Editor for C Programming Language
IRJET-  	  Build a Secure Web based Code Editor for C Programming LanguageIRJET-  	  Build a Secure Web based Code Editor for C Programming Language
IRJET- Build a Secure Web based Code Editor for C Programming Language
 
IRJET- Implementation of Web Enabled Notice Board using SOC
IRJET- Implementation of Web Enabled Notice Board using SOCIRJET- Implementation of Web Enabled Notice Board using SOC
IRJET- Implementation of Web Enabled Notice Board using SOC
 
William Impey CV
William Impey CVWilliam Impey CV
William Impey CV
 
Intelligent Real-time 3D Configuration Platform for Customizing E-commerce Pr...
Intelligent Real-time 3D Configuration Platform for Customizing E-commerce Pr...Intelligent Real-time 3D Configuration Platform for Customizing E-commerce Pr...
Intelligent Real-time 3D Configuration Platform for Customizing E-commerce Pr...
 
INTELLIGENT REAL-TIME 3D CONFIGURATION PLATFORM FOR CUSTOMIZING E-COMMERCE PR...
INTELLIGENT REAL-TIME 3D CONFIGURATION PLATFORM FOR CUSTOMIZING E-COMMERCE PR...INTELLIGENT REAL-TIME 3D CONFIGURATION PLATFORM FOR CUSTOMIZING E-COMMERCE PR...
INTELLIGENT REAL-TIME 3D CONFIGURATION PLATFORM FOR CUSTOMIZING E-COMMERCE PR...
 
Anand_Honeywell_11yrs
Anand_Honeywell_11yrsAnand_Honeywell_11yrs
Anand_Honeywell_11yrs
 
IRJET - Architectural Visualizer in Android
IRJET - Architectural Visualizer in AndroidIRJET - Architectural Visualizer in Android
IRJET - Architectural Visualizer in Android
 

Mehr von IRJET Journal

Mehr von IRJET Journal (20)

TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
TUNNELING IN HIMALAYAS WITH NATM METHOD: A SPECIAL REFERENCES TO SUNGAL TUNNE...
 
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURESTUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
STUDY THE EFFECT OF RESPONSE REDUCTION FACTOR ON RC FRAMED STRUCTURE
 
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
A COMPARATIVE ANALYSIS OF RCC ELEMENT OF SLAB WITH STARK STEEL (HYSD STEEL) A...
 
Effect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil CharacteristicsEffect of Camber and Angles of Attack on Airfoil Characteristics
Effect of Camber and Angles of Attack on Airfoil Characteristics
 
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
A Review on the Progress and Challenges of Aluminum-Based Metal Matrix Compos...
 
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
Dynamic Urban Transit Optimization: A Graph Neural Network Approach for Real-...
 
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
Structural Analysis and Design of Multi-Storey Symmetric and Asymmetric Shape...
 
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
A Review of “Seismic Response of RC Structures Having Plan and Vertical Irreg...
 
A REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADASA REVIEW ON MACHINE LEARNING IN ADAS
A REVIEW ON MACHINE LEARNING IN ADAS
 
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
Long Term Trend Analysis of Precipitation and Temperature for Asosa district,...
 
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD ProP.E.B. Framed Structure Design and Analysis Using STAAD Pro
P.E.B. Framed Structure Design and Analysis Using STAAD Pro
 
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
A Review on Innovative Fiber Integration for Enhanced Reinforcement of Concre...
 
Survey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare SystemSurvey Paper on Cloud-Based Secured Healthcare System
Survey Paper on Cloud-Based Secured Healthcare System
 
Review on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridgesReview on studies and research on widening of existing concrete bridges
Review on studies and research on widening of existing concrete bridges
 
React based fullstack edtech web application
React based fullstack edtech web applicationReact based fullstack edtech web application
React based fullstack edtech web application
 
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
A Comprehensive Review of Integrating IoT and Blockchain Technologies in the ...
 
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
A REVIEW ON THE PERFORMANCE OF COCONUT FIBRE REINFORCED CONCRETE.
 
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
Optimizing Business Management Process Workflows: The Dynamic Influence of Mi...
 
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic DesignMultistoried and Multi Bay Steel Building Frame by using Seismic Design
Multistoried and Multi Bay Steel Building Frame by using Seismic Design
 
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
Cost Optimization of Construction Using Plastic Waste as a Sustainable Constr...
 

Kürzlich hochgeladen

Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Dr.Costas Sachpazis
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
dharasingh5698
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
Tonystark477637
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
rknatarajan
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
ankushspencer015
 

Kürzlich hochgeladen (20)

ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
(INDIRA) Call Girl Bhosari Call Now 8617697112 Bhosari Escorts 24x7
 
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
Structural Analysis and Design of Foundations: A Comprehensive Handbook for S...
 
chapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineeringchapter 5.pptx: drainage and irrigation engineering
chapter 5.pptx: drainage and irrigation engineering
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptxBSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
BSides Seattle 2024 - Stopping Ethan Hunt From Taking Your Data.pptx
 
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
(INDIRA) Call Girl Aurangabad Call Now 8617697112 Aurangabad Escorts 24x7
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
AKTU Computer Networks notes --- Unit 3.pdf
AKTU Computer Networks notes ---  Unit 3.pdfAKTU Computer Networks notes ---  Unit 3.pdf
AKTU Computer Networks notes --- Unit 3.pdf
 
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
The Most Attractive Pune Call Girls Manchar 8250192130 Will You Miss This Cha...
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
The Most Attractive Pune Call Girls Budhwar Peth 8250192130 Will You Miss Thi...
 
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Walvekar Nagar Call Me 7737669865 Budget Friendly No Advance Booking
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...Call for Papers - International Journal of Intelligent Systems and Applicatio...
Call for Papers - International Journal of Intelligent Systems and Applicatio...
 
Coefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptxCoefficient of Thermal Expansion and their Importance.pptx
Coefficient of Thermal Expansion and their Importance.pptx
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
Generative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPTGenerative AI or GenAI technology based PPT
Generative AI or GenAI technology based PPT
 
Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)Java Programming :Event Handling(Types of Events)
Java Programming :Event Handling(Types of Events)
 

IRJET- Generation of HTML Code using Machine Learning Techniques from Mock-Up Images

  • 1. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3565 Generation of HTML Code using Machine Learning Techniques from Mock-Up Images Shweta Patil1, Rutuja Pawar2, Shraddha Punder3, Jacob John4 1,2,3,4Departmentof Computer Engineering, Pillai HOC College of Engineering and Technology, Rasayan, India ---------------------------------------------------------------------***---------------------------------------------------------------------- Abstract - The designing of a website starts with building mock-ups for particular site web pages by manually or using visual computerization and specified mock-up development tools. Then mock-ups are transferred into structuredHTML or comparable labeled code by programming engineers. It’s common practice for developers to transform a web page mock-up into code. This method is normally repeated on various more occasions until the perfect design is made. Sadly, this procedure is complicated and tedious. In this paper, we present an approach to automate the code generation from mock-ups which will reduce time and it is cost-effective. Some profound learning strategies are used to actualize the proposed framework. Key Words: HTML, Mock-ups, Machine Learning, Dataset, Convolution Neural Network. 1. INTRODUCTION The importance of the Internet websites has increased considerably due to the progress madeintoday’stechnology. These days’ sites mirror the essence of the states, organizations, networks, individuals, etc. The Information Technology field is ever-evolving. Truly, the devices utilized by engineers are reliably getting all the more impressive, supportive and the results quality is additionally expanding. On the other hand, official organizations expect to offer progressively productive types of assistance. Most present- day user-facing software programming applications are Graphical User Interface (GUI) driven,anddependonalluring User Interface (UI). The front-end of each site is a “web page” which is the piece of the site that grabs the attention of the end user. It is essential to serve a page that grabs the attention of the end-user, it is simple, but difficult to utilize and has enough effective nature. In any case, creating pages that react effectively to these necessities includes a difficult approach. In the development of web pages, graphic designers, softwarespecialist,end-users,corporatespecialist and individuals utilized in a wide range of territories are required to cooperate. Normally, the procedure begins with the counterfeit up the structure of the UI by the graphic designers, according to the necessities of the organization. Programming specialists composecodeforthewebsitepages dependent on these drafts. Sites made by organizations go forward for monetary explanations behind item showcasing or promoting purposes. The subsequent pages may change dependent on response got by the end users. This process includes lots of repetitive work. We rewrite the code for segments has the same functionsandcontinuousevolutionin page structure makes is a very boring process. Many startup companiescreatesoftwareprototypestoshowcasetheirideas and ensure investor support would likewise significantly profit by quickly application prototyping. Instead of wasting time and money on repeatedly designing and coding UI, a precise computerized approach would almost certainly be liked. This would permit small scale organizations to put more concentrate on components and esteem and less on making an interpretationofstructuresintousefulapplication code. Avoidthedisappointmentsthatfront-endengineersand originators face with building precise GUIs, this exposes that the need for more effective resolution in web pagedesigning. Designing of web page by automatic code generation is getting importance as a research topic. Automatic creationof web pages decreases programming time, process cost and source utilization. Because of the quicker dynamic plan arranges, the final site is delivered in a shorter time. In this study, an approach has been created to automatically produce the HTML code for the mockup of a website page. It is planned to perceive the parts made in themock-upimages and to encode them as indicated by the website page hierarchy. We can use the dataset from pix2code paper for generating bootstrap websites. By utilizing Twitter’s bootstrap, we can merge HTML and CSS and diminishing the size of the vocabulary. The deep neural network model including Convolution Neural Networks (CNN) is utilized to train the images present on the data sets. 2. LITERATURE REVIVEW An algorithm named Reverse Engineering Mobile Application User Interfaces (REMAUI) finds the elements of the UI a mobile application for example buttons, text-boxes and pictures, it makes the code for them from the screenshots of an application window [2]. It transforms to the code from the screen pictures or drawings for mobile platforms, PC vision and optical character acknowledgment techniques are utilized. Despite the fact that the REMAUI [2] strategy works effectively, it doesn’t assist cross-page change and animations inside the page. Creators builtupthe P2A [3] algorithm to cure the lacks of the REMAUI calculation. Creators built up the pix2code algorithm which expects to change over the graphical interface for a website page to
  • 2. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3566 structured code using deep learning with convolution and repetitive neural networks [4]. The algorithm Redraw takes mock-ups ofmobileapplication screens and makes an organized XML code for it [5]. In the first phase of their implementation PC visionproceduresare utilized to distinguish singular GUI components. The second phase includes the order of the recognized components as indicated by their function, for example, toggle button, text- area, etc. Right now, convolution neural networks are utilized. In the last phase, the XML code is produced by joining the K-Nearest Neighbour’s (KNN) algorithm as indicated by the web programming hierarchy. These days open source code libraries, for example, GitHub [1] are utilized very common to share code and applications. It is a typical practice to explore this repository and reuse code when beginning or improving programming ventures. The common codes in these libraries lessen a similar code being composed again and again by various individuals. The creators utilize an inquiry program called SUISE in which the clients characterize a graphical interface with straightforward drawingsandkeywords [6].Thisinterfaceis then finding in existing libraries to get comparative interfaces. These interfaces are transformed into operable codes and came back to the end client to choose the most reasonable interface. 3. PROPOSED SYSTEM Fig.1 System Architecture Producing computer code written in a given programming language from a mock-up image can be differ with the undertaking of creating English printed sketch from a scene photography. In the two situations, we need to deliver a variable-length series of tokens from pixel value. We would thus be able to partition our concern into threesub-issues. Initial, a computer vision issue of understanding the given scene (i.e. HTML picture) and inducing the item present, their personalities, positions, and poses (for example buttons,labels, component). Second, a language displaying issue of getting content (for example code) and producing grammaticallyandsemanticallyrightexamples.Thelasttest is to utilize the answers for both past sub-issues by exploit the inactive factors derived from scene comprehension to produce relating textualdescription(forexamplePCcodeas opposed to English) of the articles characterized by these factors. 3.1 Vision Model CNNs are presently thestrategyfordecisiontounderstanda wide scope of vision issues thanks to their topology permitting them to take in rich inactive presentations from the pictures theyare trained on [4]. Formappingamock-up to a learned fixed-length vector we used the CNN model to perform unsupervised learning. The input pictures are at first re-sized to 256 × 256 pixels and the pixel values are standardized before to be fed inthe CNN [4]. No more pre-processing is carried out. For encrypting every input picture into a fixed-size output vector [4], We just used a minimal 3 × 3 responsive field. These functions are applied twice before to down-sample along max-pooling,the range of thefirstconvolutionallayer is 32, trailed bya layer of width 64, lastly width128[4].Two completely associated layers of size 1024 applying the amended direct unit initiation complete the vision model. 3.2 Language Model Westructureda basiclightweight Domain-specific language (DSL) to represent HTML. Right now, are just inspired by the HTML format, the various graphical parts, and their connections; in this way the real printed estimation of the value is neglected. Also, to reduce size of the search space, the DSL integrity additionally diminishesthecapacityofthe vocabulary. As a result,alanguagemodelcanachievetoken- level language modeling with a discrete input by using one- hot encoded vectors; eliminating the need for word embedding techniquessuchasword2vec[10]thatcanbring in expensive computing. In maximum programming languages, a component is announced with an opening token; if child components are consisting inside a block, an end token is generally required the compiler. In such a situation where the number of children components contained in a parent component is variable, it is necessary to show long term conditions to have the option to close a block that has been opened. Hochreiter and Schmidhuber
  • 3. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3567 proposed the Long Short-Term Memory (LSTM) [7] neural engineering so as to address this very issue. 3.3 Decoder This model is trained in a supervised learning way by taking care of a picture I and a logical arrangement X of T tokens xt and the token xT as the target label. A CNN-based vision model encodes the input image I into a vectoral picture p. The information token xt is encoded by a LSTM-based language model into intermediary representation qt permitting the model to concentrate more on specific tokens and lesson others [8]. This first language model is carried out as a stack of two LSTM layers with 128 cells each. The vision- encoded vector p and the language encoded vector qt [4] are linked into isolated component vector rt which is thenfed into a second LSTM-based model extract the picture learned by both the vision model and the language model. The decoder in this manner figures out how to show the connection among objects present in the information HTML picture andthe related tokens present in the DSL code. We execute decoder as two LSTM layers having512cells,eachas a stack. The discrete idea of the output permits to lessen the task to a characterization issue. Specially, our model consists output model having an equivalent count of cells from vocabulary size; along these lines producing a probability appropriation of the competitor tokens at each time step permitting the utilization of a softmax layer to perform multi-class characterization. 3.4 Training Length T of the order utilized for training is main aim to model long term dependencies; for instance, as farashavean option to the closing of an opened block. The DSL input documents utilized for preparing were divided with asliding window of size 48.For each token in theinputDSLdocument, the model is along these lines taken care of with both an information picture and a relevant order of T = 48 tokens. Despite the fact that the context utilized for preparing is refreshed at each time step (for example every token) by sliding the window,exactlytheidenticalinformationpictureI is reused for tests related with a similar GUI. The special tokens <START >and <END > are utilized to separately prefix and addition the DSL records comparably to the strategy utilized by Karpathy and Fei-Fei [8]. 3.5 Sampling For producing DSL code, we provide the HTML picture I and a logical grouping X of T = 48 tokens where tokens xt. . . xt-1 are at first set vacant and the past token of the arrangement xt is set to the unique <START > token. The anticipated token yt is then used to refresh the following arrangement of relevant tokens. Simply put, xt . . . xT −1 as xt+1 . . .xT with xt as yt. The procedure is rehashed until the token <END > is produced by the model. The produced DSL token arrangement would then compile to generate desired output. 4. CONCLUSION Transformingwebsitesmock-upsintomark-upcodewithless time along with developmentcost has been a crucial point.In this paper, we developed an approach which accepts web pagemock-ups, process them and generatestructuredHTML code.Adatasetcomprisingpictures,includingdifferentmock- ups of web page structures were utilized. This dataset is used to train the CNN model. Although our work exhibits the capability of such a framework to automate the procedure of executing GUIs, we just started to expose what is feasible. Our model comprises of generally not many limitations and prepared on a comparable little dataset. The nature of the created code could be definitely enhanced via preparing a greater model on altogether more information for an increasing count of time- span. REFERENCES [1] Sketch2code. Microsoft AI Labs. [Online]. Available: https://github.com/Microsoft/ailab/tree/master/Sketch2Co de/model/images [2] T. A. Nguyen and C. Csallner, “Reverse Engineering Mobile Application User Interfaces with REMAUI (T),” in 2015 30th IEEE/ACM International Conference on Automated Software Engi- neering (ASE). IEEE, nov 2015, pp.248–259.[Online].Available: http://ieeexplore.ieee.org/document/7372013/ [3] S. Natarajan and C. Csallner, “P2A:AToolforConverting Pixels to Animated Mobile Application User Interfaces,” Proceedings of the5th International Conference onMobile Software Engineering and Systems -MOBILESoft ’18, pp. 224–235, 2018. [Online]. Available: http://dl.acm.org/citation.cfm?doid=3197231.3197249 [4] T. Beltramelli, “pix2code: Generating code from a graphical user inter- face screenshot,” CoRR, vol. abs/1705.07962,2017. [Online]. Available: http://arxiv.org/abs/1705.07962 [5] K. P. Moran, C. Bernal-Cardenas, M. Curcio, R. Bonett, and D. Poshy-´vanyk,“Machinelearning-based prototyping of graphical user interfaces for mobile apps,” IEEE Transactions on Software Engineering, pp. 1–1, 2018. [6] [Online]. Available: https://github.com/ [7] S. P. Reiss, Y. Miao, and Q. Xin, “Seeking the user interface,” Automated Software Engineer- ing, vol. 25, no. 1, pp. 157–193, mar 2018. [Online]. [7] S. Hochreiter and J. Schmidhuber. Long short-term memory. Neural computation, 9(8):1735– 1780, 1997. [8] A. Karpathy and L. Fei-Fei. Deep visual-semantic alignments for generating image descriptions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 3128–3137, 2015.
  • 4. International Research Journal of Engineering and Technology (IRJET) e-ISSN: 2395-0056 Volume: 07 Issue: 03 | Mar 2020 www.irjet.net p-ISSN: 2395-0072 © 2020, IRJET | Impact Factor value: 7.34 | ISO 9001:2008 Certified Journal | Page 3568 [9] A. Graves. Generating sequences with recurrent neural networks. arXiv preprint arXiv:1308.0850, 2013. [10] T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111–3119, 2013.