Pure client-side and pure server-side implementation solutions for HTML to PDF conversion

Pure client-side and pure server-side implementation solutions for HTML to PDF conversion

need

After the user fills out the form and clicks Save, they can directly download the PDF document.

Solution

Server generation

Ideas

Google Chrome developed the Chrome Headless feature on its own in 2017, and launched puppeteer at the same time. It can be understood as a browser without an interface but can complete server functions.

So we can start the puppeteer browser on the server, open the target URL, and use the built-in conversion function of the chrome browser to convert html to pdf.

Server generates core code

First, you need to install puppeteer. NPM installation may fail. It is best to use cnpm Taobao mirror to install it.

Enter cnpm i puppeteer -S to install dependencies.

Create a js file, just open the URL with puppeteer browser and save the pdf.

// html2pdf.js

const puppeteer = require('puppeteer');
(async function(){
    // Start the service const browser = await puppeteer.launch();
    // Open the tab const page = await browser.newPage();
    // Go to this address await page.goto('https://koa.bootcss.com/#context');
    // Convert html page to pdf and save to path
    await page.pdf({path:"test.pdf",format:'A4'})
    // Close the browser await browser.close();
})();

Then enter node html2pdf.js in the console to start the service.

Of course, you can also export module methods using module.export according to business logic.

shortcoming

Unable to save dynamic form data

Since the page is requested from the server, if the user input is not saved on the request address, the intercepted PDF will be the initial state of the page without being filled in.

In other words, it can only convert static pages, but our requirements require a lot of user input, so it passes.

Client generates core code

Ideas

  • Use html2canvas, input the DOM node that needs to be converted, and traverse it to convert it into a canvas
  • Convert the canvas to a base64 image, use jsPDF to create a PDF file, and insert the image into the PDF.

shortcoming

distortion.

We can clearly see that, since it is similar to taking a screenshot of the page and then inserting the screenshot into PDF, the resolution and configuration of the page are likely to affect the quality of the output image.

At the same time, because it is a screenshot, functions such as page links may be lost.

Text truncation

When the canvas is larger than the size of a PDF page, the output will be wrong. At this time, we need to determine whether the canvas exceeds the A4 size. If so, split the canvas and insert it into different pages.

Now the problem arises again. Since the image is segmented, it is very likely that the image or text will be cut off in half because we cannot analyze the structure of the item inside the canvas.

Core code

Our needs do not include images and links, so the distortion problem has little impact on us. At the same time, our form consists of multiple repeated items of equal length, and these items are very short and will not exceed an A4 paper (although this is not rigorous, if necessary, you can get the DOM element width and height and crop according to the DOM element height).

So I plan to split the canvas directly according to the item, and save each item on a page of A4 paper.

There are several core methods you need to understand before you get started:

html2canvas

   // DOM is the DOM node to be converted html2canvas(DOM,{
        backgroundColor:"#ffffff",
        width:width,
        height:height,
        scale:2,
        allowTaint:true,
    }).then((canvas)=>{
        // canvas is the canvas after successful conversion})

jsPDF

   // Create an instance let pdf = new jsPDF('','pt','a4');
    // Add the image to the pdf file // The first parameter is the file format (base64) to be inserted, the second is the file format // The third and fourth are the coordinates of the upper left corner of the image, and the last two are the width and height of the image after insertion pdf.addImage(image,'JPEG',10,10,height,width);
    // Add a new page pdf.addPage()
    // Save the pdf file pdf.save()

canvas

  // canvas is the image to be cropped // sx, sy are the coordinates for starting cropping // swidth, sHeight are the width and height of the cropping // dx, dy are the coordinates for inserting the cropped image into the canvas // sWidth, sHeight are the width and height of the cropped image in the canvas cxt.drawImage(canvas, sx, sy, sWidth, sHeight, dx, dy, sWidth, sHeight);
/**
 * @description: Convert form to pdf file* @return: pdf
 */
onSubmit(){
    // This is the form I want to convert, there are many identical forms in it let form = this.$refs.form;
    // Get the width and height of the element let width = form.getBoundingClientRect().width;
    let height = form.getBoundingClientRect().height;
    html2canvas(form,{
        backgroundColor:"#ffffff",
        width:width,
        height:height,
        scale:2,
        allowTaint:true,
    }).then((canvas)=>{
        let pdf = new jsPDF('','pt','a4');
        // Cut the picture let canvasList = this.splitCanvas(canvas,this.forms.length);

        // Traverse the canvas list and add an image to each page canvasList.forEach((item,index)=>{
            // Convert the image format to base64
            let itemImage = item.toDataURL('image/jpeg',1.0);
            // Reserve 10px margin, the width of A4 paper is 595px on a 72-resolution monitor
            pdf.addImage(itemImage,'JPEG',10,10,575.28,575.28/item.width*item.height);
            // If it is not the last page, paginate index == this.forms.length-1 ? '' : pdf.addPage();
        })
        // Save the file let blob = pdf.output('blob');
        
        pdf.save('test.pdf');
    })
},
/**
 * @description: Cut canvas * @param {number} num number of slices * @param {canvas} canvas 
 * @return {array} canvas list*/
splitCanvas(canvas,num){
    let height = canvas.height,width = canvas.width;
    let chunkHeight = height/num; // The height of each slice let chunkList = []; // Store the result canvas
    for(let i=0; i<height ; i+=chunkHeight){
        // Initialize the crop rectangle position let sx = 0,sy = i,sWidth = width,sHeight = chunkHeight,dx = 0, dy = 0;
        // Create a canvas node let canvasItem = document.createElement("canvas");
        // Initialize canvas size canvasItem.height = chunkHeight;
        canvasItem.width = width;
        let cxt = canvasItem.getContext("2d");
        // Put the cropped image into the new canvas node cxt.drawImage(canvas,sx,sy,sWidth,sHeight,dx,dy,sWidth,sHeight);
        chunkList.push(canvasItem); 
    }
    return chunkList;
},

Final result

Page after the form is saved

Effect of converting to pdf

This concludes this article on pure client-side and pure server-side implementation solutions for HTML to PDF. For more relevant HTML to PDF content, please search for previous articles on 123WORDPRESS.COM or continue to browse the related articles below. I hope you will support 123WORDPRESS.COM in the future!

<<:  Specific method to add foreign key constraints in mysql

>>:  Founder font library Chinese and English file name comparison table

Recommend

A detailed introduction to Linux memory management and addressing

Table of contents 1. Concept Memory management mo...

Practice of el-cascader cascade selector in elementui

Table of contents 1. Effect 2. Main code 1. Effec...

Detailed explanation of MySQL file storage

What is a file system We know that storage engine...

Calling Baidu Map to obtain longitude and latitude in Vue

In the project, it is necessary to obtain the lat...

HTML Tutorial: Collection of commonly used HTML tags (4)

Related articles: Beginners learn some HTML tags ...

Vue implements adding watermark to uploaded pictures

This article shares the specific implementation c...

Introduction to the process of creating TCP connection in Linux system

Table of contents Steps to create TCP in Linux Se...

MySQL insert json problem

MySQL 5.7.8 and later began to support a native J...

Table related arrangement and Javascript operation table, tr, td

Table property settings that work well: Copy code ...

Detailed usage of Vue timer

This article example shares the specific code of ...

Correct use of MySQL partition tables

Overview of MySQL Partitioned Tables We often enc...